/plushcap/analysis/datadog/hive

Monitor Apache Hive with Datadog

What's this blog post about?

Apache Hive is an open-source interface that enables users to query and analyze distributed datasets using SQL commands. It compiles SQL commands into an execution plan, which it then runs against your Hadoop deployment. Datadog's integration allows you to monitor Hive metrics and logs in context with the rest of your big data infrastructure. You can optimize Hive memory usage by tracking client sessions alongside memory usage from two Hive components: HiveServer2 and the Metastore. Additionally, you can troubleshoot slow queries by tracking the time SQL operations spend in different states and investigate execution errors in context using Datadog's log processing pipeline. The integration also provides visibility across your distributed big data architecture, including technologies like AWS Elastic MapReduce and ZooKeeper.

Company
Datadog

Date published
July 29, 2019

Author(s)
Paul Gottschling

Word count
521

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.