Monitor Apache Hive with Datadog
Apache Hive is an open-source interface that enables users to query and analyze distributed datasets using SQL commands. It compiles SQL commands into an execution plan, which it then runs against your Hadoop deployment. Datadog's integration allows you to monitor Hive metrics and logs in context with the rest of your big data infrastructure. You can optimize Hive memory usage by tracking client sessions alongside memory usage from two Hive components: HiveServer2 and the Metastore. Additionally, you can troubleshoot slow queries by tracking the time SQL operations spend in different states and investigate execution errors in context using Datadog's log processing pipeline. The integration also provides visibility across your distributed big data architecture, including technologies like AWS Elastic MapReduce and ZooKeeper.
Company
Datadog
Date published
July 29, 2019
Author(s)
Paul Gottschling
Word count
521
Hacker News points
None found.
Language
English