/plushcap/analysis/datadog/datadog-watchdog-automated-root-cause-analysis

Automated root cause analysis with Watchdog RCA

What's this blog post about?

Watchdog, an AI-based anomaly detection tool, has introduced its Root Cause Analysis (RCA) feature to automatically identify causal relationships between symptoms across applications and infrastructure, pinpointing the root cause of issues. This hands-free approach significantly reduces mean time to resolution (MTTR). When Watchdog detects an anomaly, it creates a "story" with contextual information such as impacted services and users. The RCA feature maps applications and infrastructure, understanding how components typically interact, and identifies the probable root cause of issues. It also surfaces resulting critical failures. In addition to problematic code changes, Watchdog can detect other root causes like increased traffic from a client, AWS instance failure, or disk reaching its maximum capacity. The tool also provides impact analysis to help prioritize troubleshooting efforts and uses real user monitoring metrics for better visibility into end-user experience.

Company
Datadog

Date published
April 13, 2022

Author(s)
Brooke Chen, Othmane Abou-Amal

Word count
796

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.