Automated root cause analysis with Watchdog RCA
Watchdog, an AI-based anomaly detection tool, has introduced its Root Cause Analysis (RCA) feature to automatically identify causal relationships between symptoms across applications and infrastructure, pinpointing the root cause of issues. This hands-free approach significantly reduces mean time to resolution (MTTR). When Watchdog detects an anomaly, it creates a "story" with contextual information such as impacted services and users. The RCA feature maps applications and infrastructure, understanding how components typically interact, and identifies the probable root cause of issues. It also surfaces resulting critical failures. In addition to problematic code changes, Watchdog can detect other root causes like increased traffic from a client, AWS instance failure, or disk reaching its maximum capacity. The tool also provides impact analysis to help prioritize troubleshooting efforts and uses real user monitoring metrics for better visibility into end-user experience.
Company
Datadog
Date published
April 13, 2022
Author(s)
Brooke Chen, Othmane Abou-Amal
Word count
796
Hacker News points
None found.
Language
English