Company
Date Published
Author
Anais Dotis-Georgiou
Word count
2461
Language
English
Hacker News points
None

Summary

The Median Absolute Deviation (MAD) algorithm is a powerful anomaly detection technique that can be used to identify time series data points that are behaving differently from others. MAD is a widely used algorithm in DevOps Monitoring, where it enables Site Reliability Engineers (SREs) to quickly identify unhealthy containers, VMs, or servers and diagnose infrastructure problems. The algorithm works by calculating the median absolute deviation of each time series at a given timestamp, and then flagging points that have a large deviation from the median as anomalous. MAD is highly effective and efficient, but can be sensitive to anomalies, especially when dealing with large datasets. To mitigate false positives, SREs can use techniques such as adjusting the threshold value or grouping data by region. The MAD algorithm has been implemented in Flux, a powerful open-source query language for InfluxDB, which allows users to write custom anomaly detection algorithms and integrate them into their monitoring pipelines. By using MAD, organizations can improve the efficiency of their root cause analysis efforts, reduce mean time to resolution (MTTR), and honor Service Level Objectives (SLOs).