/plushcap/analysis/datadog/engineering-robust-statistical-distances-for-machine-learning

Robust Statistical Distances for Machine Learning

What's this blog post about?

This post discusses the use of statistical distances in comparing distributions and detecting anomalies. Statistical distances are distances between distributions or samples, which can be used in various machine learning applications such as anomaly detection, ordinal regression, and generative adversarial networks (GANs). The article covers visual inspection methods like Q-Q plots and statistical distance measures including the Kolmogorov-Smirnov Distance, Earth Mover's Distance, and Cramér-von Mises Distance. It also highlights the properties of each distance measure and provides an interactive tool for comparing distributions. The post concludes by noting that these methods have their place in different scenarios and can be used effectively to detect anomalies or outliers.

Company
Datadog

Date published
Sept. 6, 2017

Author(s)
Charles Masson

Word count
1800

Hacker News points
16

Language
English


By Matt Makai. 2021-2024.