How Cloudflare analyzes 1M DNS queries per second
The text discusses the implementation of DNS analytics for all Cloudflare customers due to its massive scale. It outlines how logs are processed from the edge using structured log messages in Cap'n Proto format, Apache Kafka clusters, and stream processors generating aggregations. The author also explains why they chose not to store full DNS message payloads and instead logged only telemetry data such as response code, size, or query name. Furthermore, it details the use of ClickHouse for storing logs and how it handles data storage and indexing. It also discusses the integration with Superset and Grafana for visualization purposes and mentions that these tools are available in users' Cloudflare dashboards analytics. The text concludes by encouraging interested individuals to apply for positions at Cloudflare.
Company
Cloudflare
Date published
May 10, 2017
Author(s)
Marek VavruĊĦa
Word count
2441
Hacker News points
None found.
Language
English