/plushcap/analysis/cloudflare/more-data-more-data

More data, more data

What's this blog post about?

CloudFlare's Data Team reports on their edge network logs processing, detailing the numbers from an ordinary day in July 2016. They handle nearly 360TB of raw, Cap’n Proto event logs daily, using two Kafka clusters and a significant amount of hardware for data consolidation. The team highlights what has worked well in their system, such as the log forwarder, Kafka, persistence for log sharing, CitusDB, and the platform and SRE. They also discuss areas they want to improve or add in the near future, including making services more reliable, new analytics, new data pipelines, and better support for complex analysis. The Data Team invites feedback and comments on their work.

Company
Cloudflare

Date published
July 12, 2016

Author(s)
Hunter Blanks

Word count
1241

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.