/plushcap/analysis/clickhouse/clickhouse-clickpy-one-trillion-rows

ClickPy reaches one trillion rows

What's this blog post about?

ClickPy is a free service built on ClickHouse that enables real-time analytics on Python Package Index (PyPi) package downloads. The underlying data for projects and downloads is available in BigQuery, but it takes several hours to export the data. Therefore, the data was exported into Google Cloud Storage buckets as Parquet files. The service has been live for around 9 months, with the main table in the database exceeding 1 trillion rows. ClickPy's frontend is written in Next.JS and React, allowing users to explore their favorite packages.

Company
ClickHouse

Date published
Aug. 30, 2024

Author(s)
ClickHouse Team

Word count
1337

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.