ClickPy reaches one trillion rows
ClickPy is a free service built on ClickHouse that enables real-time analytics on Python Package Index (PyPi) package downloads. The underlying data for projects and downloads is available in BigQuery, but it takes several hours to export the data. Therefore, the data was exported into Google Cloud Storage buckets as Parquet files. The service has been live for around 9 months, with the main table in the database exceeding 1 trillion rows. ClickPy's frontend is written in Next.JS and React, allowing users to explore their favorite packages.
Company
ClickHouse
Date published
Aug. 30, 2024
Author(s)
ClickHouse Team
Word count
1337
Hacker News points
None found.
Language
English