The BlueSky social network's API has led to a surge in popularity, resulting in a high-volume stream of JSON events per second. To address the challenges of processing this data, a Medallion architecture is proposed, consisting of three layers: Bronze, Silver, and Gold. The Bronze layer processes raw data from the BlueSky API, filtering out malformed or incorrect timestamps and duplicates. It uses ClickHouse's new JSON type to efficiently store semi-structured data. The Silver layer transforms the data further, performing more filtering, standardizing schemas, and ensuring all duplicates are removed. It introduces deduplication windows to minimize delays. The Gold layer provides a final, query-ready dataset for analytics. Throughout the architecture, materialized views and incremental updates enable efficient querying and data processing. The proposed system tackles common challenges in handling semi-structured data, such as malformed timestamps, inconsistencies in structure, and high duplication rates.