/plushcap/analysis/confluent/confluent-skai-cloud-migration-topic-reduction

Migrating Kafka to the Cloud: How Skai Went From 90K to 1.8K Topics

What's this blog post about?

We managed an online advertising campaign using an interactive tool that let us pivot, analyze, filter, and display performance across all channels. However, as we added features and accrued more data, our UI updates became slower due to the relational databases' inability to scale. We investigated change-data-capture replication with fast in-memory databases using Apache Kafka and Debezium connector. Our initial architecture improved performance but had limitations, including high maintenance costs and inefficient utilization. To address these challenges, we consolidated tables into shared topics and pipelines, reducing the number of topics from 90K to 1.8K. We also implemented a new topic type for heavy-load tables with six partitions and a caching mechanism to reduce API calls. Our re-architecture reduced topics by over 90% without sacrificing performance, freeing up brokers and reducing maintenance time and cloud budget.

Company
Confluent

Date published
Aug. 12, 2024

Author(s)
Matt Mangia, Gil Friedlis, Robbie Palmer

Word count
1851

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.