The Year in Real-Time for Apache Pulsar and Streaming

What's this blog post about?

Apache Pulsar is a multi-tenant, high-performance messaging and streaming platform designed to manage billions of events in real time. It supports multiple clusters in a single Pulsar instance, built-in geo-replication of messages across clusters, low-latency transmission, and scaling to over a million topics. In 2022, the Pulsar community made significant progress, with more than 1,600 commits across five releases. The ecosystem is vibrant and includes projects like Starlight, which provides wire-level compatibility with traditional messaging APIs so that existing applications can take advantage of Pulsar's performance and scalability without being rewritten. Apache Pulsar Functions abstract away the details normally handled by dedicated stream processing engines, allowing for data transformations, dynamic routing, data enrichment, analytics, and more. DataStax also extended the Pulsar Functions framework with DataStax Pulsar Transformations, which allow advanced data manipulation without writing code. CDC for Astra DB, built on both Astra's modular, serverless foundation and Astra Streaming, enables real-time applications to subscribe to change events using client libraries in Java, Golang, Python, or Node.js.
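To make the Pulsar Functions idea concrete, here is a minimal sketch of a data-enrichment function in Python. It is not taken from the blog post. It assumes the Python runtime's plain-function style, in which a module-level `process` function receives each message payload as a string and its return value is published to the output topic. The event fields (`amount`, `amount_cents`, `tier`) and the threshold of 100 are hypothetical names and values chosen only for illustration.

```python
import json


def process(input):
    """Enrich one JSON event and return it as a JSON string.

    Assumption: the Pulsar Functions runtime calls this once per message
    and publishes the returned string to the function's output topic.
    """
    event = json.loads(input)

    # Hypothetical enrichment: derive two extra fields from "amount".
    amount = float(event["amount"])
    event["amount_cents"] = int(round(amount * 100))
    event["tier"] = "large" if amount >= 100 else "standard"

    return json.dumps(event)
```

Because the function has no dependency on the Pulsar client, it can be exercised locally by calling `process('{"amount": "12.5"}')` before it is deployed.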

Company
DataStax

Date published
Jan. 31, 2023

Author(s)
-

Word count
1388

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.