The current data and event ecosystem
This post surveys the components of modern event-driven data pipelines, which businesses rely on to answer specific questions using massive amounts of data from many sources. The author breaks the pipeline into four stages: Ingestion, Transport, Storage and Management, and Processing and Visualizing. Tools mentioned include Embulk, StreamSets, Fluentd, Apache Sqoop, Flume, Spark, Apache Kafka, Amazon Kinesis, PostgreSQL, Redis, Cassandra, InfluxDB, and Grafana. The author also points to resources for learning more about data pipelines and related technologies.
Company
Aiven
Date published
Sept. 6, 2018
Author(s)
John Hammink
Word count
2148
Language
English
Hacker News points
None found.