/plushcap/analysis/aiven/how-to-build-a-pipeline-with-streamsets-and-aiven

How to build a data pipeline with StreamSets and Aiven

What's this blog post about?

The article discusses how to design and build a complete data architecture in minutes using free and open source tools, such as StreamSets' open source drag-and-drop data pipeline builder, Data Collector, along with Aiven's hosted and managed Apache Cassandra and OpenSearch. It explains how to create an end-to-end data pipeline by ingesting log data into OpenSearch and running analytics queries in the data stores. The process involves setting up a few things first, such as downloading sample data and GeoIP database, creating Docker volumes for use in Data Collector pipeline, and setting up Aiven for OpenSearch instance. It also explains how to create a pipeline that processes logs and stashes the results into an OpenSearch instance. The article concludes by discussing how StreamSets Data Collector and Aiven managed services can help build what took months in minutes without needing to burn time on day-to-day tasks.

Company
Aiven

Date published
Dec. 16, 2019

Author(s)
John Hammink

Word count
2238

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.