Company
Date Published
Author
Erich Ess
Word count
1049
Language
English
Hacker News points
None

Summary

Erich Ess, CTO of SimpleRelevance, discusses using Apache Spark with Cassandra for Extract, Transform, and Load (ETL) tasks. The article walks through parsing the MovieLens dataset, loading it into Cassandra, and then running simple analytics on the loaded data. It also highlights how caching in Spark can improve performance during ETL processes.