Company
Date Published
Sept. 29, 2014
Author
Artem Aliev
Word count
1082
Language
English
Hacker News points
None

Summary

DataStax Enterprise (DSE) version 4.7 officially supports Apache Spark™ MLlib, so users can run in-memory analytics on the Spark engine integrated into DSE. The integration enables real-time analytics and makes starting a Spark cluster simple. The article shows how to use the Naive Bayes algorithm with Spark and Cassandra to build a classifier for the Iris flower data set, demonstrating fundamental techniques that also apply at scale.
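
To make the classification step concrete, here is a minimal sketch of multinomial Naive Bayes with additive smoothing, written in plain Python. It is not the Spark MLlib or Cassandra API and is not taken from the article; the function names `train` and `predict`, the `smoothing` parameter, and the choice of six training rows are assumptions made for illustration. The measurements are rows from the public Iris data set (sepal length, sepal width, petal length, petal width, in cm).

```python
import math

# A handful of rows from the Iris data set, two per species.
# Features: sepal length, sepal width, petal length, petal width (cm).
SAMPLES = [
    ((5.1, 3.5, 1.4, 0.2), "setosa"),
    ((4.9, 3.0, 1.4, 0.2), "setosa"),
    ((7.0, 3.2, 4.7, 1.4), "versicolor"),
    ((6.4, 3.2, 4.5, 1.5), "versicolor"),
    ((6.3, 3.3, 6.0, 2.5), "virginica"),
    ((5.8, 2.7, 5.1, 1.9), "virginica"),
]


def train(samples, smoothing=1.0):
    """Fit a multinomial Naive Bayes model (hypothetical helper).

    Returns {label: (log_prior, [log_theta per feature])}.
    """
    class_counts = {}
    feature_sums = {}
    for features, label in samples:
        class_counts[label] = class_counts.get(label, 0) + 1
        sums = feature_sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            sums[i] += value

    total = len(samples)
    model = {}
    for label, sums in feature_sums.items():
        # Additive smoothing keeps every feature probability above zero.
        denom = sum(sums) + smoothing * len(sums)
        log_prior = math.log(class_counts[label] / total)
        log_theta = [math.log((s + smoothing) / denom) for s in sums]
        model[label] = (log_prior, log_theta)
    return model


def predict(model, features):
    """Return the label with the highest log-posterior score."""
    def score(label):
        log_prior, log_theta = model[label]
        return log_prior + sum(v * t for v, t in zip(features, log_theta))

    return max(model, key=score)


model = train(SAMPLES)
# A setosa row that was not part of the training samples.
print(predict(model, (5.0, 3.6, 1.4, 0.2)))
```

In a Spark and Cassandra setting the training rows would be read from a table and the model fit in parallel across the cluster, but the per-class statistics being computed are of the same kind as in this sketch.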