Company
Date Published
Author
Andrea Santurbano
Word count
1334
Language
English
Hacker News points
None

Summary

Apache Zeppelin is a web notebook with a pluggable architecture built on three pillars: the display system, the interpreter, and Helium modules. The display system renders output on the front-end, while an interpreter lets Zeppelin use a specific programming language or data processing back-end such as Spark, Pig, or Flink. The third pillar, Zeppelin Helium, is a plug-in system that lets users extend Zeppelin with new features and tools. CAPS (Cypher for Apache Spark) is a project that extends Spark so that users can create and query property graph data models on top of Spark. It has three main features: it is built on the Spark DataFrame API, it supports a subset of Cypher, and it supports a wide range of data sources. Interactive notebooks are rising in popularity and replacing PowerPoint presentations in many organizations because they cover common tasks in data-driven organizations, such as running code, exploring data, and presenting results, with real-time collaboration between users.