DataStax Enterprise GraphFrames: Best Practices
This guide provides best practices for loading data using the DseGraphFrame package, which offers a Spark API for bulk operations and analytics on DataStax Graph. The package supports reading DataStax Graph data into a GraphFrame and writing GraphFrames from any format supported by Spark into DataStax Graph. Key points include handling null values during updates, managing caching levels, indexing with Materialized Views, updating vertices and edges, and tuning parameters for improved write performance during bulk loading. Additionally, users can specify which host a DseGraphFrame object should connect with to read graph contents from one cluster and write to another.
Company
DataStax
Date published
Nov. 1, 2019
Author(s)
Rocco Varela
Word count
1759
Hacker News points
None found.
Language
English