
DataStax Enterprise GraphFrames: Best Practices

What's this blog post about?

This guide presents best practices for loading data with the DseGraphFrame package, which provides a Spark API for bulk operations and analytics on DataStax Graph. The package can read DataStax Graph data into a GraphFrame and write GraphFrames built from any Spark-supported format into DataStax Graph. Key points include handling null values during updates, managing caching levels, indexing with Materialized Views, updating vertices and edges, and tuning parameters to improve write performance during bulk loading. Users can also specify which host a DseGraphFrame object connects to, which makes it possible to read graph contents from one cluster and write them to another.

Company
DataStax

Date published
Nov. 1, 2019

Author(s)
Rocco Varela

Word count
1759

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.