Using the Cassandra Bulk Loader, Updated
The text discusses the improvements made in Apache Cassandra's sstableloader tool, which is used for bulk loading data into a Cassandra cluster. It highlights that while usage of sstableloader has not changed much, it now requires specifying the address(es) of the node using the -d option. Additionally, the text introduces CQLSSTableWriter, a new feature in Apache Cassandra 2.1.0, which simplifies bulk loading data to complex schemas by allowing users to use DDL for table creation and INSERT statements for data insertion. Despite some limitations, such as lack of parallel processing and support for user-defined types, the tool has been improved over past releases, making it easier for users to experiment with generating SSTables and bulk loading data.
Company
DataStax
Date published
Sept. 26, 2014
Author(s)
Yuki Morishita
Word count
616
Language
English
Hacker News points
None found.