Six parameters to tune for cqlsh COPY FROM performance
This article discusses ways to improve performance when importing CSV data into Cassandra with cqlsh COPY FROM. Key suggestions include installing a Python driver built with the Cython and libev C extensions, setting the CQLSH_NO_BUNDLED environment variable so that cqlsh uses the installed driver instead of the bundled one, compiling the cqlsh copy module with Cython for a further boost, adjusting CPU scheduling on Linux systems, and experimenting with six COPY FROM parameters to find the best settings for a specific workload. Taken together, these changes can significantly improve import speed.
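As a rough illustration of the last two suggestions, the sketch below builds a COPY FROM statement with tunable options and an environment that sets CQLSH_NO_BUNDLED. The summary does not name the six parameters, so the option names used here (all documented cqlsh COPY FROM options) are an assumption about which ones are meant. The values, the table name `ks.events`, the file `events.csv`, and both helper functions are hypothetical placeholders, not recommendations.

```python
import os

# Assumed set of COPY FROM options to experiment with. The names are real
# cqlsh COPY FROM options; the values are placeholders, not tuned settings.
COPY_OPTIONS = {
    "NUMPROCESSES": 4,
    "CHUNKSIZE": 5000,
    "INGESTRATE": 100000,
    "MINBATCHSIZE": 10,
    "MAXBATCHSIZE": 20,
    "PREPAREDSTATEMENTS": True,
}


def build_copy_from(table, csv_path, options):
    """Render a cqlsh COPY FROM statement with the given options (hypothetical helper)."""
    parts = []
    for name, value in options.items():
        # Booleans are written as True/False; everything else via str().
        if isinstance(value, bool):
            rendered = "True" if value else "False"
        else:
            rendered = str(value)
        parts.append(f"{name} = {rendered}")
    return f"COPY {table} FROM '{csv_path}' WITH " + " AND ".join(parts) + ";"


def cqlsh_environment(base_env=None):
    """Return a copy of the environment with CQLSH_NO_BUNDLED set (hypothetical helper)."""
    env = dict(os.environ if base_env is None else base_env)
    # Asks cqlsh to use the installed Python driver rather than the bundled one.
    env["CQLSH_NO_BUNDLED"] = "TRUE"
    return env


statement = build_copy_from("ks.events", "events.csv", COPY_OPTIONS)
print(statement)
```

The printed statement could be passed to cqlsh (for example with its `-e` option) in a process started with the environment from `cqlsh_environment()`. Changing one option at a time and timing each run is a simple way to see which settings matter for a given data set.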
Company
DataStax
Date published
April 20, 2016
Author(s)
Stefania Alborghetti
Word count
750
Hacker News points
None found.
Language
English