New token allocation algorithm in Cassandra 3.0
Token allocation in a distributed database such as Cassandra has to balance two goals: distributing load evenly and allowing new nodes to be added without disrupting existing ones. A common solution has been to give each node a large number of randomly allocated token ranges (vnodes), but this approach can lead to disproportionately sized individual vnode ranges as more nodes are added. To address this, Cassandra 3.0 introduced a new token allocation algorithm that optimizes the distribution of replicated token ownership during bootstrap. Compared with random allocation, it gives better load distribution and improved performance with fewer vnodes per node. The algorithm generates candidate tokens and evaluates the effect of each on the overall replication topology, aiming to reduce the standard deviation of ownership across nodes.
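The candidate-and-evaluate idea can be illustrated with a small sketch. This is not Cassandra's implementation: it assumes a replication factor of 1 (so only primary ownership counts), represents tokens as floats on a ring [0, 1) rather than in the partitioner's token range, and takes the midpoints of existing ranges as candidates. The function names `ownership`, `spread`, `midpoints` and `allocate` are hypothetical, and the ring is assumed to be non-empty.

```python
import statistics


def ownership(ring):
    """Fraction of the [0, 1) ring owned by each node (replication factor 1 assumed).

    `ring` maps token -> node; a token owns the range (previous token, token].
    """
    tokens = sorted(ring)
    owned = {}
    for i, tok in enumerate(tokens):
        prev = tokens[i - 1]  # index -1 wraps around to the last token
        width = (tok - prev) % 1.0 or 1.0  # a lone token owns the whole ring
        owned[ring[tok]] = owned.get(ring[tok], 0.0) + width
    return owned


def spread(ring, nodes):
    """Population standard deviation of ownership over the given nodes."""
    owned = ownership(ring)
    return statistics.pstdev([owned.get(n, 0.0) for n in nodes])


def midpoints(ring):
    """Candidate tokens: the midpoint of every existing range (an assumption)."""
    tokens = sorted(ring)
    mids = []
    for i, tok in enumerate(tokens):
        prev = tokens[i - 1]
        width = (tok - prev) % 1.0 or 1.0
        mids.append((prev + width / 2) % 1.0)
    return mids


def allocate(ring, new_node, num_tokens):
    """Greedily pick tokens for `new_node`, one at a time.

    Each step evaluates every candidate and keeps the one that leaves the
    smallest standard deviation of ownership. The input ring is not modified.
    """
    ring = dict(ring)
    nodes = sorted(set(ring.values()) | {new_node})
    for _ in range(num_tokens):
        best = min(
            midpoints(ring),
            key=lambda t: spread({**ring, t: new_node}, nodes),
        )
        ring[best] = new_node
    return ring


if __name__ == "__main__":
    existing = {0.0: "a", 0.5: "b"}
    updated = allocate(existing, "c", num_tokens=2)
    print(ownership(updated))
```

The greedy, one-token-at-a-time choice keeps the sketch short. A replication-aware version would also have to account for how each new token changes the replicas of the preceding ranges, which this sketch ignores.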
Company
DataStax
Date published
Jan. 26, 2016
Author(s)
Branimir Lambov
Word count
1034
Hacker News points
None found.
Language
English