
New token allocation algorithm in Cassandra 3.0

What's this blog post about?

Token allocation in distributed databases like Cassandra is a hard problem: it must balance load across nodes while still allowing new nodes to be added without disrupting existing ones. A common solution has been to give each node a large number of randomly allocated token ranges (vnodes), but with this approach the disproportions in individual vnode ranges keep growing as more nodes are added. To address this, Cassandra 3.0 introduced a new token allocation algorithm that optimizes the distribution of replicated token ownership when a node bootstraps. Compared with random allocation, it achieves better load distribution and improved performance with fewer vnodes per node. The algorithm generates candidate tokens, evaluates the effect each would have on the overall replication topology, and selects the ones that most reduce the standard deviation of ownership across nodes.
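The candidate-and-evaluate idea can be illustrated with a small sketch. This is not Cassandra's implementation: it assumes a replication factor of 1, a tiny hypothetical token space (`RING`), candidates taken at the midpoint of each existing range, and a greedy choice of the candidate that gives the lowest standard deviation of per-node ownership. All function names are made up for illustration.

```python
import statistics

RING = 2 ** 16  # hypothetical small token space, for illustration only


def ownership(tokens):
    """Return node -> fraction of the ring owned (assumes replication factor 1).

    `tokens` maps token -> node. A token owns the range from the previous
    token (exclusive) up to itself (inclusive), wrapping around the ring.
    """
    owned = {}
    ordered = sorted(tokens)
    for i, tok in enumerate(ordered):
        prev = ordered[i - 1]  # for i == 0 this wraps to the last token
        span = (tok - prev) % RING or RING  # a lone token owns the whole ring
        owned[tokens[tok]] = owned.get(tokens[tok], 0.0) + span / RING
    return owned


def ownership_stdev(tokens):
    """Population standard deviation of per-node ownership."""
    return statistics.pstdev(list(ownership(tokens).values()))


def midpoint_candidates(tokens):
    """Candidate tokens: the midpoint of every existing range (an assumption)."""
    ordered = sorted(tokens)
    candidates = set()
    for i, tok in enumerate(ordered):
        prev = ordered[i - 1]
        span = (tok - prev) % RING or RING
        candidates.add((prev + span // 2) % RING)
    return sorted(candidates - set(tokens))  # sorted for deterministic ties


def add_node(tokens, node, num_tokens):
    """Greedily pick `num_tokens` tokens for `node`, one at a time.

    Each step tries every candidate and keeps the one that leaves the
    cluster with the lowest standard deviation of ownership.
    """
    tokens = dict(tokens)  # do not mutate the caller's ring
    for _ in range(num_tokens):
        best = min(
            midpoint_candidates(tokens),
            key=lambda c: ownership_stdev({**tokens, c: node}),
        )
        tokens[best] = node
    return tokens
```

For example, starting from a single node `"a"` holding token `0`, `add_node({0: "a"}, "b", 1)` places the new token at the midpoint of the ring, so each node owns half of it. A real allocator also has to account for replication, racks, and datacenters, which this sketch deliberately ignores.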

Company
DataStax

Date published
Jan. 26, 2016

Author(s)
Branimir Lambov

Word count
1034

Hacker News points
None found.

Language
English

