/plushcap/analysis/datastax/datastax-what-persistence-and-why-does-it-matter

What is data persistence & why does it matter?

What's this blog post about?

Persistence in data storage refers to the continuance of an effect after its cause has been removed, meaning that data survives even after the process that created it has ended. This is achieved by writing data to non-volatile storage. There are four main design approaches for data stores: in-memory, update-in-place, commitlog-based, and snapshot-based systems. In-memory systems provide high speed but limited data sets and lack persistence. Snapshot-based systems offer periodic snapshots to disk but can lose updates within the interval. Commitlog-based systems store data immediately and provide durability with every write. Cassandra implements a commit-log based persistence design, allowing for tunable levels of durability to balance safety and performance. This makes it a logical choice for systems requiring durable, performant data storage.

Company
DataStax

Date published
Oct. 22, 2010

Author(s)
Matt Pfeil

Word count
445

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.