What is data persistence & why does it matter?

Company

DataStax

Date Published

Oct. 22, 2010

Author

Matt Pfeil

Word count

445

Language

English

Hacker News points

None

URL

www.datastax.com/blog/what-persistence-and-why-does-it-matter

Summary

Persistence in data storage refers to the continuance of an effect after its cause has been removed, meaning that data survives even after the process that created it has ended. This is achieved by writing data to non-volatile storage. There are four main design approaches for data stores: in-memory, update-in-place, commitlog-based, and snapshot-based systems. In-memory systems provide high speed but limited data sets and lack persistence. Snapshot-based systems offer periodic snapshots to disk but can lose updates within the interval. Commitlog-based systems store data immediately and provide durability with every write. Cassandra implements a commit-log based persistence design, allowing for tunable levels of durability to balance safety and performance. This makes it a logical choice for systems requiring durable, performant data storage.