Database Case Studies

LSM-Tree and SSTables

Writes go to memtable in-memory (could be balanced BST or SkipList etc.)
Flushed to disk when it gets too big
Merging and Compaction happens in the background
Pros
- Faster writes to Memory
Cons
- Reads maybe slow because need to search for many SSTables for value of a key

Single leader
- Pros: No Data Conflicts
- Cons: All writes go to one node
Multi-Leader
- Pros: Increase Write throughput across multiple nodes
- Cons: Data Conflicts

Document DB
Better Data Locality
Uses B-Trees
Supports distributed Tx supported
Useful for:
- Nothing special since you can also use Traditional SQL DB
- Might be useful if you want ACID in data which is more flexible

Wide Column DB, Row oriented
Inspired by Dynamo+Bigtable
Has a shard key and sort key
Flexible Schema, Ease of Partitioning
Uses LSM Tree and SSTable
- Super Fast writes
Configurable Replication
- Multi-Leader
- Leaderless
Causes write conflicts
Only supports LWW conflict resolution
Has very efficient CQL, subset of SQL with No Joins
Useful for:
- High Write Volume (Millions of Writes per second)
- Consistency is not important, it is Tunable also
- All reads and writes go to same shard (No Tx)
- Efficient Range Queries
Examples:
- Chat application with sharding key as chatID, sort key as timestamp
- Time Series and Analytics Data

https://stackoverflow.com/questions/75212982/how-is-cassandra-column-oriented-and-at-the-same-time-suitable-for-oltp
https://www.youtube.com/watch?v=6bxin9cZL_w
Cassandra is NOT column oriented Database
Cassandra is wide column or column family DB
Apple has a Cassandra cluster with over 75k nodes and 10 petabytes of data
Cassandra is very efficient in write queries
Cassandra Query Language (CQL) is not as good as SQL
- https://stackoverflow.com/questions/11154547/cassandra-cql-nosql-or-sql
- No joins or subqueries
- No transactions
- Except for the primary key, you can only apply a WHERE condition on a column if you have created an index on that column.
- Keyspace ⇒ like Database
- Column Family ⇒ like Table
- Primary Key ⇒ Partition key + Cluster Key
Partitioning
- using cluster key
- Hash Ring
- uses gossip protocol
- All reads and writes should go to one partition
- very little support for distributed transactions
Replication
- Leaderless
- Read Repair
- Anti entropy
- Quorums, configurable, hence consistency configurable
Write conflicts
- Last Write Wins (LWW)
Riak also alternative to Cassandra
- Riak supports CRDT (Conflict free Replicated Datatypes)
Cassandra Single Node
- LSM Tree + SSTable — write optimized
- only row level locking in transactions
Very poor data guarantees
- uses LWW
Uses
- Chat Application, cluster key = chatId

Key Value DB but not in-memory
Inspired by Dynamo
Uses LSM Tree and SSTable
- Super Fast writes
Configurable Replication
- Multi-Leader
- Leaderless
Causes write conflicts
Supports CRDTs and captures causal relationship
Has version vectors
Useful for
- Fast Writes
- Simple Queries, Key-lookups
- More than LWW conflict resolution
Examples
- Session Storage
- Tokens

Key Value Store, also supports Document data model
Scale: More than 1/2 million RPS over hundreds of Tables, So choose when you really need this scale
Uses primary key and sort key which can each have only one attribute compared to Cassandra which can have multiple columns
2007: Dynamo paper comes out, Dynamo originally built for cart service and highly available writes and was self managed
While Amazon DynamoDB is AWS managed cloud service
It is “infinitely scalable”
Unlike Dynamo, it uses Paxos over its replicas
It is multi-tenant
References
- Architecture choices: https://www.youtube.com/watch?v=csvPepC6tKk
- Deep Dive into Architecture: https://www.youtube.com/watch?v=Qzs8mU5dgx4
- Jordan has no life: https://www.youtube.com/watch?v=FeFYLKJQxTs
- https://www.scylladb.com/learn/dynamodb/introduction-to-dynamodb/comparison/