Learn More
Learn More
Learn More
Learn More
Learn More
Learn More
Learn More
Learn More
Introduction to NoSQL - Martin Fowler's crash course on NoSQL foundations
CMU Database Group's YouTube Channel - Andy Pavlo offers free online access to his much-talked-about CMU courses, which range from intro to advanced topics
Avoiding Data Hotspots at Scale - Konstantine Osipov (ScyllaDB) addresses the tradeoffs between hash and range-based sharding
Building State-of-the-Art LSM-tree compaction - Raphael Carvalho (ScyllaDB) on using RUM conjecture and controller theory to engineer state-of-art LSM-tree compaction
Scaling Raft - Konstantine Osipov (ScyllaDB) on the challenges of supporting hundreds of instances of the Raft protocol on a single node and how to significantly reduce the static burden on the network and disk each Raft group creates
Consistency Tradeoffs in Modern Distributed Database System Design - Dr. Daniel Abadi (UMD) introduces the PACELC theorem: In case of network partitioning (P) in a distributed computer system, one has to choose between availability (A) and consistency (C) (as per the CAP theorem), but else (E), even when the system is running normally in the absence of partitions, one has to choose between latency (L) and consistency (C)
Leveraging Consistent Hashing in your Python Applications - Alexys Jacob (Numberly) introduces consistent hashing, and the problems it solves, by walking through a practical use case in a Python application
You're Already Eventually Consistent - Jon Haddad (Rustyrazorblade) clears up some misconceptions about eventual consistency
Inconsistent Thoughts on Database Consistency - Alex Debrie (DeBrie Advisory) discusses the various definitions of the word "consistency" that are used in the distributed databases space then shares some of his issues with conversations about consistency
Understanding Eventual Consistency in DynamoDB - Alex Debrie (DeBrie Advisory) dispels some of the fear around eventual consistency in DynamoDB
Getting data modeling right is vital for database performance, but it's a massive topic that varies significantly across database models and types. We reference data modeling throughout the book, but did not feel that adding a dedicated chapter on data modeling would be feasible or adequate. Here are some resources that focus on the topic:
NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence - A book by Pramod J. Sadalage and Martin Fowler
Data Modeling Concepts and Principles - Pascal Desmarets (Hackolade) discusses the foundations of NoSQL data modeling
NoSQL Data Modeling 101 - Tzach Livatan (ScyllaDB) covers NoSQL vs SQL data modeling and how to get started with NoSQL (primary key selection, application query analysis, & more)
Top NoSQL Data Modeling Mistakes for Performance - Felipe Cardeneti Mendes (ScyllaDB) on how to avoid the top data modeling mistakes that impact performance (access the demonstrated "bad practices" in this GitHub repo)
Indexes, Filters, and Other Animals - Piotr Sarna (Turso) on the performance impacts of global/local secondary indexes and filtering
NoSQL Data Modeling Mistakes that Ruin Performance - Felipe Cardeneti Mendes explores three of the most common ways to ruin your NoSQL database performance, along with tips on how to avoid or resolve them
The Myth of Schema-less - Jon Haddad (Rustyrazorblade) debunking the myth of schemaless databases
Everything You Need to Know about DynamoDB Partitions - Alex Debrie (DeBrie Advisory) takes a deep look at DynamoDB partitions -- what they are, why they matter, and how they should affect your data modeling
The What, Why, and When of Single-Table Design with DynamoDB - Alex Debrie (DeBrie Advisory) provides a deep dive on the concepts behind DynamoDB's single-table design
Thanks Cynthia, Felipe, Piotr, and Pavel at scylladb for this awesome article.