Is this a video course?

No. This is an interactive, slide-based learning platform. Each lesson has rich text, animated diagrams, live code editors, and quizzes. You learn by reading, interacting, and doing, not by watching videos passively.

How long do I have access?

Forever. Both pricing tiers are one-time payments with lifetime access. This includes all 766 current lessons and any future content we add.

What level of experience do I need?

None. We start from absolute basics like 'What is latency?' and build up to distributed consensus protocols. The Foundation level assumes zero prior knowledge of system design.

How much does the system design course cost?

5 US dollars for lifetime access globally, or 299 Indian rupees for lifetime access in India. One-time payment, no subscription, no hidden fees. 11 lessons are free with no signup required.

What technologies are covered?

Everything from DNS and load balancers to Kubernetes, Kafka, distributed databases, consensus protocols, stream processing, security architecture, and observability. We cover principles and real-world implementations used at Netflix, Google, Amazon, Uber, Stripe, and more.

Is this useful for system design interview preparation?

Yes. The lessons are structured around the topics asked in system design interviews at FAANG and other top-tier companies. Interactive diagrams help you practice whiteboard-style explanations. The lessons cover everything from URL shortener design to distributed payment systems.

How is this different from ByteByteGo or Educative?

You get 766 interactive lessons (4x more than most competitors), 16 different diagram types that build step by step, and real production examples from Netflix, Google, Amazon, Uber, and Stripe. You also get lifetime access for a one-time payment of 5 dollars instead of an annual subscription costing 100 to 200 dollars per year.

What is the difference between linearizability and serializability?

They sound similar but constrain different things. Linearizability is about single operations on a single object across replicas: once a write is visible, every later read everywhere sees it, in real-time order. Serializability is about whole transactions over many objects: the concurrent result must equal some serial order, but that order need not match real time. A system can be one without the other. The combination, often called strict serializability, is the strongest guarantee and is what systems like Spanner provide.

Is eventual consistency just a polite way of saying the data is wrong?

No. Eventual consistency promises that if writes stop, all replicas will converge to the same value. The data is not wrong, it is temporarily out of sync, and the window is usually milliseconds. The real question is whether your feature can tolerate that window. A view counter can; a bank balance cannot. Eventual consistency is not broken. The mistake is using it for something that needs an invariant enforced at write time.

When should I use a CRDT instead of last-write-wins?

Use last-write-wins only when losing a concurrent update is acceptable, such as a user-profile field where the most recent edit should win outright. Use a CRDT when concurrent updates each carry intent you must not drop, such as items in a shopping cart, a collaborative document, or a distributed counter. CRDTs guarantee that replicas converge to the same result regardless of the order in which they merge updates, so you avoid both lost writes and the need for a central coordinator to order edits.
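To make the difference concrete, here is a minimal sketch that merges the same two concurrent cart edits with both strategies. The class names LWWRegister and GSet are illustrative, the integer timestamps are assumed to come from some clock the replicas share, and a grow-only set is the simplest CRDT rather than a full cart implementation, since it cannot remove items.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LWWRegister:
    """Whole value plus a timestamp; merge keeps the newer write."""
    value: frozenset
    timestamp: int

    def merge(self, other: "LWWRegister") -> "LWWRegister":
        # Ties are not handled here; a real register needs a tie-breaker.
        return self if self.timestamp >= other.timestamp else other


@dataclass(frozen=True)
class GSet:
    """Grow-only set CRDT; merge is set union."""
    items: frozenset = frozenset()

    def add(self, item: str) -> "GSet":
        return GSet(self.items | {item})

    def merge(self, other: "GSet") -> "GSet":
        return GSet(self.items | other.items)


# Two replicas start from a cart holding "book" and each add one item.
lww_a = LWWRegister(frozenset({"book", "pen"}), timestamp=2)
lww_b = LWWRegister(frozenset({"book", "lamp"}), timestamp=3)
lww_merged = lww_a.merge(lww_b)      # the "pen" addition is silently dropped

base = GSet().add("book")
gset_a = base.add("pen")
gset_b = base.add("lamp")
gset_merged = gset_a.merge(gset_b)   # both additions survive
```

Because union is commutative, associative, and idempotent, the replicas can exchange state in any order, any number of times, and still agree.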

What anomaly does snapshot isolation allow that serializability prevents?

Write skew. Two transactions each read the same shared state, each independently decide their write keeps an invariant intact, and they commit without conflicting on any single row. Together they violate an invariant neither broke alone, for example two on-call engineers each marking themselves off-duty because each saw the other still on. If your invariant spans rows that no single transaction writes, snapshot isolation will not protect it and you need serializable isolation or an explicit lock.
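The on-call example can be simulated in a few lines. This is a toy model, not a real database engine: snapshots are plain dict copies, and commit_all is a hypothetical helper that applies the "abort only on overlapping writes" rule to a batch of transactions that all ran concurrently.

```python
def go_off_call(snapshot, me):
    """Return this transaction's write set, decided from its own snapshot."""
    others_on_call = sum(on for name, on in snapshot.items() if name != me)
    # Invariant the application wants: at least one engineer stays on call.
    return {me: False} if others_on_call >= 1 else {}


def commit_all(db, write_sets):
    """Commit concurrent transactions; abort only if write sets overlap."""
    committed_keys = set()
    for writes in write_sets:
        if committed_keys & set(writes):
            continue  # write-write conflict: this transaction aborts
        db.update(writes)
        committed_keys |= set(writes)
    return db


start = {"alice": True, "bob": True}

# Snapshot isolation: both transactions read the same starting snapshot.
snapshot_result = commit_all(
    dict(start),
    [go_off_call(dict(start), "alice"), go_off_call(dict(start), "bob")],
)

# Serial execution: the second transaction sees the first one's write.
serial_result = dict(start)
serial_result.update(go_off_call(serial_result, "alice"))
serial_result.update(go_off_call(serial_result, "bob"))
```

Under the snapshot rule the write sets touch different rows, so both commit and nobody is left on call. Run one after the other, the second transaction sees one engineer already off and writes nothing.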

Why do strong consistency models hurt latency and availability?

Strong models require nodes to coordinate before responding, so an operation cannot complete until enough replicas agree, which means cross-node round trips on the critical path and higher tail latency. During a network partition, if the required replicas cannot be reached, the system must refuse the operation rather than risk showing divergent values. That is the CAP trade-off: under partition you pick consistency or availability, and strong models pick consistency.

How do write-ahead logging and point-in-time recovery relate to consistency?

Consistency models govern what readers see during normal operation; durability machinery governs what survives a crash. Write-ahead logging records each change's intent before applying it, so after a failure the database can replay or undo partial work and come back in a consistent state. Point-in-time recovery combines a base snapshot with the archived log to rebuild the database as of any chosen moment, which is how an accidental delete or a bad deploy becomes recoverable instead of permanent.
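As a rough sketch of point-in-time recovery, the function below starts from a base snapshot and replays archived log records up to a target time. The record layout (timestamp, operation, key, value) and the function name restore_to are assumptions made for illustration; real databases log at a much lower level.

```python
def restore_to(base_snapshot, snapshot_time, archived_log, target_time):
    """Rebuild state as of target_time from a snapshot plus an ordered log."""
    state = dict(base_snapshot)
    for timestamp, op, key, value in archived_log:  # assumed sorted by time
        if timestamp <= snapshot_time:
            continue  # already reflected in the base snapshot
        if timestamp > target_time:
            break     # stop just before the moment we want to undo
        if op == "set":
            state[key] = value
        elif op == "delete":
            state.pop(key, None)
    return state


snapshot = {"users": 3}
log = [
    (110, "set", "orders", 5),
    (120, "delete", "users", None),   # the accidental delete
    (130, "set", "orders", 6),
]

before_delete = restore_to(snapshot, 100, log, target_time=115)
latest = restore_to(snapshot, 100, log, target_time=130)
```

Choosing a target time just before the bad operation yields a state that still contains the deleted data, at the cost of also losing the legitimate changes made after that moment.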

advanced

Consistency Models

Two people split a bill. One taps "pay," sees the balance drop, then refreshes and sees the old balance again. Money looks like it moved and then un-moved. Nothing was actually lost, but the system showed two truths a second apart, and now a customer is filing a support ticket. That gap between "what one node thinks is true" and "what another node shows" is the entire subject of consistency models. The moment your data lives on more than one machine, you have to decide exactly which version of reality a reader is allowed to see.

A consistency model is the contract your storage layer makes about the order and visibility of reads and writes. Pick a strong model and every reader sees a single, agreed-upon history, but you pay in latency and availability. Pick a weak model and you get speed and the ability to keep serving during a network partition, but you accept that two readers can briefly disagree. This category walks the full ladder, from the strongest guarantee (linearizability) down through sequential, causal, and the session guarantees, then into the database-transaction side (snapshot isolation, serializability) and the conflict-free approaches (CRDTs, operational transformation) that let collaborative apps merge edits without a coordinator. It also covers the durability machinery that makes any of these guarantees survivable: write-ahead logging, checkpointing, snapshotting, and point-in-time recovery.

Consistency Models: the landscape

What a Consistency Model Actually Promises

A consistency model is a precise statement about what a reader is allowed to observe given the writes that have happened. It is not about whether data is correct. It is about ordering and visibility. The strongest model, linearizability, says every operation appears to take effect at a single instant between its start and finish, so once any reader sees a new value, no reader can ever see the old one again. The system behaves as if there were one copy of the data and one global clock, even though there are many copies spread across machines.

Sequential consistency relaxes the global-clock part. All operations still appear in some single total order that every process agrees on, and each process's own operations keep their program order, but that order does not have to match real wall-clock time. Causal consistency relaxes further: it only guarantees that operations which are causally related (a reply must come after the message it answers) are seen in the right order everywhere. Unrelated operations can be seen in different orders by different readers, which is usually fine and much cheaper to provide.
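One common way to track "causally related" is a vector clock, which is an implementation choice assumed here rather than something the model itself requires. In this sketch each clock is a dict from a node name to a counter, and compare is an illustrative helper that classifies two events.

```python
def compare(a: dict, b: dict) -> str:
    """Classify two vector clocks: before, after, equal, or concurrent."""
    nodes = a.keys() | b.keys()
    a_le_b = all(a.get(n, 0) <= b.get(n, 0) for n in nodes)
    b_le_a = all(b.get(n, 0) <= a.get(n, 0) for n in nodes)
    if a_le_b and b_le_a:
        return "equal"
    if a_le_b:
        return "before"      # a happened before b: order must be preserved
    if b_le_a:
        return "after"
    return "concurrent"      # no causal link: replicas may order them freely


message = {"alice": 1}               # Alice posts a message
reply = {"alice": 1, "bob": 1}       # Bob replies after seeing it
unrelated = {"carol": 1}             # Carol posts without seeing either

message_vs_reply = compare(message, reply)
reply_vs_unrelated = compare(reply, unrelated)
```

A causally consistent store only has to deliver the message before the reply; Carol's post can land on either side of both.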

Below those sit the session guarantees, which scope the promise to a single user's session rather than the whole system. Read-your-writes means you always see your own latest update. Monotonic reads means time never goes backward for you, so you will not see a value disappear after you have already seen it. Monotonic writes means your own writes are applied in the order you issued them. These are the guarantees users actually notice, and they are often the practical sweet spot.
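A minimal sketch of how a client could get read-your-writes and monotonic reads: the session remembers the newest version it has written or read and refuses replicas that are behind it. The Replica and Session classes and the single integer version are assumptions for illustration; monotonic writes are not modelled.

```python
class Replica:
    """A copy of the data tagged with the log position it has applied."""

    def __init__(self):
        self.version = 0
        self.data = {}

    def apply(self, version, key, value):
        self.data[key] = value
        self.version = version


class Session:
    """Tracks the newest version this client has written or read."""

    def __init__(self):
        self.min_version = 0

    def write(self, primary, key, value):
        primary.apply(primary.version + 1, key, value)
        # Read-your-writes: later reads must be at least this fresh.
        self.min_version = max(self.min_version, primary.version)

    def read(self, replicas, key):
        for replica in replicas:
            if replica.version >= self.min_version:
                # Monotonic reads: never go back behind what we just saw.
                self.min_version = replica.version
                return replica.data.get(key)
        raise LookupError("no replica is fresh enough for this session")


primary, lagging = Replica(), Replica()
session = Session()
session.write(primary, "balance", 90)
balance = session.read([lagging, primary], "balance")  # skips the stale copy
```

Other sessions can still read from the lagging replica, which is why these guarantees are so much cheaper than a system-wide one.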

Consistency in Databases: Isolation and Serializability

Distributed-systems consistency and database isolation describe the same problem from two angles. Where consistency models talk about replicas, transaction isolation talks about concurrent transactions touching shared rows. Serializability is the gold standard: the result of running transactions concurrently is identical to some order in which they ran one at a time. No lost updates, no anomalies, no surprises. It is the transactional cousin of linearizability and it is the easiest model to reason about, which is why it is worth the cost when money or inventory is involved.

Snapshot isolation is the model many production databases actually give you, sometimes under a different name (PostgreSQL's REPEATABLE READ level, for example, is snapshot isolation), and many engineers run it without realizing it. Each transaction reads from a consistent snapshot taken at its start, so reads never block writes and writes never block reads. It eliminates most anomalies but allows write skew, where two transactions each read a shared state, each decide their write is safe, and together violate an invariant that neither broke alone. Knowing that snapshot isolation is not serializable, and knowing exactly which anomaly slips through, is the difference between a system that quietly corrupts data under load and one that does not.

Underneath all of these guarantees is durability, and durability is its own set of lessons here. Write-ahead logging records the intent to change data before the data itself is touched, so a crash mid-write can be replayed or rolled back cleanly. Checkpointing and snapshotting bound how much of that log has to be replayed after a restart. Point-in-time recovery uses the log plus a base snapshot to rebuild state as it existed at any chosen moment, which is what turns an accidental mass-delete into a recoverable incident rather than a resume-updating event.
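The log-then-apply rule and the role of a checkpoint can be shown with a toy key-value store. TinyStore, its file names, and the JSON record format are all hypothetical, and the sketch skips details a real engine needs, such as detecting a torn final record.

```python
import json
import os
import tempfile


class TinyStore:
    def __init__(self, directory):
        self.wal_path = os.path.join(directory, "wal.log")
        self.ckpt_path = os.path.join(directory, "checkpoint.json")
        self.data = {}
        self._recover()

    def _recover(self):
        # Load the last checkpoint, then replay only the log written since.
        if os.path.exists(self.ckpt_path):
            with open(self.ckpt_path) as f:
                self.data = json.load(f)
        if os.path.exists(self.wal_path):
            with open(self.wal_path) as f:
                for line in f:
                    record = json.loads(line)
                    self.data[record["key"]] = record["value"]

    def put(self, key, value):
        # Write-ahead: the record is durable before the data is touched.
        with open(self.wal_path, "a") as f:
            f.write(json.dumps({"key": key, "value": value}) + "\n")
            f.flush()
            os.fsync(f.fileno())
        self.data[key] = value

    def checkpoint(self):
        # Persist full state atomically, then drop the log it covers.
        tmp_path = self.ckpt_path + ".tmp"
        with open(tmp_path, "w") as f:
            json.dump(self.data, f)
            f.flush()
            os.fsync(f.fileno())
        os.replace(tmp_path, self.ckpt_path)
        open(self.wal_path, "w").close()


with tempfile.TemporaryDirectory() as directory:
    store = TinyStore(directory)
    store.put("a", 1)
    store.checkpoint()
    store.put("b", 2)
    # Opening a fresh instance stands in for a restart after a crash.
    recovered = TinyStore(directory).data
```

After the checkpoint, recovery replays one log record instead of two, which is the bound on restart work that checkpointing buys.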

Choosing a Model and Living With the Trade-Off

The honest framing is the CAP and PACELC tension: when the network splits, you choose between staying consistent and staying available, and even when the network is healthy you trade latency against consistency. Strong models like linearizability require coordination on every operation, which means cross-node round trips, higher tail latency, and an inability to serve writes when a quorum cannot be reached. Weak models skip the coordination, answer locally, and stay up through partitions, but hand you the job of resolving the disagreements they allow.
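A little arithmetic shows where the latency and availability costs come from. The sketch assumes a quorum-replicated store with N replicas where a write waits for W acknowledgements and a read consults R replicas; the function names and the sample round-trip times are made up for illustration.

```python
def quorums_overlap(n: int, w: int, r: int) -> bool:
    """Every read set intersects every write set only when W + R > N."""
    return w + r > n


def can_serve_write(w: int, reachable: int) -> bool:
    """A write needs W acknowledgements from replicas it can reach."""
    return reachable >= w


def write_latency_ms(ack_times_ms, w: int) -> int:
    """The write completes when the W-th fastest acknowledgement arrives."""
    return sorted(ack_times_ms)[w - 1]


N = 5
MAJORITY = N // 2 + 1                     # 3 of 5
ack_times_ms = [2, 3, 40, 45, 120]        # local rack, then remote regions

local_only = write_latency_ms(ack_times_ms, 1)
majority = write_latency_ms(ack_times_ms, MAJORITY)

# A partition splits the cluster into groups of 3 and 2 replicas.
majority_side_up = can_serve_write(MAJORITY, reachable=3)
minority_side_up = can_serve_write(MAJORITY, reachable=2)
```

With these numbers, answering locally takes 2 ms while waiting for a majority takes 40 ms, and the minority side of the partition has to refuse writes entirely.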

The right answer is per-feature, not per-system. A bank ledger or a seat-booking flow wants serializability or linearizability, because showing two truths there is a financial bug. A social feed, a like counter, or a presence indicator is perfectly happy with causal or eventual consistency, because a reader seeing a slightly stale count costs nothing and the latency savings are large. Most real products mix both: a strong core for the parts that move money or enforce uniqueness, and a fast eventually-consistent layer for everything else.

When you accept weak consistency, you need a strategy for the conflicts it produces, which is where the last cluster of lessons lives. Conflict resolution covers the policies: last-write-wins is simple but silently drops data, while smarter merges preserve intent. CRDTs (conflict-free replicated data types) are data structures whose merge operation is mathematically guaranteed to converge no matter the order updates arrive, which is how shopping carts and counters stay correct without a coordinator. Operational transformation solves the same merge problem for ordered text, transforming concurrent edits against each other so two people typing in the same paragraph end up with the same document.
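For a feel of what "transforming concurrent edits against each other" means, here is a minimal sketch covering only concurrent inserts. The Insert type, the site field used as a tie-breaker, and the transform function are illustrative; production OT also handles deletes and longer edit histories.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Insert:
    pos: int
    text: str
    site: int  # assumed unique per client; breaks ties at equal positions


def apply_op(doc: str, op: Insert) -> str:
    return doc[:op.pos] + op.text + doc[op.pos:]


def transform(op: Insert, against: Insert) -> Insert:
    """Rewrite op so it can be applied after `against` was already applied."""
    lands_first = against.pos < op.pos or (
        against.pos == op.pos and against.site < op.site
    )
    if lands_first:
        # The other insert sits to our left, so our position shifts right.
        return Insert(op.pos + len(against.text), op.text, op.site)
    return op


doc = "helo"
a = Insert(pos=3, text="l", site=1)   # one user fixes the typo
b = Insert(pos=4, text="!", site=2)   # another appends punctuation

# Each site applies its own edit first, then the transformed remote edit.
site_a = apply_op(apply_op(doc, a), transform(b, a))
site_b = apply_op(apply_op(doc, b), transform(a, b))
```

Both sites end with the same text even though they applied the edits in opposite orders, which is the convergence property the paragraph above describes.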

How Real Systems Apply These Ideas

Google Spanner is the headline example of buying strong consistency at scale. It uses GPS receivers and atomic clocks, exposed through the TrueTime API, to provide externally consistent (strictly serializable) transactions across data centers, which is why Google runs critical systems on a database that behaves like a single machine spanning continents. The price is that every commit waits out a small clock-uncertainty window. Amazon's DynamoDB takes the opposite stance by default, offering eventually consistent reads for speed and cost, with strongly consistent reads available as an explicit, more expensive option, so teams choose the guarantee per query.
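The commit wait can be illustrated with a small simulation. This is not Spanner's code or API: tt_now, the Interval type, and the 5 ms EPSILON are stand-ins for a clock service that returns a time range guaranteed to contain the true time.

```python
import time
from dataclasses import dataclass

EPSILON = 0.005  # assumed clock-uncertainty bound in seconds, illustrative


@dataclass(frozen=True)
class Interval:
    earliest: float
    latest: float


def tt_now() -> Interval:
    """Simulated uncertain clock: true time lies somewhere in the interval."""
    t = time.monotonic()
    return Interval(t - EPSILON, t + EPSILON)


def commit_with_wait() -> float:
    # Pick a timestamp no earlier than any possible current time.
    commit_ts = tt_now().latest
    # Wait until that timestamp is certainly in the past on every clock.
    while tt_now().earliest <= commit_ts:
        time.sleep(EPSILON / 5)
    return commit_ts


start = time.monotonic()
first = commit_with_wait()
second = commit_with_wait()
elapsed = time.monotonic() - start
```

Because the first commit does not return until its timestamp has definitely passed, any transaction that starts afterwards gets a larger timestamp. The cost is a wait of roughly twice the uncertainty bound on each commit.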

Collaborative editors made these models visible to everyone. Google Docs is built on operational transformation so dozens of people can type at once and converge on the same document. Figma and many newer tools borrow from CRDTs for the same reason, since CRDTs do not need the central transformation server that OT typically relies on. Apple's Notes and offline-first mobile apps use CRDT-style merging so edits made on a plane reconcile cleanly when the device reconnects.

On the durability side, every serious database you have used runs the patterns in this category. PostgreSQL's write-ahead log is the source of truth for crash recovery and for streaming replication to standbys. Its base backups plus archived WAL are exactly what point-in-time recovery reads to rewind a database to the second before a bad migration. Redis offers both snapshotting (periodic point-in-time dumps) and an append-only log, letting operators trade durability against write throughput. These are not exotic features. They are the reason your data is still there after the process that wrote it crashed.

Frequently asked questions

Learn Consistency Models the interactive way

All 21 lessons with step by step diagrams, runnable code, and quizzes. One payment of ₹299 in India or $5 worldwide. Lifetime access, no subscription.
