Is this a video course?

No. This is an interactive, slide-based learning platform. Each lesson has rich text, animated diagrams, live code editors, and quizzes. You learn by reading, interacting, and doing, not by watching videos passively.

How long do I have access?

Forever. Both pricing tiers are one-time payments with lifetime access. This includes all current 766 lessons and any future content we add.

What level of experience do I need?

None. We start from absolute basics like 'What is latency?' and build up to distributed consensus protocols. The Foundation level assumes zero prior knowledge of system design.

How much does the system design course cost?

5 US dollars for lifetime access globally, or 299 Indian rupees for lifetime access in India. One-time payment, no subscription, no hidden fees. 11 lessons are free with no signup required.

What technologies are covered?

Everything from DNS and load balancers to Kubernetes, Kafka, distributed databases, consensus protocols, stream processing, security architecture, and observability. We cover principles and real-world implementations used at Netflix, Google, Amazon, Uber, Stripe, and more.

Is this useful for system design interview preparation?

Yes. The lessons are structured around the exact topics asked in system design interviews at FAANG and top-tier companies. Interactive diagrams help you practice whiteboard-style explanations. Covers everything from URL shortener design to distributed payment systems.

How is this different from ByteByteGo or Educative?

766 interactive lessons (4x more than most competitors), 16 different diagram types that build step by step, real production examples from Netflix, Google, Amazon, Uber, and Stripe, and lifetime access for a one-time payment of 5 dollars instead of annual subscriptions costing 100 to 200 dollars per year.

Should I always add an index to speed up a slow query?

No. Indexes make reads faster but slow down writes and take up space, because every insert, update, or delete has to maintain them too. Add an index when a column is used often in WHERE, JOIN, or ORDER BY clauses and the table is large enough that a full scan is a problem. Always check the query plan first. Sometimes the database ignores an index you added, which tells you the real problem is elsewhere, like a missing composite index or a query that cannot use one at all.

What is the difference between partitioning and sharding?

Partitioning splits one table into smaller pieces, usually on the same database server, so each query touches less data. Sharding spreads those pieces across multiple separate servers so the total load is divided. Partitioning is mostly a single-machine performance tool and is relatively easy to add. Sharding is a distributed-systems decision that complicates joins, transactions, and queries that span shards, so it is the last scaling move you should make, not the first.

When should I use a soft delete instead of a hard delete?

Use a soft delete (flag the row as deleted instead of removing it) when you need an audit trail, an undo feature, or when other tables reference the row and a hard delete would break those relationships. Use a hard delete when the data has no future value, when storage or privacy rules require it to actually be gone, or when keeping deleted rows would bloat the table. Many systems soft delete first, then run a periodic hard delete or cleanup to remove old flagged rows.

What is a read replica and when do I need one?

A read replica is a copy of your database that stays in sync with the primary and serves read queries. You point reads at the replica and keep writes on the primary, which spreads load and lets you scale reads without a bigger primary. You need one when read traffic, like dashboards, search, and feeds, is overwhelming a single server. The trade-off is replication lag: a replica can be a fraction of a second behind, so do not read from it right after a write if you need the very latest value.

Why does my database get slower over time even without new code?

Because updates and deletes leave behind dead rows and fragmented storage that the database does not clean up instantly. Over time indexes bloat, tables accumulate dead tuples, and storage gets scattered. Maintenance operations fix this: vacuum reclaims space from dead rows, compaction merges and tidies storage in log-structured engines, and index rebuilding restores fragmented indexes. A database that is never maintained will slowly degrade even if your traffic and code never change.

Is normalization or denormalization the right choice?

It depends on whether you are optimizing for clean writes or fast reads. Normalization splits data to remove duplication, which keeps updates consistent and is the right default for transactional systems. Denormalization deliberately duplicates data so reads do not need expensive joins, which helps read-heavy workloads and reporting. Most real systems normalize the core schema and then denormalize specific hot paths, often using materialized views, once they can prove a query is actually too slow.

foundation

Database Fundamentals

Almost every outage you will ever debug traces back to the database. A query that ran in 5 milliseconds on your laptop with a thousand rows takes 40 seconds in production with 50 million, and now your checkout page is timing out and customers are leaving. The difference is rarely the application code. It is whether the right index exists, whether the table was partitioned, whether reads were sent to a replica, and whether the schema was designed for how the data is actually queried. These are not advanced topics you grow into later. They are the foundation, and ignoring them is the single most common reason small systems fall over as they grow.

This category covers how relational databases actually store, find, and protect your data, and how you keep them fast as the row count climbs. You will learn what an index really is and why B-trees power almost all of them, when to normalize versus denormalize, how to read a query plan and fix the slow part, and how the big scaling moves work: read replicas, replication topologies, partitioning, and sharding. Every lesson is built to be understood by a beginner and still respected by an engineer who has run databases in production.

Database Fundamentals: the landscape

What database fundamentals actually cover

A database is two jobs glued together: store data durably, and find it again quickly. Most of the topics here exist to make the second job fast without breaking the first. When you write a row, the database has to put it somewhere on disk, keep enough structure around it to find it later, and make sure a crash mid-write does not corrupt anything. When you read, it has to locate the rows you asked for among millions of others without scanning all of them.

The lessons start with the building blocks you touch every day. Database Indexing and B-Trees explain how a query finds 3 rows out of 50 million in milliseconds instead of reading the whole table. Database Normalization and Database Denormalization cover how you shape tables: splitting data to avoid duplication and update bugs, or deliberately duplicating it to make reads faster. Query Optimization teaches you to read the plan the database produces and understand why it chose a scan over an index.

From there the category moves into the parts of a database that hold logic and reduce repeated work: Database Views, Materialized Views, Database Functions, Stored Procedures, and Database Triggers. A view is a saved query you can treat like a table. A materialized view stores the result so you do not recompute it on every read. Triggers and stored procedures push logic into the database itself, which is powerful and also a common source of hidden behavior, so the lessons cover both the use and the cost.

Keeping a database healthy: deletes, versions, and maintenance

Deleting data is not as simple as it sounds, which is why this category spends real time on it. Hard Delete removes the row for good. Soft Delete keeps it and flags it as deleted, which is what you usually want when you need an audit trail, undo, or to avoid breaking foreign keys. Tombstoning is the distributed-systems cousin: a marker that says this record is gone, used so that replicas and log-structured stores agree on the deletion. Choosing wrong here leads to either lost data you needed or tables full of garbage you can never clean up.

Keeping history is its own discipline. Versioned Data and Temporal Tables let you answer questions like what did this record look like last Tuesday, which matters for billing disputes, compliance, and debugging. These patterns are how systems give you a clean undo and a defensible record of change without bolting on a separate audit system later.

Databases also need ongoing maintenance, and ignoring it is how a fast database slowly becomes a slow one. Vacuum reclaims space from rows that were updated or deleted but still sit on disk. Compaction merges and cleans up storage in log-structured engines. Index Rebuilding restores an index that has become bloated or fragmented so lookups stay fast. None of this is glamorous, but a database that is never vacuumed or compacted will eventually grind to a halt with no code change to blame.

Scaling reads and writes: replication, partitioning, sharding

One database server can only do so much. When read traffic outgrows a single machine, Read Replicas are the first move: copies that serve queries while writes still go to the primary. Database Replication explains the general mechanism, and Master-Slave Replication versus Master-Master Replication covers the two main shapes. Master-slave (one writer, many readers) is simpler and the default for most systems. Master-master lets multiple nodes accept writes, which buys availability but forces you to handle write conflicts, so reach for it only when you genuinely need it.

When the data itself is too big for one machine, you split it. Table Partitioning breaks one logical table into smaller physical pieces. Vertical Partitioning splits by columns, putting rarely-used or large columns in a separate store. Horizontal Partitioning splits by rows, and the lessons cover the strategies for choosing which rows go where: Range Partitioning (by value ranges, like dates), Hash Partitioning (by a hash for even spread), List Partitioning (by explicit categories, like region), and Composite Partitioning (combining strategies). The choice determines whether your queries hit one partition or all of them.

Database Sharding takes partitioning across separate servers and is the heaviest scaling tool here. It can take you to billions of rows, but it complicates everything: cross-shard queries, joins, transactions, and rebalancing all get harder. The lessons are honest that sharding is a last resort after indexing, replicas, and partitioning have been exhausted. Connection management matters too, which is why Database Connection Pooling is covered: opening a fresh connection per request is one of the quietest ways to overload an otherwise healthy database.

How real systems use these

Every large product is built on these patterns stacked together. A typical high-traffic application sends reads to a pool of read replicas and writes to a primary, with connection pooling in front so a traffic spike does not exhaust the database's connection limit. Hot tables are partitioned by date or by customer so old data can be archived and queries only touch the relevant slice. Indexes are tuned against the actual query plans rather than guessed at.

When a single primary can no longer hold the write volume, companies shard. Platforms running at the scale of social feeds, payments, or messaging shard by user id or tenant so each shard owns a manageable slice, then rely on careful index design and materialized views to keep per-shard queries fast. Soft deletes and temporal or versioned tables back the audit and compliance requirements that come with handling money or personal data.

The pattern to notice is order. Teams that scale well reach for the cheap, low-risk tools first: add an index, read a query plan, add a replica, partition a table. They turn to sharding and multi-master only when they have to, because those add operational cost that never goes away. Learning these fundamentals in that order is exactly how you avoid the classic mistake of jumping to a complex distributed setup to solve a problem that one well-placed index would have fixed.

Frequently asked questions

Learn Database Fundamentals the interactive way

All 31 lessons with step by step diagrams, runnable code, and quizzes. One payment of ₹299 in India or $5 worldwide. Lifetime access, no subscription.

Database Fundamentals

What database fundamentals actually cover

Keeping a database healthy: deletes, versions, and maintenance

Scaling reads and writes: replication, partitioning, sharding

How real systems use these

Frequently asked questions

Database Fundamentals

What database fundamentals actually cover

Keeping a database healthy: deletes, versions, and maintenance

Scaling reads and writes: replication, partitioning, sharding

How real systems use these

All 31 lessons in Database Fundamentals

Frequently asked questions

Learn Database Fundamentals the interactive way

Database Fundamentals

What database fundamentals actually cover

Keeping a database healthy: deletes, versions, and maintenance

Scaling reads and writes: replication, partitioning, sharding

How real systems use these

All 31 lessons in Database Fundamentals

Frequently asked questions

Learn Database Fundamentals the interactive way