Is this a video course?

No. This is an interactive, slide-based learning platform. Each lesson has rich text, animated diagrams, live code editors, and quizzes. You learn by reading, interacting, and doing, not by watching videos passively.

How long do I have access?

Forever. Both pricing tiers are one-time payments with lifetime access. This includes all current 766 lessons and any future content we add.

What level of experience do I need?

None. We start from absolute basics like 'What is latency?' and build up to distributed consensus protocols. The Foundation level assumes zero prior knowledge of system design.

How much does the system design course cost?

5 US dollars for lifetime access globally, or 299 Indian rupees for lifetime access in India. One-time payment, no subscription, no hidden fees. 11 lessons are free with no signup required.

What technologies are covered?

Everything from DNS and load balancers to Kubernetes, Kafka, distributed databases, consensus protocols, stream processing, security architecture, and observability. We cover principles and real-world implementations used at Netflix, Google, Amazon, Uber, Stripe, and more.

Is this useful for system design interview preparation?

Yes. The lessons are structured around the exact topics asked in system design interviews at FAANG and top-tier companies. Interactive diagrams help you practice whiteboard-style explanations. Covers everything from URL shortener design to distributed payment systems.

How is this different from ByteByteGo or Educative?

766 interactive lessons (4x more than most competitors), 16 different diagram types that build step by step, real production examples from Netflix, Google, Amazon, Uber, and Stripe, and lifetime access for a one-time payment of 5 dollars instead of annual subscriptions costing 100 to 200 dollars per year.

How does Instagram store photos at scale?

Photos go through a Media Processor that generates multiple variants (thumbnail, medium, full-size) and writes them to a blob store fronted by a CDN. Meta uses Haystack, a custom photo storage system optimized for many small writes.

How is the home feed generated?

Hybrid push-pull. Low-follower authors fan out to follower timelines on write. Celebrities are pulled at read time. A ranking model reorders the merged set based on engagement, freshness, and relationship strength.

How do Stories work technically?

Stories live in a separate Cassandra cluster with a 24-hour TTL on each row. The read path lists unviewed stories from followed accounts within the TTL window, ordered by close-friend priority. TTL handles cleanup automatically.

How does the Reels recommendation feed work?

Same two-stage pattern as YouTube and TikTok. A candidate generator narrows the catalog to hundreds of Reels per user. A ranker scores each candidate by predicted watch time and engagement. A real-time layer adapts to the current session.

Why use Cassandra for the feed instead of SQL?

Cassandra scales horizontal writes cleanly and handles the fan-out write storm. SQL would need manual sharding and would not scale as easily. The trade-off (no joins) is fine because each user's feed is independent.

System Design Interview Guide

Design Instagram: System Design Interview Guide

Instagram serves 2 billion users with 500 million daily active stories, 95 million photos and videos uploaded per day.

Designing Instagram combines photo upload pipelines, feed generation (similar to Twitter), Stories (a separate ephemeral feed), Direct Messages, and a heavy media CDN. The hardest piece is generating a personalized feed that mixes friends, followed accounts, and Reels recommendations in 200 milliseconds.

Asked at: Commonly asked at Meta (Instagram's parent), Google, Amazon, Snap, TikTok, and Pinterest. Often a more visual variant of Design Twitter.

Why this question is asked

Design Instagram tests photo and video upload pipelines, feed generation trade-offs (push vs pull), ephemeral content (Stories), and the recommendation feed for Reels. It is a richer version of Design Twitter with a real media pipeline attached.

Requirements

Always clarify these in the first 5 minutes of the interview. Do not start drawing boxes until both lists are agreed.

Functional requirements

Users upload photos and videos with captions
Users follow other users and see a chronological or ranked feed
Stories: ephemeral 24-hour posts
Direct Messages between users
Like, comment, share, save posts
Reels: a TikTok-style recommendation feed
Search by user, hashtag, location

Non-functional requirements

Feed load under 500 ms at the 99th percentile
Photo upload under 5 seconds at the 95th percentile on 4G
99.99% availability
Eventual consistency on like counts and feed
Scale to 2B users
Global media delivery via CDN

Back-of-envelope scale estimates

Show your math. Pulling numbers from thin air signals you have not thought about the load.

Total users

Public Meta reporting. Assume 1.2 average profiles per user (a few have business accounts).

Daily active users

Public reporting: 1B+ DAU on Instagram.

Photos and videos uploaded per day

95M

Public reporting. Average ~95M new posts daily.

Feed reads per second (peak)

500K

1B DAU times 10 feed loads per day, with a 4x peak factor.

Media storage growth

20 PB/year

95M new media items times average 4 MB times 5x for multi-resolution thumbnails and processed copies times 365 days.

High-level architecture

Upload path: client uploads media to an Upload Service that writes to a regional blob store. The Media Processor generates thumbnails, runs ML pipelines (face detection, content classification, NSFW filtering), and stores variants in a media store fronted by a CDN. The post metadata (caption, location, tags) is written to a sharded SQL store. Feed path: similar hybrid push-pull as Twitter. For low-follower users, posts are fanned out to follower timelines (Cassandra). For high-follower users, posts are pulled at feed-read time. The Reels feed is generated by a separate recommendation pipeline (candidate generation plus ranking) trained on watch behavior. Stories are stored separately with a 24-hour TTL. DMs run on a separate persistent-connection gateway, similar to WhatsApp.

In a real interview, sketch this on the whiteboard before diving into any single box.

Core components

Walk through each service. The interviewer wants to hear what each one owns, not just the names.

Upload Service

Resumable upload endpoint for photos and videos. Writes raw media to a regional blob store. Emits an UploadComplete event for the Media Processor.

Media Processor

Consumes UploadComplete events. Generates multiple thumbnail sizes, runs face and object detection, applies any filters or AI transforms, and writes variants to the CDN-fronted media store.

Post Service

Writes post metadata (caption, media URLs, location, tags) to sharded SQL. Emits a PostCreated event for fan-out and search indexing.

Feed Service

Generates the home feed. Reads the user's precomputed timeline (Cassandra), merges in pulled content from followed celebrities, and applies a ranking model.

Fan-Out Service

Consumes PostCreated events. For users below a follower threshold, writes the post ID into each follower's timeline. For high-follower users, skips fan-out.

Stories Service

Stores ephemeral 24-hour posts in a separate Cassandra cluster with TTL. Has its own feed read path: which stories are unviewed for this user, ordered by close-friend priority.

Reels Recommendation Service

Two-stage ranking like YouTube. Candidate generator selects ~hundreds of Reels per user. Ranker scores each based on predicted watch time and engagement. Real-time signals from current session refine the ordering.

DM Service

Persistent-connection gateway for direct messages, similar to WhatsApp's architecture but without E2E encryption by default (it is opt-in).

Data model

Pick the right store per table. Justify each choice with the access pattern, not by reflex.

users

user_id (PK)username (UNIQUE)profile_pic_urlfollower_count_cachedfollowing_count_cached

Sharded by user_id hash. Username has a unique constraint for handle reservation.

posts

post_id (PK, snowflake)author_idmedia_urls[]captionlocation_idcreated_at

Sharded by author_id. Snowflake IDs encode timestamp.

follows

follower_id (PK partition)followee_id (PK sort)created_at

Two denormalized tables for follower and following lookups, both sharded.

stories

story_id (PK)author_idmedia_urlcreated_atexpires_at

Cassandra with TTL. The TTL is 24 hours after created_at. After expiry, rows are auto-evicted.

user_feed

user_id (PK partition)post_id (clustering by timestamp)

Cassandra. Bounded to ~1000 most recent posts. Populated by fan-out for low-follower authors.

Deep dives

These are the conversations the interviewer is steering you toward. Practice each one until you can talk through it without notes.

Photo and video upload pipeline

The Upload Service writes raw media to a regional blob store and emits an event. The Media Processor consumes the event and generates several variants: a small thumbnail for the feed, a medium for the profile grid, a full-size for the detail view, and (for videos) multiple bitrate variants for adaptive playback. Each variant goes to the media store fronted by a CDN. During processing, ML jobs run: face detection (for tagging suggestions), content classification (for ads and discovery), and NSFW detection. The post is marked viewable only after processing completes, which usually takes a few seconds for photos and 30+ seconds for videos.

Feed generation: push, pull, and ranking

Same hybrid push-pull as Twitter. Low-follower authors fan out to follower timelines on write. Celebrity authors skip fan-out and are pulled at read time. The novelty in Instagram is the ranking step: even after merging push and pull content, a learned ranking model reorders posts based on predicted engagement, relationship strength (close friends rank higher), and freshness. The model runs as an online service called at feed-read time. Pure chronological feed is still offered as a setting.

Stories: ephemeral feed with TTL

Stories are a separate feed entirely. Each story has a 24-hour TTL written into a Cassandra cluster. The read path is: list stories from accounts I follow that are within the last 24 hours and that I have not yet viewed, ordered by close-friend priority and recency. View state (which stories I have already seen) is stored in a per-user table. The TTL on Cassandra rows handles cleanup automatically. Stories never go into the main feed.

Reels recommendation feed

Reels is a TikTok-style recommendation feed: short videos pulled from across the entire catalog, not just accounts I follow. It uses the YouTube-style two-stage pipeline. A candidate generator (two-tower neural network) produces a few hundred candidate Reels per user from billions in the catalog. A ranker scores each candidate using predicted watch time, completion rate, share rate, and engagement. The ranker also enforces diversity (do not show two Reels from the same author back-to-back) and freshness (mix recent posts with high-engagement classics). The real-time layer adapts to the current session: if you watched a cooking Reel, it boosts cooking-adjacent content in the next pull.

Trade-offs to discuss

Every senior interviewer expects you to surface at least 3 of these. Pick the decisions, state the alternatives, and justify your choice.

Cassandra vs SQL for the user feed

Cassandra scales writes horizontally and handles the fan-out write storm cleanly. SQL would need careful sharding and would not scale as easily. The cost is no joins (each follower's feed is independent). Cassandra wins for the feed workload.

Ranked vs chronological feed

Ranked lifts engagement but is opaque to users. Chronological is predictable but mixes important and trivial. Instagram offers both: Following is chronological, Home is ranked. Most users default to ranked.

TTL-based cleanup vs explicit deletion for Stories

TTL is cheap (Cassandra evicts rows automatically) and self-healing. Explicit deletion requires a scheduled job. TTL wins for a 24-hour ephemeral workload.

Reels candidate generator vs simple following-based feed

A candidate generator drives discovery beyond the follow graph, which is why TikTok dominated short-form video. A pure follow-based feed limits exposure to creators users have not seen. Reels would lose to TikTok without the recommendation pipeline.

E2E encryption on DMs

Instagram introduced E2E DMs as opt-in, not default. The cost of default E2E would be losing server-side spam detection and content moderation. Opt-in is the pragmatic middle ground.

How Instagram actually does it

Instagram famously scaled to 14 million users with a 3-engineer infra team on Python/Django and PostgreSQL. Today it runs on Meta's infrastructure: TAO (a graph cache layer over MySQL), Cassandra for feeds, Haystack for photo storage, and a CDN fronted by Meta's edge network. The Reels recommendation system reuses architecture from Facebook's ranking systems with adaptations for short-form video. DMs run on a separate stack that overlaps with WhatsApp infrastructure. The photo and video processing pipeline runs on a queue-driven worker fleet that handles millions of uploads per day; ML inference for face detection and content classification runs as a separate job tier so a slow model does not block media availability. Search uses Unicorn, Meta's graph-indexing system originally built for Facebook search, adapted to handle hashtag, user, and place queries with personalization signals from the social graph. Stories were a late addition to the architecture but kept clean by being a separate Cassandra cluster with its own read and write paths, so the main feed pipeline did not have to absorb the 24-hour TTL semantics.

Sources

Lessons to study before this interview

If any of these topics are fuzzy, the interviewer will catch it. Each lesson is 15 to 60 minutes with diagrams, code, and a quiz.

Fan-Out and Fan-In

intermediate / messaging event systems

Content Delivery Network

foundation / load balancing proxies

Database Sharding

foundation / database fundamentals

Key-Value Stores

intermediate / database types storage

Capstone: Design Twitter Feed

capstone / capstone

Frequently asked questions

Practice with 766 system design lessons

Lifetime access for INR 299 or $5. Interactive diagrams, runnable code, quizzes, and 20 capstone projects including Design Instagram.