- My Development Notes/
- System Design Components/
- Mechanical Infrastructure System Design Interview Companion/
Mechanical Infrastructure System Design Interview Companion
Mechanical Infrastructure System Design Interview Companion #
A focused companion to the Mechanical System Design Interview Framework for infrastructure and platform interviews. The core field graph and mechanism derivation stay shared. This document narrows the entry point, archetypes, technology silhouettes, and presentation style for infra-heavy prompts.
Why This Exists #
Infrastructure interviews usually emphasize:
- authoritative state instead of user workflow
- control plane vs data plane boundaries
- ordering, coordination, and replication scopes
- failure semantics before product projections
The shared interview framework already covers these, but product and infra prompts benefit from different first instincts. This companion provides that infra-first adaptation without duplicating the full field graph.
How To Use This Companion #
- Start with the shared framework for:
- field taxonomy
- dependency DAG
- complete field graph
- step-by-step worksheet
- write-shape and mechanism derivation
- Use this companion for:
- infra-first problem recognition
- infra archetype selection
- infra technology silhouettes
- infra presentation script
- infra-specific prompt mappings
If a prompt is clearly user/product/workflow oriented, use the main interview framework page directly.
Infra First Branch #
Treat the prompt as infrastructure-first if the dominant question is about one or more of:
- coordination
- routing
- replication
- control-plane propagation
- ordering
- partition ownership
- serving substrate
- cache or log semantics
Typical signals:
- queue, pub/sub, stream, broker
- cache, key-value, session, lock service
- load balancer, gateway, CDN
- service discovery, config, coordination, leader election
- scheduler, serverless worker substrate
- metadata store, distributed ID generation
Infrastructure Archetypes #
Use these as the first decomposition shortcut before filling the shared worksheets.
Canonical set:
I01 Coordination / Consensus MetadataI02 Claim / Lease / Exclusive OwnershipI03 Due-Time Release + Claimable RunI04 Frontier Scan + Claimable RunI05 Append Log + Consumer ProgressI06 Projection / Index / Search PipelineI07 Cache / Origin Projection / Edge DeliveryI08 Traffic Shaping / Admission ControlI09 Sequence / Identifier GenerationI10 Membership / Presence / RegistryI11 Control Plane + Snapshot DistributionI12 Workflow + External Side EffectI13 Shared Subject CoordinationI14 Immutable Artifact Namespace + DeliveryI15 Execution Fleet + Worker SubstrateI16 Key-Scoped Mutable State / Replicated KVI17 Traffic Steering / Request Mediation PlaneI18 Telemetry / Time-Series PipelineI19 Replicated Chunk / Block / File Storage Substrate
Use the dominant archetype first, then compose supporting ones as:
- input
- output
- decorator / middleware
- substrate
For the full taxonomy with entities, source of truth, write paths, read paths, sequence shapes, failure prevention and repair, scaling bottlenecks, and archetype connections, use:
If an infra prompt still feels overlapping after archetype selection, add the runtime overlay from:
That is most useful for:
- scheduler and workflow systems
- fleets and serverless substrates
- streaming and projection pipelines
- telemetry systems
- storage repair and rebalance systems
Common Infrastructure System Mappings #
| System | Dominant archetype(s) |
|---|---|
| Distributed cache | I16 Key-Scoped Mutable State / Replicated KV; I07 Cache / Origin Projection / Edge Delivery |
| Message queue / pub-sub | I05 Append Log + Consumer Progress |
| Real-time messaging substrate | I05 Append Log + Consumer Progress; I10 Membership / Presence / Registry |
| Load balancer | I17 Traffic Steering / Request Mediation Plane; I11 Control Plane + Snapshot Distribution |
| API gateway | I17 Traffic Steering / Request Mediation Plane; I08 Traffic Shaping / Admission Control; I11 Control Plane + Snapshot Distribution |
| Serverless compute service | I15 Execution Fleet + Worker Substrate; I03 Due-Time Release + Claimable Run; I11 Control Plane + Snapshot Distribution |
| Mobile OS update rollout | I11 Control Plane + Snapshot Distribution; I14 Immutable Artifact Namespace + Delivery |
| CDN | I07 Cache / Origin Projection / Edge Delivery; I11 Control Plane + Snapshot Distribution |
| Snowflake / distributed ID generator | I09 Sequence / Identifier Generation |
| ZooKeeper-like coordination store | I01 Coordination / Consensus Metadata; I02 Claim / Lease / Exclusive Ownership |
| Distributed KV / session store | I16 Key-Scoped Mutable State / Replicated KV |
| Metrics / monitoring platform | I18 Telemetry / Time-Series Pipeline |
| Block / distributed file storage | I19 Replicated Chunk / Block / File Storage Substrate |
Default Technology Silhouettes #
This section is shape-first, not vendor-first. Recall the storage, coordination, and serving shapes before naming concrete products.
| Archetype | Default storage shape | Default coordination shape | Default serving shape |
|---|---|---|---|
I01 Coordination / Consensus Metadata | small strongly consistent metadata store | consensus, lease, leader election | metadata reads and watch streams |
I02 Claim / Lease / Exclusive Ownership | lease/claim table or store | guarded claim + renewal + fencing | owner read / fenced actor path |
I03 Due-Time Release + Claimable Run | schedule truth + due index + runnable store | time-gating + idempotent materialization | runnable fetch / worker claim |
I04 Frontier Scan + Claimable Run | frontier store + checkpoint store | claim + checkpoint progression | worker scan / batch fetch |
I05 Append Log + Consumer Progress | append-only log + progress store | partition ordering + offset semantics | consumer reads or fanout |
I06 Projection / Index / Search Pipeline | source truth + projection/index store | projector checkpoint + replay | projection/search serving |
I07 Cache / Origin Projection / Edge Delivery | origin store + edge cache | invalidation, refresh, purge | cache/edge serving |
I08 Traffic Shaping / Admission Control | budget/token/concurrency state store | atomic admission update + policy propagation | evaluator fast path |
I09 Sequence / Identifier Generation | counter/range lease store | uniqueness, monotonicity, worker-id allocation | ID issue endpoint |
I10 Membership / Presence / Registry | membership/presence registry | heartbeat, expiry, watch/gossip | membership lookup / presence read |
I11 Control Plane + Snapshot Distribution | config truth + snapshot state | watch, propagation, rollout | local snapshot evaluator |
I12 Workflow + External Side Effect | workflow truth + outbox + delivery attempts | state transition + idempotent effect | status read / reconciled effect |
I13 Shared Subject Coordination | op log + subject state + snapshot | sequencing/merge/version discipline | subject sync/fanout |
I14 Immutable Artifact Namespace + Delivery | head/manifest store + blob store | publish-content-first + head CAS | resolve/fetch artifact |
I15 Execution Fleet + Worker Substrate | worker/capacity/placement store | placement, lease, reconciliation | worker runtime execution |
I16 Key-Scoped Mutable State / Replicated KV | partitioned KV state store | per-key overwrite, replication, TTL | direct key lookup |
I17 Traffic Steering / Request Mediation Plane | route/health/policy state | health-aware selection + policy mediation | proxy / steering plane |
I18 Telemetry / Time-Series Pipeline | WAL/block store + label index + alert state | ingest, compaction, rule evaluation | time-series query / alerting |
I19 Replicated Chunk / Block / File Storage Substrate | metadata store + chunk placement + replica state | placement, writer lease, repair/rebalance | metadata lookup + chunk fetch |
Rule:
- pick the dominant archetype
- recall the silhouette
- then choose products that fit the shape and NFRs
Infrastructure Rendering #
Use this verbal sequence in infra rounds.
- “I’ll first normalize the requirements into state mutations, serving paths, and control or maintenance flows.”
- “I’ll identify the correctness-critical state and the scopes where ordering, exclusivity, or coordination matter.”
- “I’ll identify the smallest primary state needed to preserve authoritative system state.”
- “For each critical path, I’ll state the hard invariants.”
- “Then I’ll declare the execution context that constrains valid mechanisms.”
- “Then I’ll classify the mutation and coordination shapes and choose mechanisms from the derived predicates.”
- “Then I’ll define how serving paths read authoritative, replicated, or derived state.”
- “Then I’ll cover retries, crashes, replication, and scaling bottlenecks.”
Infrastructure Normalization Hints #
Map infra prompts into the shared operation language using these common aliases:
| Infra phrase | Shared operation | Typical interaction kind |
|---|---|---|
| publish to log | append event | create target |
| advance offset / ack progress | overwrite state or state transition | update target |
| claim lease / leadership | conditional write or state transition | update target |
| renew heartbeat | overwrite state | hidden write target |
| rebalance assignment | async process | hidden write target |
| propagate config | async process | async side-effect target |
| serve cached object | read projection | read projection target |
| fetch authoritative metadata | read source | read source target |
This keeps infra prompts inside the same canonical field graph instead of inventing a second framework.
Infra Prompt Checklist #
Before leaving baseline design, make sure you have answered:
- what is the authoritative state?
- what is the correctness scope: key, partition, shard, replica group, or global?
- where does ordering matter?
- where does exclusivity or leadership matter?
- what can become stale, and what cannot?
- what happens when a holder crashes?
- what is the first hotspot at scale?
If these are explicit, the rest of the shared worksheet usually becomes straightforward.
There's no articles to list here yet.