DATA ARCHITECTURE

Data Modeling Knowledge Base

Zero to advanced, in one place. Relational foundations, dimensional and warehouse design, specialized and modern patterns, and the applied judgment to actually build models instead of just diagramming them.

Learning level

Foundations

Learning level

Dimensional & Warehouse

08
Dimensional & Warehouse
Chapter 8

Dimensional Modeling: Facts & Dimensions

The Kimball core of analytics modeling: facts measure events, dimensions give them context, built around a clear grain.

FactsDimensionsKimballGrainBus matrix
09
Dimensional & Warehouse
Chapter 9

Star vs Snowflake Schemas

The classic tradeoff between a flat star and a normalized snowflake, and exactly when each one is the right call.

StarSnowflakeJoinsNormalizationWhen
10
Dimensional & Warehouse
Chapter 10

Fact Table Design

Additive, semi-additive, and non-additive measures; transaction, snapshot, and accumulating facts; factless and degenerate.

MeasuresTransactionSnapshotFactlessDegenerate
11
Dimensional & Warehouse
Chapter 11

Slowly Changing Dimensions (SCD 0–6)

Track dimension history correctly, from overwrite to history rows to hybrids, the number-one modeling interview topic.

SCD0–6HistoryType 2Effective datesHybrids
12
Dimensional & Warehouse
Chapter 12

Inmon vs Kimball vs Data Vault

The three enterprise modeling philosophies compared, with a clear map of when each approach actually fits.

InmonKimballData VaultCIFWhen
13
Dimensional & Warehouse
Chapter 13

Data Vault 2.0

Hubs, links, and satellites: an auditable, scalable, parallel-loadable pattern for enterprise data warehouses.

HubsLinksSatellitesAuditabilityScale
14
Dimensional & Warehouse
Chapter 14

Modern Warehouse Modeling

Medallion (bronze/silver/gold), wide tables / One Big Table, dbt models, ELT, and the semantic / metrics layer.

MedallionOBTdbtELTSemantic layer
15
Dimensional & Warehouse
Chapter 15

Temporal & Historical Modeling

Effective dating, bitemporal models, snapshots, and event sourcing, so you can ask what was true and when.

Effective datesBitemporalSnapshotsEvent sourcingAs-of

Learning level

Specialized & Applied

16
Specialized & Applied
Chapter 16

NoSQL & Access-Pattern Modeling

Document, key-value, and wide-column stores: model by query, not by entity, and choose embedding vs referencing.

DocumentKey-valueWide-columnEmbed vs refAccess patterns
17
Specialized & Applied
Chapter 17

Graph Data Modeling

Nodes, edges, and properties (Neo4j-style): how to model when the relationships between things are the real data.

NodesEdgesPropertiesTraversalWhen
18
Specialized & Applied
Chapter 18

Master Data Management (MDM)

Golden records, entity resolution, and reference data: one trusted version of customers, products, and accounts.

Golden recordEntity resolutionReference dataSurvivorshipStewardship
19
Specialized & Applied
Chapter 19

Data Mesh & Domain Ownership

Domain-oriented data products, ownership, and federated governance for modeling at organizational scale.

DomainsData productsOwnershipFederationGovernance
20
Specialized & Applied
Chapter 20

ML Feature & Feature-Store Modeling

Model features for machine learning with point-in-time correctness, avoiding leakage and training/serving skew.

FeaturesPoint-in-timeLeakageTrain/serveFeature store
21
Specialized & Applied
Chapter 21

Data Contracts, Schema Evolution & Governance

Versioning, backward/forward compatibility, lineage, ownership, and quality, so models can change without breaking consumers.

ContractsVersioningCompatibilityLineageQuality
22
Specialized & Applied
Chapter 22

Applied Data Modeling: How To Answer Any Modeling Question

A repeatable requirements-to-model method, OLTP vs OLAP structural decisions with the why, worked examples, and interview narration.

MethodRequirementsOLTP/OLAPWorked examplesNarration
23
Specialized & Applied
Chapter 23

Production Data Modeling: Scale, Loading & Behavior

Partitioning, scalability under traffic, insert/update/history load mechanics, late-arriving data, tradeoffs, and a full star + snowflake model with sample queries.

PartitioningScaleLoadingLate dataStar + snowflake