Data Modeling Knowledge Base

EntitiesERDCardinalityM:NBridges

Chapter 2

Entities, Relationships & Cardinality

Read and draw ER diagrams: entities, attributes, crow’s-foot notation, 1:1 / 1:M / M:N, and bridge tables.

PrimarySurrogateCompositeForeignIntegrity

Chapter 3

Keys & Identity

Natural, surrogate, and composite keys, foreign keys, and referential integrity, where most real data bugs begin.

GrainAtomicityMixed grainCountingDeclaration

Chapter 4

Grain: What One Row Means

Declare the grain before you build, and avoid the mixed-grain disasters that silently corrupt every metric downstream.

1NF–BCNFAnomaliesDependenciesDenormalizeTradeoffs

Chapter 5

Normalization & Denormalization

Functional dependencies, 1NF through BCNF, update anomalies, and when to deliberately denormalize for reads.

RequirementsNormalizedTransactionsConstraintsWrites

Chapter 6

OLTP Modeling: Designing Operational Schemas

Turn requirements into a normalized operational schema, with the why: write integrity, transactions, and constraints.

OLTPOLAPWorkloadsTradeoffsDecisions

Chapter 7

OLTP vs OLAP: Choosing How To Structure Data

Why operational and analytical models differ, how to decide between them, and the reasoning behind each structure.

Learning level

Dimensional & Warehouse

Chapter 8

Dimensional Modeling: Facts & Dimensions

The Kimball core of analytics modeling: facts measure events, dimensions give them context, built around a clear grain.

FactsDimensionsKimballGrainBus matrix

StarSnowflakeJoinsNormalizationWhen

Chapter 9

Star vs Snowflake Schemas

The classic tradeoff between a flat star and a normalized snowflake, and exactly when each one is the right call.

MeasuresTransactionSnapshotFactlessDegenerate

Chapter 10

Fact Table Design

Additive, semi-additive, and non-additive measures; transaction, snapshot, and accumulating facts; factless and degenerate.

SCD0–6HistoryType 2Effective datesHybrids

Chapter 11

Slowly Changing Dimensions (SCD 0–6)

Track dimension history correctly, from overwrite to history rows to hybrids, the number-one modeling interview topic.

InmonKimballData VaultCIFWhen

Chapter 12

Inmon vs Kimball vs Data Vault

The three enterprise modeling philosophies compared, with a clear map of when each approach actually fits.

HubsLinksSatellitesAuditabilityScale

Chapter 13

Data Vault 2.0

Hubs, links, and satellites: an auditable, scalable, parallel-loadable pattern for enterprise data warehouses.

MedallionOBTdbtELTSemantic layer

Chapter 14

Modern Warehouse Modeling

Medallion (bronze/silver/gold), wide tables / One Big Table, dbt models, ELT, and the semantic / metrics layer.

Effective datesBitemporalSnapshotsEvent sourcingAs-of

Chapter 15

Temporal & Historical Modeling

Effective dating, bitemporal models, snapshots, and event sourcing, so you can ask what was true and when.

Learning level

Specialized & Applied

Chapter 16

NoSQL & Access-Pattern Modeling

Document, key-value, and wide-column stores: model by query, not by entity, and choose embedding vs referencing.

DocumentKey-valueWide-columnEmbed vs refAccess patterns

NodesEdgesPropertiesTraversalWhen

Chapter 17

Graph Data Modeling

Nodes, edges, and properties (Neo4j-style): how to model when the relationships between things are the real data.

Golden recordEntity resolutionReference dataSurvivorshipStewardship

Chapter 18

Master Data Management (MDM)

Golden records, entity resolution, and reference data: one trusted version of customers, products, and accounts.

DomainsData productsOwnershipFederationGovernance

Chapter 19

Data Mesh & Domain Ownership

Domain-oriented data products, ownership, and federated governance for modeling at organizational scale.

FeaturesPoint-in-timeLeakageTrain/serveFeature store

Chapter 20

ML Feature & Feature-Store Modeling

Model features for machine learning with point-in-time correctness, avoiding leakage and training/serving skew.

ContractsVersioningCompatibilityLineageQuality

Chapter 21

Data Contracts, Schema Evolution & Governance

Versioning, backward/forward compatibility, lineage, ownership, and quality, so models can change without breaking consumers.

MethodRequirementsOLTP/OLAPWorked examplesNarration

Chapter 22

Applied Data Modeling: How To Answer Any Modeling Question

A repeatable requirements-to-model method, OLTP vs OLAP structural decisions with the why, worked examples, and interview narration.