Spec-Driven Development in Practice

A Knowledge Architecture for Building With AI Agents

February 2026

The Problem

AI coding agents are powerful but directionless.

"Build me a dashboard"
    → Agent writes 2,000 lines of code
    → None of it matches what you needed
    → You spend longer fixing than you saved

Vibe coding works for prototypes. It doesn't work for products.

The Idea

"Specifications don't serve code — code serves specifications."

Write a spec first. Let the agent implement it. Verify against the spec.

This is Spec-Driven Development. GitHub's Spec Kit popularized the term. The core loop is simple:

Specify → Plan → Tasks → Code

We adopted this. Then we kept going.

The Questions

Every layer in a system answers a different question.

Most SDD workflows answer three: what, how, and done.

After 6 months in production, we found ourselves asking more:

"What should we build?"        → answered
"How should we build it?"      → answered
"Is it done?"                  → answered

"For whom?"                    → ???
"When?"                        → ???
"Why this way and not that?"   → ???
"Who's right when they disagree?" → ???
"How well does it actually work?" → ???

The Knowledge Graph

Nine layers, nine questions

The Full Stack

 ┌────────────────────────────────────────────────┐
 │  STRATEGY        "For whom?"                   │
 │  Market data, EV scoring, use case ranking     │
 ├────────────────────────────────────────────────┤
 │  CONSTITUTION    "What do we believe?"         │
 │  Principles, constraints, anti-patterns        │
 ├────────────────────────────────────────────────┤
 │  ROADMAP         "When?"                       │
 │  Milestones, deadlines, sequencing             │
 ├────────────────────────────────────────────────┤
 │  SPECS           "What?"                       │
 │  Feature definitions, API contracts            │
 ├────────────────────────────────────────────────┤
 │  ADRs            "Why this way?"               │
 │  Decision records, permanent, supersedable     │
 ├────────────────────────────────────────────────┤
 │  TICKETS         "How?"                        │
 │  User stories, priorities, context packets     │
 ├────────────────────────────────────────────────┤
 │  CODE + TESTS    "Does it work?"               │
 │  Implementation, unit tests, E2E tests         │
 ├────────────────────────────────────────────────┤
 │  VALIDATION      "Is it ready?"                │
 │  Use case readiness, traceability audits       │
 ├────────────────────────────────────────────────┤
 │  BENCHMARKS      "How well?"                   │
 │  Real data vs reference tools, accuracy metrics│
 └────────────────────────────────────────────────┘

The Layers

Layer 1: Strategy

"For whom?"

Before writing any spec, score the opportunity:

EV = (Market Score × Platform Fit) / Dev Cost

Factor               Weight  What It Measures
Market Size          25%     TAM for the vertical
Competitive Density  20%     Greenfield scores high
Funding Momentum     20%     VC activity = market timing
Buyer Accessibility  20%     Sales cycle, SMB vs enterprise
Pricing Headroom     15%     ACV potential

Result: A ranked list of use cases. Specs get written top-down.
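The scoring above can be sketched in a few lines. The factor names and weights come from the table; the use cases, factor scores, platform-fit values, and dev costs below are invented for illustration.

```python
# Weights from the EV table above (sum to 1.0).
WEIGHTS = {
    "market_size": 0.25,
    "competitive_density": 0.20,
    "funding_momentum": 0.20,
    "buyer_accessibility": 0.20,
    "pricing_headroom": 0.15,
}

def market_score(factors: dict) -> float:
    """Weighted sum of 0-10 factor scores."""
    return sum(WEIGHTS[name] * score for name, score in factors.items())

def ev(factors: dict, platform_fit: float, dev_cost: float) -> float:
    """EV = (Market Score × Platform Fit) / Dev Cost."""
    return market_score(factors) * platform_fit / dev_cost

# Hypothetical use cases: (factor scores, platform fit, dev cost).
use_cases = {
    "carbon-mrv": ({"market_size": 8, "competitive_density": 6,
                    "funding_momentum": 9, "buyer_accessibility": 5,
                    "pricing_headroom": 7}, 0.9, 3.0),
    "renewable-siting": ({"market_size": 7, "competitive_density": 8,
                          "funding_momentum": 6, "buyer_accessibility": 7,
                          "pricing_headroom": 6}, 0.8, 2.0),
}

# Specs get written top-down from this ranking.
ranked = sorted(use_cases, key=lambda uc: ev(*use_cases[uc]), reverse=True)
```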

Layer 2: Constitution

"What do we believe?"

Principles that constrain all specs and decisions:

  • Declarative over imperative
  • Local-first, cloud-scalable
  • Provenance is first-class
  • AI-native (agents are first-class users)

Spec contradicts principle?  → Fix the spec.
Code contradicts principle?  → Fix the code.
Principle is wrong?          → Update it explicitly, not silently.

Layer 3: Authority Hierarchy

"Who's right when sources disagree?"

After 50 PRs, your spec says one thing, your code does another, and your tests assert a third.

tests/CI  >  current code  >  current docs  >  old docs/lore
  • Tests say X, docs say Y → fix the docs
  • Code does X, spec says Y → fix the spec (usually)
  • New work contradicts an ADR → supersede the ADR explicitly

Tests are the ultimate source of truth. Not specs, not docs.

Layer 4: Spec-Anchored Development

"Specs evolve with implementation"

The naive model: write spec → implement → done.

Reality: write spec → implement → learn → update spec → implement...

Spec (v1) → Implementation → Learning → Spec (v2)
     ↑                                       │
     └───────────────────────────────────────┘

Spec updates during implementation are encouraged, not a sign of failure.
The goal is accurate documentation, not predicting everything upfront.

Layer 5: Decision Records (ADRs)

"Why was it built this way?"

One file per architectural decision. Permanent. Supersedable.

docs/decisions/
  0001-use-sqlite-for-metadata.md     ← still active
  0002-original-auth-approach.md      ← superseded by 0005
  0005-jwt-with-refresh-tokens.md     ← current

Key properties:

  • Permanent — superseded, never edited
  • Cross-referenced — [[adr:0005]] syntax everywhere
  • Constraining — agents must check ADRs before proposing new approaches
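As a sketch of how the cross-reference syntax can be kept honest, a small scanner might flag links that point at superseded decisions. The [[adr:NNNN]] syntax comes from above; the supersession mapping and document text are assumed inputs.

```python
import re

# Matches the [[adr:NNNN]] cross-reference syntax described above.
ADR_REF = re.compile(r"\[\[adr:(\d{4})\]\]")

def stale_refs(text: str, superseded: dict) -> list:
    """Return (old_id, current_id) pairs for references to superseded ADRs."""
    return [(ref, superseded[ref]) for ref in ADR_REF.findall(text)
            if ref in superseded]

# 0002-original-auth-approach was superseded by 0005 (per the listing above).
superseded = {"0002": "0005"}
doc = "Auth follows [[adr:0002]]; metadata lives in SQLite per [[adr:0001]]."
print(stale_refs(doc, superseded))   # [('0002', '0005')]
```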

Layer 6: Triage Protocol

"How does work enter the system?"

Every piece of work hits a decision tree:

Work arrives →
├─ "What is X?"        → Research mode (read docs, don't create)
├─ "Why is X this way?" → Archaeology mode (ADRs, git blame)
├─ Trivial fix          → Just do it
└─ Non-trivial →
    1. INTAKE:     Find the spec
    2. CONSTRAIN:  Check ADRs for limits
    3. SCOPE:      Create/find ticket
    4. EXECUTE:    Code with tight loops
    5. RECONCILE:  Update specs, create ADRs, close ticket

Without this, agents skip straight to step 4 — and create drift.

Layer 7: Traceability

"Did we actually build what the spec says?"

A CLI that audits spec-to-code coverage:

$ trk coverage
┌──────────────────┬──────────┬───────────┐
│ Spec             │ Sections │ Covered   │
├──────────────────┼──────────┼───────────┤
│ artifact-catalog │ 12       │ 12 (100%) │
│ layer-spec       │ 8        │ 6 (75%)   │
│ queue-system     │ 15       │ 9 (60%)   │
└──────────────────┴──────────┴───────────┘
$ trk audit
⚠  3 orphan code files (no ticket link)
⚠  2 spec sections with no implementation
✓  14 tickets properly closed with code + test links
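A minimal sketch of the coverage idea behind the audit, assuming a mapping from code files to the spec sections they implement (all section and file names below are hypothetical):

```python
def coverage(spec_sections, links):
    """links: mapping of code file -> set of spec sections it implements."""
    covered = set().union(*links.values()) if links else set()
    done = [s for s in spec_sections if s in covered]
    return len(done), len(spec_sections)

# Hypothetical spec sections and file-to-section links.
sections = ["ingest", "catalog", "query", "export"]
links = {
    "src/ingest.py": {"ingest"},
    "src/catalog.py": {"catalog", "query"},
}
done, total = coverage(sections, links)
print(f"{done}/{total} sections covered ({100 * done // total}%)")  # 3/4 sections covered (75%)
```

Orphan detection is the inverse walk: files whose section sets are empty have no ticket or spec link.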

Layer 8: Use Case Validation

"Is it ready for real users?"

Each use case declares what it requires:

requires:
  ops: [ndvi, zonal_stats, temporal_composite]
  connectors: [stac, gee]
  datasources: [sentinel-2, admin-boundaries]
  views: [map, timeseries]

The system checks readiness automatically:

$ trk uc show UC-002
UC-002: Carbon MRV
  ops:         3/3 ✓
  connectors:  2/2 ✓
  datasources: 1/2 ✗ (missing: admin-boundaries)
  views:       2/2 ✓
  Readiness:   87%
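The readiness check can be sketched as a diff between required and available capabilities. The requirement lists come from the UC-002 example above; the available-capability sets are assumptions.

```python
def readiness(requires: dict, available: dict):
    """Compare required vs available capabilities per category."""
    report, have, need = {}, 0, 0
    for kind, wanted in requires.items():
        missing = [w for w in wanted if w not in available.get(kind, set())]
        report[kind] = (len(wanted) - len(missing), len(wanted), missing)
        have += len(wanted) - len(missing)
        need += len(wanted)
    return report, round(100 * have / need)

requires = {
    "ops": ["ndvi", "zonal_stats", "temporal_composite"],
    "connectors": ["stac", "gee"],
    "datasources": ["sentinel-2", "admin-boundaries"],
    "views": ["map", "timeseries"],
}
# Assumed registry of what the system currently provides.
available = {
    "ops": {"ndvi", "zonal_stats", "temporal_composite"},
    "connectors": {"stac", "gee"},
    "datasources": {"sentinel-2"},
    "views": {"map", "timeseries"},
}
report, pct = readiness(requires, available)
```

This unweighted ratio gives 89% for the example inputs; the 87% shown above suggests the real tool weights categories differently.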

Layer 9: Benchmarks

The missing question: "How well?"

Tests vs Benchmarks

Tests tell you the code runs. They don't tell you it's correct on real data.

Unit test:  slope(flat_array) == 0.0          ✓  (synthetic)
Benchmark:  slope(USGS 3DEP 10m) ≈ GDAL ±0.5° ?  (real data)

              Tests                      Benchmarks
Data          Synthetic (numpy arrays)   Real (USGS, Sentinel, NLCD)
Reference     Expected values in code    Output from GDAL, QGIS, GEE
Result        Pass / fail                Metrics (RMSE, correlation, %)
Speed         Seconds (every commit)     Minutes (pre-milestone)
Catches       Regressions, math errors   Methodology drift, real-terrain edge cases

Op Benchmarks

"Our output matches the reference tool"

# benchmarks/ops/terrain_slope.yaml
op: terrain_slope
data:
  source: usgs-3dep/10m
  bbox: [-111.8, 40.5, -111.6, 40.7]   # Wasatch Range
reference:
  tool: gdal
  command: "gdaldem slope input.tif ref.tif -alg Horn"
metrics:
  - rmse: { max: 0.5 }           # degrees
  - correlation: { min: 0.999 }
  - max_abs_error: { max: 2.0 }

Does our slope operation produce the same output as GDAL on real terrain?

Workflow Benchmarks

"Our pipeline reproduces a known GIS workflow"

# benchmarks/workflows/renewable_siting.yaml
use_case: UC-004
reference:
  description: "Manual QGIS: slope + aspect + road proximity → weighted overlay"
pipeline:
  steps:
    - op: terrain_slope
    - op: reclassify
      params: { mapping: { "0-5": 5, "5-15": 3, "15-30": 1, "30+": 0 } }
    - op: weighted_overlay
      params: { weights: { slope: 0.4, aspect: 0.3, roads: 0.3 } }
metrics:
  - classification_agreement: { min: 0.85 }
  - top_sites_overlap: { min: 0.90 }

Can we reproduce a manual QGIS analysis programmatically?
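The two workflow metrics can be sketched like this. The metric names come from the benchmark YAML; the class grids and scores below are invented for illustration.

```python
def classification_agreement(ours, ref):
    """Fraction of cells assigned the same suitability class."""
    same = sum(1 for a, b in zip(ours, ref) if a == b)
    return same / len(ref)

def top_sites_overlap(ours, ref, k):
    """Overlap fraction of the top-k highest-scoring cells."""
    def top(scores):
        return set(sorted(range(len(scores)),
                          key=scores.__getitem__, reverse=True)[:k])
    return len(top(ours) & top(ref)) / k

# Invented suitability classes for an 8-cell grid.
ours_classes = [5, 3, 3, 1, 0, 5, 3, 1]
ref_classes  = [5, 3, 1, 1, 0, 5, 3, 1]
print(classification_agreement(ours_classes, ref_classes))  # 0.875
```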

Benchmarks + Use Case Readiness

Use case readiness now includes benchmark status:

# docs/use_cases/renewable-energy.md
requires:
  ops: [geo:slope, geo:weighted_overlay, ...]
  benchmarks:                          # ← new
    - bench-terrain-slope
    - bench-renewable-siting
$ trk uc show UC-004
UC-004: Renewable Energy Siting
  ops:         4/4 ✓
  connectors:  2/2 ✓
  benchmarks:  1/2 ✗ (bench-renewable-siting: agreement 78%, need 85%)
  Readiness:   83%

"Do the ops exist?" is necessary. "Do they produce correct results?" is sufficient.

Putting It Together

The Knowledge Graph

Each layer answers a different question:

Strategy       →  "For whom?"
Constitution   →  "What do we believe?"
Roadmap        →  "When?"
Specs          →  "What?"
ADRs           →  "Why this way?"
Tickets        →  "How?"
Code + Tests   →  "Does it work?"
Validation     →  "Is it ready?"
Benchmarks     →  "How well?"

Miss a layer and you get a predictable failure mode:
no strategy = build the wrong thing, no ADRs = repeat mistakes,
no benchmarks = ship code that passes tests but produces wrong answers.

The Flow

Strategy (for whom?)
    ↓ prioritizes
Constitution (principles)
    ↓ constrains
Roadmap (when?)
    ↓ sequences
Specs (what?)                   ←──── Spec-Anchored:
    ↓ decides                         specs update as we learn
ADRs (why this way?)
    ↓ implements
Tickets (how?)
    ↓ tracks
Code + Tests (does it work?)
    ↓ validates
Use Cases (is it ready?)
    ↓ measures
Benchmarks (how well?)
    ↓ informs
Strategy (next quarter)         ←──── Full loop

Why AI Agents Need This

This isn't just methodology for humans.

AI agents need these layers more than humans do, because agents:

  • Don't have institutional memory → need ADRs
  • Can't judge market timing → need strategy
  • Don't know when sources conflict → need authority hierarchy
  • Will happily close a ticket without checking the spec → need traceability
  • Don't update docs unless told to → need the reconcile step
  • Can't tell if results are correct on real data → need benchmarks

SDD for agents isn't optional. It's the difference between useful and dangerous.

Getting Started

You don't need all nine layers on day one.

Start with three:

1. Authority hierarchy — decide who wins when sources conflict

tests > code > docs

2. Spec-anchored development — update specs during implementation

3. Decision records — one file per choice, permanent, supersedable

Then add layers as your project grows:
strategy when you have competing priorities, benchmarks when correctness matters, traceability when agents are closing tickets.

Thank You

"Technology should be a medium for users' intentions,
not a delivery mechanism for someone else's."