Experiment
Layered Convergence
Can a specification-first methodology, iterated through build-score-revise trials, converge across 10 full-stack quality layers?
Hypothesis
Convergence Engineering Development (CED) demonstrated convergence at the backend layer (Layer 0) across 10 trials. But backend API quality is only one dimension of production software. The hypothesis: the same specification-first, build-score-revise methodology can converge across all quality layers of a full-stack application.
Convergence means that the methodology has absorbed enough failure modes in a given domain that new trials produce zero new findings. Each layer introduces a distinct quality domain — from integration testing to cross-layer integration — and must converge independently before the next layer begins.
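The convergence criterion above can be sketched as a simple check. This is an illustrative sketch, not the experiment's actual tooling: the `Trial` shape and the idea that convergence requires a fixed number of consecutive clean trials (`required`) are assumptions drawn from the protocol description.

```typescript
// Hypothetical trial record: the failure modes first discovered in that trial.
type Trial = { id: number; newFailureModes: string[] };

// A layer has converged when the most recent `required` trials each
// produced zero new failure modes. The exact value of `required` is
// an assumption; the page notes some layers used a relaxed count.
function hasConverged(trials: Trial[], required: number): boolean {
  if (trials.length < required) return false;
  return trials
    .slice(-required)
    .every((t) => t.newFailureModes.length === 0);
}
```

Under this sketch, a layer that discovers a failure mode in its latest trial resets the clock: the clean-trial window must be rebuilt from scratch.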
Method
Ten progressive layers, each targeting a distinct quality domain. For each layer, three enterprise applications are built in different business domains (event management, booking, telehealth) and scored across multiple dimensions, with failure modes tracked throughout. Each discovered failure mode is fed back into the methodology and must remain absent from all subsequent trials.
Layers are sequential — each must converge before the next begins. This trades speed for rigor: rather than testing everything at once, each domain receives focused attention until no new failure modes emerge.
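The feedback loop described above implies a regression invariant: once a failure mode has been discovered and absorbed into the methodology, it must never reappear in a later trial. A minimal sketch of that check, with hypothetical trial records and failure-mode identifiers:

```typescript
// Hypothetical record of which failure modes were observed in each trial.
type TrialObservations = { trial: number; observed: Set<string> };

// Returns failure modes that reappeared after the trial in which they
// were first discovered -- a protocol violation if non-empty.
function findRegressions(history: TrialObservations[]): string[] {
  const firstSeen = new Map<string, number>();
  const regressions = new Set<string>();
  for (const { trial, observed } of history) {
    for (const fm of observed) {
      const first = firstSeen.get(fm);
      if (first === undefined) {
        firstSeen.set(fm, trial); // first discovery: absorbed into methodology
      } else if (trial > first) {
        regressions.add(fm); // reappeared after absorption
      }
    }
  }
  return [...regressions];
}
```

An empty result means every discovered failure mode stayed fixed, which is the invariant the layered protocol enforces before declaring convergence.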
Results
- 44 trials
- 132 applications
- 102 failure modes
- 10/10 layers converged
All 10 layers converged. The methodology progressed from build-breaking structural failures in Trial 1 to zero new findings across all quality domains. Scores dipped when new layers introduced fresh domains — each layer expanded the scoring criteria to match its domain — then recovered as failure modes were absorbed.
Charts: failure-mode discovery (per-trial and cumulative), score trajectory, new failures per trial, and severity tier progression.
Layer Progression
Each layer added a distinct quality domain. Layers converged sequentially — each had to reach zero new failure modes before the next began.
| Layer | Name | Domain | Trials | Failure Modes | Status |
|---|---|---|---|---|---|
| 0 | Backend API | NestJS + Prisma + PostgreSQL | 1-10 | 34 | Converged |
| 1 | Integration Testing | End-to-end test coverage | 11-14 | 4 | Converged |
| 2 | Frontend | Next.js + React | 15-20 | 24 | Converged |
| 3 | Specifications | Structured spec framework | 21-27 | 8 | Converged |
| 4 | Infrastructure | Docker + CI/CD | 28-31 | 3 | Converged |
| 5 | Monorepo | Turborepo + pnpm workspaces | 32-35 | 1 | Converged |
| 6 | Security | Auth, validation, OWASP | 36-37 | 15 | Converged |
| 7 | Performance | Caching, optimization | 38-39 | 3 | Converged |
| 8 | Monitoring | Logging, health, observability | 40-42 | 5 | Converged |
| 9 | Cross-Layer Integration | Full-stack integration | 43-44 | 4 | Converged |
Transparency
Honest accounting of where the experiment deviated from ideal conditions, its known limitations, and how the protocol was corrected during the research.
Protocol Relaxation
Relaxed convergence criteria
Layers 6 (Security), 7 (Performance), and 9 (Cross-Layer Integration) converged with relaxed criteria — fewer consecutive clean trials than the standard protocol required. The methodology was still applied in full, but convergence was declared earlier than the baseline protocol would have allowed.
Known Limitations
No runtime execution
Verification was static analysis and code review, not live database or API testing. Generated applications were not deployed or executed against real infrastructure.
Single AI system
All building, scoring, and auditing was performed by Claude instances. There was no independent third-party verification or cross-model validation.
Tech stack specificity
Only the NestJS + Next.js + Prisma + PostgreSQL stack was validated. Results may not transfer to other frameworks, languages, or database systems.
Self-Correction
Original run invalidated
An original T15-T49 run was invalidated after scientific review found structural gaps in the experimental protocol. The entire layered convergence sequence (Layers 1-9) was re-run with a corrected protocol.
- No layer progression: trials did not converge each layer before starting the next
- Copied trials: duplicate trial structures across layers
- Self-assessment bias: scoring was not sufficiently independent from generation
- Suspicious regularity: scores showed patterns inconsistent with genuine discovery
- No cross-layer integration: layers were tested in isolation without verifying interactions
- Pre-allocated trial ranges: trial numbers were assigned before results were known
Corrected protocol
The re-run addressed all six issues. This self-correction is documented as evidence of scientific rigor — the willingness to invalidate results and re-run with a better protocol is more valuable than getting it right the first time.
Data Access
All trial data is open source. Every convergence point, score, and failure mode count referenced on this page can be verified in the repository.