Journal

Research Journal

Documenting the research as it happens — experiments, convergence data, and what I'm learning about making AI-generated software reliable.

Two Roads to Deployment

A 19K-line pipeline failed 13 times. An 833-line command succeeded in 3.5 hours. Both approaches continue.

74 trials taught me that measuring AI code quality is at least as hard as generating it.