A vision for Claude Code

What if AI coding
tools enforced
engineering discipline?

DORA 2025 proved that AI coding tools increase output and bugs simultaneously. The missing variable isn't the model. It's the methodology.

Engineered mode: a proposed Claude Code mode that enforces research-first development, self-audit gates, adversarial review, and enterprise standards compliance. Automatically.

01 / The Problem

DORA 2025 measured 5,000 developers.
AI made their software worse.

The largest study of professional software development ever conducted found that AI coding tools dramatically increase output volume while simultaneously degrading quality. More PRs. More bugs. Bigger changes. Zero improvement in actual delivery.

Source: DORA 2025 — "AI's primary role in software development is that of an amplifier."

+21% tasks done
+98% PRs merged
+9% bug rates
+91% review time
+154% PR size
Flat delivery performance

The amplifier problem

AI doesn't improve development. It amplifies whatever process already exists. Strong methodology + AI = force multiplier. No methodology + AI = faster dysfunction. DORA measured the second scenario exclusively.

02 / The Missing Variable

DORA never measured
structured methodology.

The study treats every AI-assisted developer identically. Whether you enforce quality gates, self-audit loops, and convergence verification — or you're tab-completing through a sprint — DORA cannot distinguish the two. That's the gap.

What DORA measured

✗ Raw AI tool + developer
✗ No methodology control
✗ No quality gate tracking
✗ No review rigor variable
✗ No testing discipline metric
✗ No tool-type distinction

What structured methodology adds

+ Research before code (always)
+ Self-audit gates (n=5 null convergence)
+ Adversarial review loops
+ Enterprise standards compliance
+ Stress testing at every stage
+ Quality gates enforced by the tool

It's not the model.
It's the methodology.

The question isn't "does AI help developers?" The question is: what happens when AI is paired with rigorous engineering methodology — and what happens when it isn't?

03 / The Proof

Orchestra.
Built by a non-coder in 13 days.

A production-quality desktop application for AI agentic pipeline orchestration. Rust backend. Svelte 5 frontend. SQLite persistence. Security hardened through adversarial cross-architecture review. Built entirely with Claude Code by an epistemological engineer with zero engineering background.

703 tests
99 Tauri commands
9,700 lines Rust
13 working days
11/11 HIRE verdicts
0 critical security findings

Cold assessor verdicts

"Learning velocity from zero AI experience to principal-level methodology design in 6 months. Rated 10/10."
-- Independent assessor

"A hiring manager who disqualifies this person because the Python isn't production-grade is making the same error as someone who disqualifies a principal architect because they don't write the fastest C++."
-- Independent assessor

"Epistemic discipline -- the ability to ask 'how do we know this is true, where could it break, and how do we build the check into the system?' -- across every piece of work."
-- Independent assessor

Code quality rated 2-7/10. Methodology rated 8-9.5/10. All 11 assessors: HIRE. The methodology compensated for code-level gaps and produced enterprise-grade output.

04 / The Vision

Not just Plan mode.
Engineered mode.

What if the methodology that produced Orchestra was built directly into Claude Code? A mode that automatically enforces research-first development, self-audit gates, enterprise standards compliance, and adversarial review. Here's what it could look like.

Claude Code v2.1.0 -- ~/project
ENTERPRISE D2R v3

Pipeline stages
  Stage 00: Research             In progress...
  Stage 01-A: Audit Gate         Waiting
  Stage 02: Implementation       Waiting
  Stage 03: Stress Test          Waiting
  Stage 04: Adversarial Review   Waiting

Standards: ISO 25010 | OWASP ASVS | NIST SSDF

$ claude --mode engineered "Build a secure REST API with auth"
Engineered Mode Active | Stage 00 of 04
ISO 25010 ✓ OWASP ✓ NIST SSDF ✓
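The strict ordering shown in the pipeline stages could be modeled as a sequential gate chain: each stage must pass its exit gate before the next may start, and a single failed gate halts everything downstream. A minimal Python sketch, using the stage names from the mockup; the gate functions here are hypothetical placeholders, not Claude Code internals:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Stage:
    name: str
    gate: Callable[[], bool]  # returns True when the stage's exit criteria are met
    status: str = "Waiting"

def run_pipeline(stages: list[Stage]) -> bool:
    """Run stages strictly in order; halt at the first failed gate."""
    for stage in stages:
        stage.status = "In progress"
        if not stage.gate():
            stage.status = "Failed"
            return False  # no later stage may start
        stage.status = "Done"
    return True

# Stage names taken from the mockup; gates stubbed out for illustration.
pipeline = [
    Stage("Stage 00: Research", lambda: True),
    Stage("Stage 01-A: Audit Gate", lambda: True),
    Stage("Stage 02: Implementation", lambda: True),
    Stage("Stage 03: Stress Test", lambda: True),
    Stage("Stage 04: Adversarial Review", lambda: True),
]
```

The design choice that matters is the early return: implementation literally cannot begin while research or the audit gate is unresolved, which is the enforcement the mode proposes.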
🔍 Research-First

No code is written until the research phase completes. Standards, existing patterns, and domain constraints are analyzed before a single line is generated.

🛡 Self-Audit Gates

n=5 null convergence verification at every stage boundary. The system proves its own work is correct before proceeding. No silent failures.
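One plausible reading of "n=5 null convergence": the audit is re-run five independent times, and the stage boundary opens only if every run returns zero findings (the finding set converges on null). This sketch is an interpretation for illustration, not a published spec; the audit callable is an assumption:

```python
from typing import Callable

def null_convergence(audit: Callable[[], list[str]], n: int = 5) -> bool:
    """Advance only if n independent audit runs each return zero findings.

    'Null convergence' here means the audit repeatedly converges on the
    empty (null) finding set; any finding on any run blocks the boundary.
    """
    for _ in range(n):
        findings = audit()
        if findings:
            return False  # one non-empty run is enough to fail the gate
    return True
```

Repeating the audit guards against a flaky check passing once by chance: a nondeterministic failure mode that surfaces on any of the five runs still blocks the stage.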

Adversarial Review

Built-in adversarial review loops stress-test every output. Edge cases, failure modes, and security vectors are probed before code ships.
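The review loop described above can be sketched as a fix-until-clean iteration: a reviewer probes the artifact, the builder addresses findings, and the loop exits only when a full pass turns up nothing. Function names and the round budget are illustrative assumptions, not Claude Code APIs:

```python
from typing import Callable

def adversarial_review(
    artifact: str,
    probe: Callable[[str], list[str]],        # returns findings: edge cases, failure modes, security vectors
    fix: Callable[[str, list[str]], str],     # returns a revised artifact addressing the findings
    max_rounds: int = 10,
) -> tuple[str, bool]:
    """Iterate probe -> fix until a clean pass, or give up after max_rounds."""
    for _ in range(max_rounds):
        findings = probe(artifact)
        if not findings:
            return artifact, True    # clean pass: eligible to ship
        artifact = fix(artifact, findings)
    return artifact, False           # never converged: do not ship
```

The round budget matters: without it, an adversary that always finds something would loop forever, so non-convergence is surfaced explicitly as a do-not-ship result rather than hidden.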

05 / The Market

94% of humans aren't software engineers.
Many of them are systems thinkers.

Teachers managing 30 stochastic agents. Operations leaders orchestrating supply chains. Researchers designing experimental protocols. Behavioral scientists modeling human systems. These are epistemological engineers who can design AI orchestration without coding -- when the methodology is right.

Current state

$440K average pipeline deployment cost
38% failure rate on structural output
+9% bug rates with AI tooling (DORA)
95% of AI pilots never reach production
Hiring from 6% of the workforce (engineers only)

With Engineered mode

703 tests, zero critical findings, 13 working days
Quality gates enforced automatically by the tool
Enterprise standards baked into the pipeline
Non-engineers shipping production-grade software
Entire untapped talent pool of systems thinkers
94% of humans are not software engineers

But many of them are systems thinkers who already orchestrate complexity every day. Claude Code + Engineered mode makes them builders.

06 / Next Step

Interested in bringing
Engineered mode to Claude Code?

The methodology is proven. The proof of concept has shipped. The playbook is ready to be written.

Orchestra is proof of concept #1 -- a full production desktop app built by a non-coder using Claude Code + structured methodology. The next step is making that methodology available to everyone, built directly into the tool.

703 tests | 99 Tauri commands | 9,700 lines Rust | 13 working days | 11/11 HIRE | 0 critical sec findings