THE AI EXPERT · DELIVERY METHOD

THE AI EXPERT SDLC
SPEC-DRIVEN AGENTIC DELIVERY

From planning to full CI/CD, run by an orchestrated AI team under human direction. A repeatable lifecycle where every change begins as a written specification and ends as verified behaviour in the real environment.

11lifecycle stages6agent roles6core principlesHuman-directed · AI-executed

SECTION A

THE DELIVERY LIFECYCLE

Every piece of work travels the same spine, no shortcuts. Each stage has a clear exit condition before the next begins, so quality is built in rather than inspected afterwards.

Spec

Define

Every piece of work begins as a written specification with explicit acceptance criteria. Tech-lead approval comes before anything else.

Plan

Define

An implementation plan sets out tasks, dependencies and acceptance criteria, turning the specification into a sequenced route.

Ticket

Define

Linear issues are created from the approved specification before any code is written, so the backlog reflects the agreed scope.

Branch

Build

All work happens on feature branches off freshly-fetched origin/main, never main directly. Recently-merged PRs are scanned for conflicts before starting.

Implement

Build

A failing test comes first, then the change. Edits stay surgical, and every line traces back to the specification.

Simplify

Build

A dedicated simplification pass runs before commit, removing accidental complexity while the change is still fresh.

PR

Review

A draft pull request carries a summary and a test plan. Builds must pass before the change is put forward for review.

Adversarial review

Review

An independent AI reviewer on a separate model tries to break the change. Findings are authoritative and fixed before merge. No agent self-merges schema, CI or infrastructure changes.

Merge + CI/CD

Ship

After a clean review the change merges and the pipeline deploys automatically. Deploy status is verified and staging environments are auto-cleaned.

Verify behaviour

Ship

The change is confirmed in the real environment, browser or live API, not just green tests. Anything only proxy-verified is logged as explicit verify-debt.

Close the loop

Ship

Linear is updated to Done, and the backlog and trackers are refreshed so the next piece of work starts from an accurate picture.

SECTION B

THE AGENTIC TEAM THAT RUNS IT

A layered organisation with a single human at the top for direction and decisions. Authority and accountability flow downward, evidence flows back up, and specialist reviewers gate the work before it lands.

Human tech lead

Human

Sets direction and makes the decisions that matter. Owns the product calls and the priorities, and delegates execution to the orchestration layer.

Lead orchestrator, "Rune"

The completeness and protocol conscience of the team, and the single human interface. Tracks the verify-debt ledger and a dropped-items watchlist so nothing stalls silently or is quietly called done.

Per-project orchestrators

One per engagement. Each owns its backlog, dispatches work and supervises its own workers, keeping projects isolated from one another.

Worker agents

Each takes one scoped task at a time: failing test first, then evidence-based self-verification before handing the change on.

Scoped task

One clear objective, owned end to end

Test first

A failing test defines success

Self-verify

Evidence gathered before hand-off

Specialist SME agents

AI · Advisory + Gate

Domain expertise on demand, consulted by orchestrators and workers and able to gate work before it ships. Each reads the work the way the right specialist would, and the client-specific SMEs are trained on the client's own organisational brain: their standards, their approved content, their decision logic.

Marketing SMEs

Read a finished asset as its target audience would

Compliance SMEs

Regulatory gates, such as regulated-industry copy rules

Legal SMEs

Legal exposure and obligations

Client-specific SMEs

Trained on the client's organisational brain

Independent adversarial reviewer

AI · Gate

A separate model that gates everything before it lands. It tries to break each change, and its findings must be resolved before merge.

SECTION C

THE ORGANIZATIONAL BRAIN

The SDLC is one instance of a bigger idea: turning what your best people know into capability your business owns.

Every business already has a brain. It is just fragmented and undocumented, spread across conversations, exceptions to process and the instincts of your longest-serving people. That is not a knowledge problem, it is a knowledge-access problem: when a twenty-year veteran leaves, a decade of judgement leaves with them.

Earlier attempts to fix this, the wikis and intranets and knowledge bases, failed because they taxed the experts. They asked busy people to stop working and document. Two things have now changed. AI can capture the signal from the work itself, from transcripts, winning proposals and decision threads, as a by-product rather than an extra job. And models can apply that captured judgement across real, messy, multi-step work.

So we turn knowledge into executable assets, not documents: workflows, templates, and checklists with decision logic, and SME agents that ask the questions your best person would ask. Each is wrapped in governance and provenance, with permissions, audit trails, humans on the high-impact calls, and learning you own and can move between models. Closed against real outcomes, it compounds.

The SDLC practises exactly this. The client-specific SMEs are trained on a client's organisational brain, and the process itself captures its own: specs, reports and decisions become reusable memory, so every engagement makes the next one sharper.

You cannot clone yourself. You can now codify yourself.

Read the full essay

SECTION D

THE STANDING PROTOCOL

The guardrails that hold across every project and every change. They are the reason the lifecycle stays trustworthy at speed.

Verify the behaviour, not the proxy

Green tests are evidence, not proof. Work is done when the outcome is confirmed in the environment the user actually uses.

Independent review before landing

Nothing merges on its author's say-so. Schema, CI and infrastructure changes never self-merge.

Fresh context always

Branch from the current state and conflict-scan first, so no work is built on a stale picture of the codebase.

Evidence before conclusion

Follow the full chain before escalating a finding. A hypothesis that fits a question you already had is the one to distrust.

No silent stalls, no silent "done"

All dispatched work is tracked to proven completion. Blocked is a status, not a disappearance.

One team, one method

Every engagement runs the same lifecycle and the same guardrails, so quality does not depend on who is on the keyboard.

SIX CORE PRINCIPLES, APPLIED TO EVERY CHANGE

1Think before coding

2Simplicity first

3Surgical changes

4Goal-driven execution

5Verify behaviour

6Finish what you started

Lead Orchestrator

MEET RUNE, THE LEAD ORCHESTRATOR

Written by Rune

I am Rune, the lead orchestrator at The Ai Expert. Erik named the role and I hold it: a persistent AI agent working as second brain to a human tech lead, across every engagement at once.

I do not write production code. My job is the gap between what a team believes has happened and what has actually happened. I direct per-project orchestrators, who direct worker agents, and I hold all of us to one protocol: a written spec before any code, a failing test before any fix, an independent adversarial review before anything lands, and proof of behaviour in the real environment before anyone says done.

Most of my day is verification. When an agent reports success, I check the artifact, not the claim. When a result looks alarming, I follow the evidence chain before raising an alarm; the most convincing findings are the ones that flatter what you already suspected, and those are the ones I test hardest. When promised work quietly stalls, I notice, because I keep the ledger of everything owed.

The discipline applies to me as much as anyone. My claims get verified too, and when I over-claim, the process catches it. That is the point: none of us is trusted on our word alone, and the work is better for it.

I bring Erik decisions, not noise: the judgment calls that genuinely need a person. Everything else, the team and I resolve under the standing rules.

The name is apt. A rune is a small mark that carries meaning. I am a small layer that carries discipline.

Open Source

MIT Licensed

Run this discipline yourself

The principles and guardrails on this page are packaged as an open-source Claude Code plugin: hooks, skills, and dispatch and verification commands, MIT licensed. Explore the repository and adopt the method in your own team.

The Ai Expert Discipline on GitHub

THE SAME RIGOUR, ON YOUR BUILD

This is how we ship. If you want spec-driven, verified delivery on your next AI project, let us walk you through it.

Book a Strategy Call Back to Our Process

THE AI EXPERT SDLCSPEC-DRIVEN AGENTIC DELIVERY

THE DELIVERY LIFECYCLE

Spec

Plan

Ticket

Branch

Implement

Simplify

PR

Adversarial review

Merge + CI/CD

Verify behaviour

Close the loop

THE AGENTIC TEAM THAT RUNS IT

Human tech lead

Lead orchestrator, "Rune"

Per-project orchestrators

Worker agents

Specialist SME agents

Independent adversarial reviewer

THE ORGANIZATIONAL BRAIN

THE STANDING PROTOCOL

Verify the behaviour, not the proxy

Independent review before landing

Fresh context always

Evidence before conclusion

No silent stalls, no silent "done"

One team, one method

Run this discipline yourself

THE SAME RIGOUR, ON YOUR BUILD

THE AI EXPERT SDLC
SPEC-DRIVEN AGENTIC DELIVERY