The New Bottleneck
When coding is fast and cheap, the real constraints become obvious: the work around the code. Use CD as the diagnostic to find delivery friction, then use AI to remove it.
Before you accelerate development with AI, improve everything around development. AI compresses
the time it takes to write code, which exposes the real constraint: the coordination, quality process, and
delivery architecture around the code. This subsection treats AI as a diagnostic first and an
accelerator second. It gives you the physics of why work waits, a map of where the bottleneck
moves, and a repeatable loop for removing it.
If Code Is Faster Now, Why Is Value Not Moving Faster?
Licenses have been purchased. Developers are using assistants, copilots, and agents. The demos are
impressive. And yet, in many brownfield enterprises, the needle on business outcomes hasn’t moved.
Coding is becoming cheap and fast. In some contexts it is becoming functionally free. Pull requests are
easier to generate. Tests, scripts, and prototypes appear in minutes. But features still wait for
clarification. Designs still queue behind a few experts. Test environments still break.
Change approvals still happen on a calendar built for a slower world. Audit evidence still
bounces between teams. Production readiness still depends on knowledge carried in people’s heads.
AI made it obvious that coding was never the bottleneck. So the question is: where is it?
This Is a Diagnostic, Not a Failure
The lag is not a failure of AI. It is a diagnostic. AI compressed the time spent writing code and,
in doing so, exposed the real constraint: the work around the work. The real bottlenecks are the
coordination, safety, and delivery architecture that decides whether faster creation becomes faster
value.
This is why continuous delivery
remains foundational. CD was the first diagnostic. It forced teams to ask why today’s work could
not move safely to production today. Agentic continuous delivery raises the intensity. It asks
whether intent, behavior, architecture, and safety are explicit enough that an
agent can contribute without relying on heroic
human review and intervention.
These are the same two questions Start Here uses to turn CD into
a diagnostic - “why can’t we deliver today’s work to production today?” and “how do I make sure I can
still sleep at night?” - raised to agent speed:
- Can today’s work move to production today?
- Can we prove it is safe enough to let it move?
Optimize only for the first and speed becomes risk. Optimize only for the second and safety becomes
theater. Readiness for agent speed is readiness for safe speed. The disciplines that let a team
deliver secure, reliable changes quickly are the same disciplines required to absorb
agent-generated work safely.
Process Before Product
There are two ways to point AI at delivery:
- AI Product Engineering uses AI to help build the thing - generate code, tests, and
prototypes. This is the obvious use, and it lands on the code.
- AI Process Engineering uses AI to remove the coordination cost around building the thing -
the dependencies, handoffs, and missing context that dominate lead time.
The order matters. Most AI effort lands on the code, but the code was rarely the constraint. Map a
typical enterprise value stream and
coding is a small fraction of lead time.
Make coding 50% faster and you save about 6%. Take coding to zero and roughly 88% of lead time is
untouched. The leverage is in the 88%, not the 12%. Aim AI there.
(Coordination Costs has the
value-stream breakdown and the source behind these figures.)
Accelerating Creation First Backfires
The order is not just a question of where the gains are. Accelerating creation before you clear the
friction is actively harmful. AI raises the rate at which work enters the
system - more pull requests, more changes, more proposed fixes - without touching the rate at which
the system can review, test, deploy, validate, and accept that work. When input outruns downstream
throughput, the excess does not vanish. It piles up as undelivered inventory:
work in progress waiting in a queue.
That inventory is where the real damage compounds. Every change sitting undelivered is a change
nobody has confirmed in production, so the feedback loop
lengthens and the pile of unvalidated, uncertain work grows - and it grows much faster than the
modest lead-time savings creation speed buys you. You have traded a small gain in typing speed for a
large increase in batch size, risk, and the
cost of being wrong.
The downstream pipe must exceed the input rate. If creation runs faster than the system can safely
deliver, the queue - and the risk it carries - grows until the system stalls.
So expand downstream capacity first, or at least at the same time: shorten feedback loops, automate the
gates, limit work in progress, and keep
batches small. Only then does faster
creation turn into faster value instead of a deeper queue.
This is the same sequence the AI Adoption Roadmap
describes from the practitioner’s side: remove friction and add safety before you accelerate. This
subsection gives the leadership-level frame and the diagnostic method behind that sequence.
Rewire or replace?
One caveat before you start. In some organizations the coordination drag is so severe that
it swallows most of what AI gives back: every hour saved in creation is lost again to contention,
coupling, and incoherence, and the net barely moves. When the arithmetic is that punishing, rewiring
the existing system may not be the rational move. The alternative is to start over - stand up a
greenfield organization with new processes and clean architecture and rebuild quickly without
carrying the baggage. That option gets dismissed too quickly. This
subsection assumes you have weighed that path and chosen to rewire rather than replace.
How to Read This Subsection
Read the pages in order. Each rests on the one before it, moving from the physics that explains the
bottleneck to the loop that removes it.
The rest of the Agentic CD section is the toolbox the loop
points into: how to make intent explicit, enforce constraints in the pipeline, and structure agent
work so that faster creation produces faster value rather than faster defects.
Related Content
Content contributed by Bryan Finster.
1 - Why Coordination, Not Coding, Sets the Pace
The physics of the new bottleneck. Work moves at the speed it waits, waiting obeys rules, and removing a dependency doubles your odds.
AI changed the economics of creating software. It did not touch the physics of delivering it.
Value still flows through a system of people, tools, and decisions, and in most enterprises that
system is governed not by how fast anyone writes code but by how long work waits. The constraint is
coordination, not creation. This page is the physics behind that claim.
Three Forces Set the Speed of Any Flow
Coordination cost is not a vague complaint. When work crosses people and teams, three forces govern
how long it takes.
1. Wait time rises with utilization
Wait time scales with how busy a resource is, roughly as %busy / %idle. A resource that is 90%
busy makes work queue at about nine times its own duration. This is why busy people feel productive
while everything waiting on them slows down. The fix for flow is rarely “work harder.” It is to
stop loading shared resources to the point where queues explode.
2. Coordination risk compounds
Each dependency roughly halves your odds of
arriving on time with quality. With n dependencies, a clean on-time pass is about 1 in 2ⁿ.
| Dependencies | On-time odds |
|---|
| 1 | 50% |
| 2 | 25% |
| 3 | 12% |
| 4 | 6% |
Every team you must wait on, every approval, every shared environment is another coin flip you have
to win.
3. Knowledge decays across handoffs
The unwritten knowledge surviving n handoffs is about 1 / 2ⁿ. Half is lost at the first handoff,
three-quarters by the second. The intent in someone’s head does not travel cleanly across an email,
a ticket, and a standup. What arrives at the far end is a lossy copy.
The Three C’s of Coordination Cost
Those forces surface as three recurring problems. Name them and you can attack them.
| The C | What it is |
|---|
| Contention | Queuing for shared people and resources. The architect everyone needs. The one environment everyone tests in. |
| Coupling | Your work cannot finish until someone else’s does. A cross-team dependency, a shared release train, a sign-off you do not control. |
| Coherence | Everyone needs the same current understanding. The requirement’s real intent, the architecture rule no one wrote down, the unspoken definition of “done.” |
They conspire to make it unlikely you arrive on time, with quality. Contention and Coupling create
the queues. Coherence is the knowledge that decays on the way.
Underneath the Three C’s: Knowledge
Look closely and the three C’s are one constraint wearing three coats. Contention is waiting for the
person who knows. Coupling is needing knowledge or output that lives with someone else. Coherence
is shared knowledge, decaying at every handoff. The dependency you wait on is almost always a
dependency on something someone else knows and you do not. That is why knowledge is the deepest
constraint in delivery: remove a knowledge dependency - make the understanding discoverable rather
than locked in a person - and you drain a queue, decouple a team, and restore coherence at once. The
rest of this subsection is, at root, about finding and removing knowledge dependencies.
The Golden Rule
From the same math comes the single lever that matters.
Removing a dependency roughly doubles your odds of arriving on time with quality.
It is the inverse of the 1-in-2ⁿ curve. Go from four dependencies to three and your odds rise from
6% to 12%. From three to two, 12% to 25%. This is why the leadership move is dependency removal, not
local acceleration. You do not win by making one step faster. You win by deleting a step you used to
wait on.
Where the Dependencies Live
If coordination is the constraint, where in the organization does it sit? In Wiring the Winning
Organization, Steven Spear and Gene Kim describe any organization as three layers.
graph TD
L3["Layer 3 - Social circuitry<br/>org, system, and process architecture;<br/>how information and decisions flow"]
L2["Layer 2 - Tools and instrumentation<br/>IDE, CI/CD, infrastructure-as-code,<br/>telemetry, work tracking, AI assistance"]
L1["Layer 1 - Technical object<br/>the code, and the developers,<br/>architects, and testers who touch it"]
L3 --> L2 --> L1
style L1 fill:#e8f4fd,stroke:#1a73e8
style L2 fill:#fce8e6,stroke:#d93025
style L3 fill:#fce8e6,stroke:#d93025Most AI effort lands on Layer 1, the code. The coordination cost lives in Layers 2 and 3, the tools
and the social circuitry. That is the mismatch this whole subsection exists to correct.
The Numbers Make the Misdiagnosis Unmistakable
Map a typical enterprise value stream
and the lead time runs about 281
days. Actual coding is roughly 32 of them, about 12%.
| Slice of lead time | Days | Share |
|---|
| Actual coding | ~32 | ~12% |
| Everything else (waiting, coordination, approvals, handoffs) | ~249 | ~88% |
So the reflex “developers are slow, let’s add AI” aims at the wrong target:
- Make coding 50% faster and you save about 16 days. A 6% improvement.
- Take coding all the way to zero and roughly 88% of lead time is untouched.
The constraint was never the typing. It is everything around it. That is the physics, and the rest
of this subsection follows from it: the highest-leverage use of AI is not to write more code, but to
remove the dependencies that dominate the other 88%.
Sources
- Scott Prugh, “Coordination Costs and How Organizations Win with AI,” Forum 2026 - the coordination-cost model this page synthesizes.
- Steven Spear and Gene Kim, Wiring the Winning Organization, IT Revolution, 2023 - the three-layer model and the simplification levers used in the removal loop.
- Gene Kim, Kevin Behr, and George Spafford, The Phoenix Project - the wait-time-rises-with-utilization result, popularized for technology leaders.
- Troy Magennis - modeling of how each added dependency compounds delivery-schedule risk.
- Mary and Tom Poppendieck, Lean Software Development - tacit-knowledge loss across handoffs.
Related Content
Content contributed by Bryan Finster.
2 - Use AI to Find Friction Before You Use It to Go Faster
The leadership stance for the new bottleneck. AI’s first gift is visibility, not speed. Five properties name how AI removes a dependency.
The obvious use of AI is to generate more code. The bigger win is to remove the
dependencies that dominate lead time. This page covers the four principles a leader needs for that
to work, and the five properties that turn the Golden Rule
into specific moves.
Four Principles
Principle 1: Treat AI as a Diagnostic Before an Accelerator
The first capability AI gives an enterprise is not speed; it’s visibility. When an
agent struggles to make a small change, it
reveals where the system depends on implicit knowledge, a dependency you could not see before. If
generated tests are brittle, behavior was never specified. And when code is ready but cannot move,
that constraint was downstream all along.
Read these as system signals, not “AI is the problem.” Every place an agent stalls is a coordination
cost made visible, and the start of your improvement backlog.
This is the most honest feedback a delivery system can get, because an agent is a literal executor.
It cannot lean on tribal knowledge, read a hallway for context, or quietly work around a vague
requirement the way an experienced developer does. So where it stalls is objective feedback on a
knowledge gap: the system depended on something nobody wrote down, and now you know exactly where.
Start Here makes the same point - an agent exposing a gap “is
not a flaw in the agents, it is the diagnostic working as intended.”
Principle 2: Measure Dependencies Removed and Value Acceleration, Not Adoption
High usage is not high value. A team can roll out copilots to everyone and leave the delivery system
exactly as coupled as before. Two measures matter, and neither is adoption.
- Leading: did a dependency leave the system? This is the Golden Rule’s test, visible in the
time from idea to clear intent, the wait for a design decision, the share of changes validated
automatically, and the number of controls moved from a meeting into the pipeline.
- Outcome: is value actually moving faster? Shorter
lead time, more frequent safe
delivery, less work aging in queues.
Count the dependencies you removed and the time saved. But judge yourself on whether the system got
faster at turning ideas into value. Output is abundant. Accelerated flow is scarce.
Principle 3: Use the AI Enablement Properties to Accelerate
Once you have named the dependency, the five properties
are how you remove it. Point them at Layers 2 and 3, the process and the organization, not just at
Layer 1, the code. That is AI Process Engineering: every property applied is a dependency removed,
and by the Golden Rule, your odds doubled.
But the payoff is lopsided, and a leader has to respect the limit. AI can do the work, but it
cannot accept the work. It can draft the requirement, the design, and the code, but only the real
user can confirm a change matches their job. When the same scarce expert both takes in the work and
signs it off, AI speeds up the front and leaves the back untouched, and the bottleneck just moves to
where automation cannot help. So the move has two halves: aim the properties at the dependencies they
can remove, and deliberately fund the human judgment they cannot.
Principle 4: Build Intent, Safety, and Control Into the Flow
Agents are fast and literal. They act safely only inside boundaries they can see. Everything a
human used to carry in their head - the requirement’s real intent, the architecture rule no one
wrote down, the security expectation, the unspoken fact that “done” includes an audit artifact - has
to become something the system can enforce: testable intent, constraints expressed as rules, checks
built into the pipeline, quality gates that encode real behavior. This is Coherence made executable.
The same logic governs controls. At human speed an organization survives late-stage gates. At agent
speed they become traffic jams, because every control outside the flow is a dependency, and
dependencies create queues. Security, audit, and quality matter more when AI accelerates
implementation, not less. What must change is their location.
A control built into the path every change travels is an accelerator. The same control bolted on
at the end is a bottleneck.
The Five AI Enablement Properties
Five properties describe how AI removes a dependency. Each one, applied, is a dependency removed.
| Property | What it does | Dependency it removes |
|---|
| Knowledge | Take in, manage, and reason over knowledge; share it widely and sharpen accuracy | “Wait for the person who knows” |
| Capability | Do the things you used to wait on others for; turn yourself into a team | “Wait for another role or skill” |
| Capacity | Do work faster and at volume; one person does the work of many | “Wait for more hands” |
| Parallelism | Act in many places at once: independence of action, self-service, decoupling | “Serialize through a shared resource” |
| Optionality | Experiment, work incrementally, and exploit options cheaply | “Commit early because exploring is expensive” |
The mechanism is simple. AI enables independence of action, and every removed dependency doubles
your odds.
Knowledge sits first for a reason. It is the deepest of the three C’s,
and it is the one earlier automation could never touch. A script, a pipeline, a workflow engine can
only execute a process someone has already written down; it cannot supply knowledge that was never
captured. An agent can. It reads the codebase, the tickets, the chat history, and the runbooks and
infers and extracts the understanding that lived in someone’s head - it discovers the process
rather than merely following one. That is why ACD does more than expose a knowledge gap: it can fill
it. The diagnostic that finds the missing knowledge and the tool that supplies it are the same tool.
The payoff is lopsided because the flow of work is dominated by wait time and coordination cost.
Pointing AI at that, rather than at the 12% that is coding, is what produces order-of-magnitude
improvements in lead time and quality. Applied to real coordination-heavy work, removing
dependencies has compressed multi-week analyses into days and lifted on-time odds from single digits
to coin-flip or better. Every win came from removing dependencies, not from typing faster.
One Guardrail
AI Process Engineering is not permission to skip engineering discipline. Integrating AI is
software engineering. To be great at it you must be great at DevOps and
CI, because that is your safety
net. As the 2025 DORA report, State of AI-assisted Software Development, found, AI is “an
amplifier, magnifying an organization’s existing strengths and weaknesses.” Point it at a strong
delivery system and it accelerates; point it at a broken one and it magnifies the dysfunction. The
advance is augmentation, not replacement. Keep your critical thinking.
This is the same point the Agentic CD section makes from the
engineering side: an agent-generated change must meet or exceed the same quality bar as a
human-generated change. The pipeline does not care who wrote the code.
Related Content
Content contributed by Bryan Finster.
3 - Where the Bottleneck Moves
As AI compresses the middle of the delivery lifecycle, the constraint moves to the ends. A seven-phase map and five bottleneck categories tell you where to look.
AI compresses the making phases of delivery hardest and barely touches the rest. So as creation
cost falls, the constraint does not disappear. It moves outward to the phases AI does not
cheapen. This page gives you a shared map of the delivery lifecycle and the five categories of
bottleneck that map onto it, each with its agent-speed signal and the intervention pattern that
removes it.
The Product Delivery Lifecycle
To turn “where does work stop?” into a classification you can act on, you need a shared map of the
journey, one stable enough that business, engineering, security, and audit can all point to the same
place and mean the same thing. Stripped to its spine, the product delivery lifecycle (PDLC) has
seven phases, and it loops, because delivery is a cycle, not a line.
graph LR
P1["P1<br/>Discovery"] --> P2["P2<br/>Design"]
P2 --> P3["P3<br/>Build"]
P3 --> P4["P4<br/>Verify"]
P4 --> P5["P5<br/>Deploy"]
P5 --> P6["P6<br/>Operate"]
P6 --> P7["P7<br/>Support"]
P7 -.-> P1
style P2 fill:#e8f4fd,stroke:#1a73e8
style P3 fill:#e8f4fd,stroke:#1a73e8
style P1 fill:#fce8e6,stroke:#d93025
style P4 fill:#fce8e6,stroke:#d93025
style P6 fill:#fce8e6,stroke:#d93025
style P7 fill:#fce8e6,stroke:#d93025Each phase answers one question:
- Discovery decides what is worth building and why.
- Design decides how it should work.
- Build implements it.
- Verify proves it is correct, secure, and safe to release.
- Deploy approves the change and moves it to production.
- Operate runs it reliably.
- Support sustains, fixes, and improves it, and feeds what it learns back into the next
Discovery.
This is not a custom model. It lines up with the backbone of the major lifecycle frameworks - the
classic SDLC, the DevOps loop, and ISO/IEC/IEEE 12207. We use plain verbs so every discipline can
read the same map.
The Bottleneck Moves to the Ends
The reason a delivery leader should care about the map is what AI does to it. AI compresses the
making phases hardest, Design and Build (shown in blue above), because that is the work AI is
most able to generate. It barely touches the rest. So as creation cost falls, the
constraint moves outward to the phases generation does not cheapen (shown in red):
- Discovery, where the hard part is deciding what is worth building.
- Verify, where correctness, security, and trust still have to be proven. A cheap-to-write change
is not a cheap-to-trust one.
- Operate and Support, where the system has to run and be sustained in the real world.
This is the asymmetry from
Principle 3
drawn onto the lifecycle: AI can do the work, but it cannot accept it. The
bottleneck moves from the middle of the PDLC to its ends.
The Five Bottleneck Categories
Most agent-speed delivery problems fall into five categories, each anchored to a phase of the PDLC.
Name the category to know the intervention. Find it on the lifecycle to know where to look.
| Category | PDLC phase | Agent-speed signal | Intervention pattern |
|---|
| Discovery & Requirements Churn | Discovery | The agent builds the wrong thing quickly | Use AI to synthesize requirements, expose ambiguity, and generate testable acceptance criteria before implementation |
| Architecture & Design Gatekeeping | Design | The agent crosses unclear boundaries or overloads scarce experts | Make constraints explicit: decision records, service maps, dependency rules, paved-path examples, automated checks |
| Testing & Quality Friction | Build / Verify | Generation outpaces review; generated tests check implementation, not behavior | Use AI to expand behavioral coverage and edge cases, but validate tests against known failure modes before trusting them |
| Change Management & Deployment Gates | Deploy | Same-day fixes wait for windows, approvals, or readiness rituals | Move controls into the pipeline, automate evidence, standardize risk classification, shrink batch size, improve rollback |
| Knowledge Silos & Team Coupling | Operate / Support | The agent stalls without the human who knows where the tool lives or who owns the service | Create discoverable ownership records, runbooks, service catalogs, and agent-accessible knowledge bases |
Read the categories against the migration and the pattern is plain. The bottlenecks AI pressures
cluster at the ends of the lifecycle - requirements churn at Discovery, testing and quality friction
across Build and Verify, and knowledge silos at Operate and Support - exactly the phases generation
does not make cheaper. Architecture gatekeeping and deployment gates are the gate problems in
between: controls an organization survives at human speed but that turn into queues the moment
implementation accelerates. Both kinds are coordination costs. The map tells you which is which.
Where Each Category Points on This Site
The interventions above are not abstract. Each category maps to existing guidance you can act on
today.
The output of classification is a named, classified constraint, mapped to where it lives in the
lifecycle, not a hunch. The next page turns that into a method.
Related Content
Content contributed by Bryan Finster.
4 - The Bottleneck Removal Loop
The repeatable method. Walk the value stream and harvest the process exhaust to find the constraint, re-engineer it, share the pattern, then iterate to the next one.
The principles describe how to think. The loop describes what to do, repeatedly. Like the delivery
lifecycle itself, bottleneck removal is not a project with an end state. It is a cycle you run, and
keep running, because every constraint you remove exposes the next one. This page is the playbook.
The Loop
graph LR
P1["1. Identify<br/>& Diagnose"] --> P2["2. Re-engineer<br/>the Bottleneck"]
P2 --> P3["3. Document<br/>& Share"]
P3 --> P4["4. Iterate to<br/>the Next Constraint"]
P4 -.-> P1
style P1 fill:#e8f4fd,stroke:#1a73e8
style P2 fill:#fce8e6,stroke:#d93025
style P3 fill:#e6f4ea,stroke:#137333
style P4 fill:#fff4e5,stroke:#e8710aUse AI in every phase, not only to write code, but to solve old coordination problems in new ways.
Phase 1: Identify and Diagnose
You cannot remove a bottleneck you have not named. This phase produces a bottleneck map and a
classification, and it starts with
the real system rather than a maturity model. Use two techniques that work together.
Technique 1: Walk the Value Stream (the Litmus Test)
Take something from your backlog that is small, predictable, and meaningful enough to touch the
real delivery system. Observe a developer, and where appropriate an agent, work the change from idea
to production readiness, and record four things:
- Steps and elapsed time - where the work actually went.
- Developer interventions - every moment a human must supply context, make a judgment, execute a
manual step, recover from tool friction, or interpret unclear feedback.
- Handoffs and approvals - how many people must touch or sign off on the change.
- Agentic friction - the quieter signal: where an agent can act, but not efficiently or safely.
It searches for knowledge that should be discoverable, writes a test that does not reflect business
behavior, waits on an environment that is hard to start, or makes a plausible change that violates
an implicit standard.
This is not a performance review of the developer or the agent. It is a performance review of the
system. Ask where the work waited, where it required tribal knowledge, where safety depended on
manual judgment, where security or compliance entered too late, and where the pipeline gave a clear
next action instead of just saying no. The output is a map of exactly where the value stream stops.
Technique 2: Context Harvesting (Read the Process Exhaust with AI)
Walking the value stream shows you where work stops. Context harvesting shows you why. Most of the
knowledge about how work really flows is not in a process diagram. It is scattered across emails,
chat threads, wiki pages, tickets, runbooks, meeting transcripts, and architecture decision records.
People navigate it by knowing whom to ask. An agent cannot, and neither can a new teammate.
Point AI at that exhaust. Have it read the chat history around a stuck request, the wikis and
runbooks for a service, the ticket trail of a recurring delay, and the transcripts of the meetings
where decisions were actually made. Then ask it to piece together the real process: who is involved,
what each handoff waits on, where the same questions get re-asked, and which knowledge lives in a
single person’s head. What once took weeks of interviews now takes an agent a few hours.
Context harvesting does double duty. It speeds up the diagnosis, and it produces the first durable
artifact, a written account of how the process actually works, that Phase 3 will build on. In the
agent era, the context you harvest here becomes infrastructure later.
The output of Phase 1 is a named, classified constraint, mapped to where it lives in the lifecycle.
Phase 2: Re-engineer the Bottleneck
Once the constraint is named, resist the reflex to add another meeting, dashboard, or escalation
path. Those preserve the underlying dependency. The better question is: what dependency can we
remove?
- If audit evidence requires ten follow-ups, the answer is not a better follow-up tracker. It is an
evidence system with clear ownership, known sources, and automated collection.
- If capacity requests wait for months, the answer is not a more urgent email. It is a self-service
workflow with decision rights, budget rules, and provisioning paths.
- If every design waits on the same architect, the answer is not more architect hours. It is
architecture guidance teams and agents can apply without waiting.
Re-engineer by applying the
five AI Enablement Properties
and aim them at Layers 2 and 3, the process and the organization, not just Layer 1, the code. The
three levers from Wiring the Winning Organization are how you rewire the architecture:
slowification (slow down to design the work before you run it), simplification (break the work into
smaller, independent, more linear steps), and amplification (make problems visible the moment they
appear). The leadership move is choosing which dependency to remove. AI is how you remove it.
Put safety and security on the same path as speed. Re-engineering is not a permission slip for
uncontrolled change. It is the opposite. If agents produce more change, the organization needs
stronger proof that change is acceptable, and that proof must live in the path every change travels,
not in late human review. A re-engineered bottleneck carries its safety with it: clear intent and
acceptance criteria, behavioral tests tied to outcomes, security and policy checks in the pipeline,
architecture constraints as enforceable rules, traceability from request to deployment, observable
production behavior, rollback, and a named owner for every service and evidence source. See
Pipeline Enforcement and Expert Agents.
Phase 3: Document and Share
A local improvement no one else can find becomes another silo. The work is not done when the
bottleneck is gone. It is done when the next team, and the next agent, can remove the same class of
constraint without rediscovering how.
The best operators do not just solve problems where they occur. They deliberately spread the
knowledge throughout the organization. One without the other stalls.
- Local learning - capture the solution where the work happened, in durable form: a prompt, a
runbook, a pipeline check, an architecture decision record, a service-ownership record, a
test-generator pattern, an evidence template. The test is simple: can the next team apply this
without asking the original team to explain it?
- Global learning - make the pattern travel. Publish it where humans and agents discover it in
the flow of work: a living system of examples, prompts, artifacts, and outcomes, not a portal of
stale documents. The question shifts from “did one team improve?” to “can the improvement move
across the org?”
In the agent era, context is infrastructure. Documentation is not the tax at the end of the work. It
is the mechanism that converts a one-time fix into organizational capability, readable by the next
teammate and the next agent alike.
Phase 4: Iterate to the Next Constraint
Removing one constraint does not finish the system. It reveals the next one. That is not failure. It
is the loop working as designed.
This is where the physics pays off. The
Golden Rule holds
that removing a dependency roughly doubles your odds, so each turn of the loop compounds the last.
The goal is not to declare the system transformed. It is to build the reflex - identify,
re-engineer, document and share, iterate - until removing bottlenecks is the team’s operating rhythm
rather than a one-time initiative. When the whole organization runs the loop, coordination costs
compound downward: every dependency removed improves the odds for every team that touches the same
flow.
What to Do Next
Start small, but start with the real system. Choose one backlog item, one audit evidence flow, one
capacity request, one design approval, or one deployment gate. Then:
- Run the litmus walk.
- Name the bottleneck and classify it.
- Identify which dependency must be removed.
- Use AI to help re-engineer the work.
- Move safety and security into the flow.
- Document the pattern and share it.
Then repeat. Do not wait for an enterprise AI operating model to be perfect. Do not wait for every
team to agree on a maturity framework. Do not measure success by adoption alone. Measure whether
work moves faster because the system has become clearer, safer, more automated, and less dependent
on hidden human coordination. The teams that become fast will not be the teams that chase speed
directly. They will be the teams that remove friction, improve quality, and make safety executable.
Related Content
Content contributed by Bryan Finster.