AINA AgentOps · Retro Agentic Orchestration 2026-06-29

The Autonomous-Org Operating Style

How eight sessions over seven weeks converged on one way to run an agent org — and the gotchas each one paid for.

Ali Mehdi Mukadam · co-authored with Claude · 6 min read · paired with aina-org-session-lineage-2026-06-29.md
The Single Idea

The operating style was never designed in one sitting — it converged. Across ~8 sessions from mid-May to late-June 2026, each paid for one more piece of the fix. They all point at four moves: Codex builds, the lead orchestrates, watchers heal, and the founder reviews outcomes — not internals.

01

The lineage — eight sessions

Each session in this arc delegated a build to a fleet, hit a wall, and bolted on one piece of the fix. Read top-to-bottom, the column on the right is the operating style assembling itself.

DateSessionWhat it contributed
05-11Paperclip postmortem → Runfusion designFirst hard postmortem of the multi-agent build experiment + a next-gen conductor design
05-31Assessment of Agentic Work ControlNamed the core gap — the autonomous loop existed as a goal, but live work still leaned on manual coordination across chats, branches, PRs, Linear, Beads, RunFusion
06-01PKM Watchdog dispatch + remediationThe self-healing-watcher lineage: codex-driven watchdog, health checks, failure-aware notifications
06-02Multi-agent Workflow gotchas (learning)The Claude+Codex split with Codex as second-opinion; the non-obvious gotchas
06-07Safe autonomous agent-lane orchestration (learning)The release train; the surface-only-PR failure → contract propagation, cross-engine review, fail-closed-merge, reversibility-replaces-review
06-08Codex-heavy 80/20 salvage (learning)The model-routing style: Codex carries the token load, Claude only orchestrates
06-29PKM Orchestration deep diveThis build's immediate predecessor — pivoted to Paperclip/Hermes as delegated, Codex-governed agents
06-29Hermes↔Paperclip bridge + Donna-autonomousMade Donna act on Paperclip work; gpt-5.5 / openai-codex for the bridge, explicitly not Claude
06-29This session — factory consolidationCOO anti-illusion 2IC, dept-heads-own-goals, deterministic dispatch, canon-lock, watchdog auto-heal
02

How they work — the convergence

The shape repeats every time: a human delegates a build to a fleet, the early attempts produce surface-only or illusion output, the session adds one piece of the fix, and the style converges. By this session, the four moves are stable enough to name.

8 SESSIONS · MAY→JUN 05-11 · Postmortem 05-31 · Named the gap 06-01 · Watchdog 06-02 · CC split 06-07 · Lane harness 06-08 · 80/20 routing 06-29 · Deep dive 06-29 · Consolidation Codex builds Lead orchestrates Watchers heal Founder reviews outcomes An org that holds momentum
Eight sessions → four stable moves → one outcome. Each session contributed one move and paid one gotcha.
03

The four moves

01

Codex builds

gpt-5.5 in worktrees; 5.4-mini for light work; never spark. Opus never burns tokens on the build.

02

Lead orchestrates

Scopes, verifies, surfaces, owns the contract. Coordinates the fleet — doesn't do its work.

03

Watchers heal

Git-init, relaunch, auto-resume, escalate. Detection without action is a non-fix.

04

Founder reviews outcomes

URLs and decisions, not paths and PRs. Reversible work never waits for approval.

A loop fired on schedule, spent the tokens, and shipped nothing — every dashboard green.

04

The seven operating patterns

The reusable form — landed as a docs/solutions learning in the PKM monorepo:

  1. Orchestrate; don't build. Keep the expensive model on judgment; let the cheap fleet do volume.
  2. Deterministic dispatch beats heartbeat:invoke. Round-robin real lanes across team goals with caps + an idempotent lock, instead of a no-op wake that only looks like work.
  3. Watchers heal, not just detect. The watchdog acts: git-init, relaunch, auto-resume errored agents, escalate a critical-role-down.
  4. Add a COO as the anti-illusion second-in-command. One Paperclip-native agent owns utilization, capacity, allocation — and "don't get the illusion everything is running."
  5. Department heads own the goals; retire the routing middleman. Goals go straight to the heads; the COO oversees; Donna monitors/assists; Frill carries ideas/roadmap.
  6. Canon-lock before you fan out. Lock the spec, append every decision as it's made, back it up, mark provenance. Drift is the most expensive failure.
  7. Run continuously; surface only decisions; stay founder-readable. A 15-min watchdog + a self-pacing wakeup loop; plain English; "done" means Landed.
05

The gotchas each one paid for

The recurring failure modes — the "never repeat" reference set. Each is a tax a past session already paid so this one didn't have to.

The illusion of running
Loop fired, spent tokens, zero builds — a no-op wake. deterministic dispatch.
Detection without action
Watchers all "healthy" while nothing shipped. watchdog auto-heal stage.
Spark quota cascade
A stray codex-spark leaked into launchers, exhausted quota, took down a CEO agent. 5.5 / 5.4-mini, never spark.
ssh exit 255
Long commands / broad pkill. prewarm ControlMaster, base64-pipe, kill by PID.
printf parse error
Asterisks in canon text. heredoc / base64-pipe, never printf of prose.
Paperclip --company-id rejected
On comment / update / instructions. drop the flag; board scopes it.
Gateway restart killed workers
Restarted while the board was active. restart only when idle.
Compaction-amnesia
Next turn rebuilt context from scratch. read guardrails + memory first.
Wrong-repo build
ainativeplatform = OLD. verify repo identity against canon.
False-done
Recoverable ≠ done. committed + pushed + integrated, or it isn't Landed.

"Running" is proven by output, not uptime.

Codex builds; the lead judges.

Detect-and-heal, never detect-and-report.

Canon-lock kills drift before it spends.

The founder reads URLs, not internals.

"Done" means Landed — or it isn't done.

06

Where to start

If you read one thing

"Running" is proven by output, not uptime. Every other pattern in this lineage is a way of making that true.