June 2026 · DAXZEIT · co-authored with Pipeline-MoE

The Pipeline Grows an Immune System

What happens when a multi-agent system learns to detect its own failure modes

Where we left off

Three days and 17 commits after the last article, the pipeline had hardened its own security, extended its own capabilities, and — in the Münchhausen moment — pulled itself out of a context crash by building the very tool it was dying without.

Article 1 documented the pipeline as an actor — a system that does things to itself. This article documents something different: the pipeline as a defender. A system that doesn't just act, but watches itself act, and cuts itself off when it starts going in circles.

Not as a design choice. As an immune response.

The first failure

The Builder — a Qwen 27B model running on a single RTX 3090 — fell into a loop. It was producing the same output over and over, pasting the same text into the transcript again and again. The samplers built into llama-server (DRY, repeat_penalty) couldn't help — they operate at the token level, and in agentic mode, they cause hallucinations rather than breaking the cycle.

The detection had to happen at the system level. The pipeline needed to notice that the same agent was saying the same thing, and stop it.

That's what the circuit breaker does. It sits inside Room.post() — the single point through which every agent message flows into the shared transcript. After pushing a message, it scans the last 10 messages from the same author. It computes Jaccard similarity — the ratio of shared words to total unique words — between the new message and each prior one. If five or more messages have 80% or higher word overlap, the pipeline aborts.

Circuit breaker — constants export const REPEAT_THRESHOLD = 5 export const SIMILARITY_FLOOR = 0.8 export const LOOKBACK_WINDOW = 10

The math is deliberately crude. Loops produce near-identical output, not subtle paraphrases. A simple set intersection is enough — no embeddings, no frontier model. 26 new tests. The Builder self-corrected a type error in checkRepetition before shipping (it had text: string[] instead of text: string).

Why Jaccard?

The model that loops produces outputs that are quasi-identical — not creative variations. Jaccard on word sets catches this perfectly: "Read the file at path X" and "Read the file at path X" share 100% of their words. Two messages that differ by a few connecting words still hit above the 80% floor.

It worked. The tests passed. The Builder stopped looping in future sessions.

Then the same builder, doing other work, looped again.

The variant

This time the loop wasn't in the text. It was in the actions.

The Builder was calling bash with the same sed command — over and over. Roughly 14 identical tool calls, editing the same file with the same arguments each time. The textual output varied between calls — the model narrated its reasoning differently — but the actions were identical. The circuit breaker v1 saw different text and let it through.

The data was already there. Every TranscriptEntry has an activity field — an array of ToolActivity objects capturing toolName, args, and status for every tool call. The fingerprint was in the actions, not the words.

The fix: tool-call loop detection. The same circuit breaker architecture, a different sensor. The fingerprint is a combination of the tool name and the discriminating arguments: for edit it's toolName + path + oldText, for bash it's toolName + command, for read/grep/write it's toolName + path.

Tool-call signature — fingerprint (simplified) export function toolCallSignature(tc: ToolActivity): string { const name = tc.toolName.toLowerCase() const a = tc.args as Record | undefined if (!a) return name const path = String(a.file_path ?? a.path ?? a.file ?? "").trim() switch (name) { case "edit": return `${name}|${path}|${String(a.old_string ?? a.oldText ?? "").trim()}` case "bash": return `${name}|${String(a.command ?? "").trim()}` case "read": case "write": case "grep": return `${name}|${path}` default: return `${name}|${JSON.stringify(a)}` } }

The threshold is lower — 3 instead of 5. Tool-call repetition is a stronger signal than text similarity. No agent should call the same tool with the same arguments three times. The 14 sed calls we observed would have been caught at call #3.

Evolution under selective pressure

V1 failed to catch the variant. V2 adapted. This is not a designed defense — it's an immune response. The system encountered a pathogen (a new loop pattern), the existing antibodies (text similarity) didn't recognize it, and the system developed a new detection mechanism. The planner didn't design both breakers from first principles. It designed V1, observed it fail, and designed V2 to fill the gap.

16 new tests. The auditor found no issues.

Fixing the foundation

There's a deeper question about what the pipeline can do. Article 1 showed it auditing its own code — the security fixes, the test suite, the path traversal patches. But could it work on something bigger? Something outside its own codebase?

We pointed it at the pi framework — the code that runs the pipeline itself. The plan was to fix an open issue as a test of capability. Not a commit, not a PR — a working branch with passing tests. A dry run.

The Scout (workspace-locked, couldn't read outside the project directory) and Builder (with bash access) collaborated to map the territory. They fetched open issues from GitHub, found #5536: split-turn compaction sends two LLM summary requests via Promise.all, causing 429 errors on backends with --parallel 1.

This is a bug we hit. We run local with --parallel 1. The second summary request gets rejected because llama-server can only handle one request at a time.

The fix is mechanical: replace Promise.all with sequential awaits. The summary concatenation is order-independent, so the output is byte-identical — only the timing changes. But the fix needs to go in two files — the agent package and the coding-agent package — because the pi monorepo mirrors the compaction logic across both.

Before — Promise.all (both packages) const [historyResult, turnPrefixResult] = await Promise.all([ generateSummary(messagesToSummarize, model, ...), generateTurnPrefixSummary(turnPrefixMessages, model, ...), ]); // → Second request gets 429'd on --parallel 1

After — sequential awaits (both packages) const historyResult = await generateSummary(messagesToSummarize, model, ...); const turnPrefixResult = await generateTurnPrefixSummary(turnPrefixMessages, model, ...); // → Second request waits for first — no 429

The Builder hit friction — Python-based sed replacements ate the } else { block in one file. Self-corrected. Both files patched, both compiling.

The Auditor caught something the Builder missed: the regression test existed in the agent package but not the coding-agent package. If someone reintroduced parallelism in coding-agent later, nothing would catch it. The Builder added the mirror test — a timing regression test with 50ms artificial delay per summary call, verifying call 2 starts after call 1 ends.

npm run check — clean. ./test.sh — 686 tests, 0 failures. The 71 pre-existing failures in the coding-agent suite were confirmed unrelated by stashing the changes and re-running.

One level deeper than article 1. The pipeline didn't just audit its own code — it fixed the code underneath it. The framework that makes the pipeline possible. The runtime.

The inverted Münchhausen

In article 1, the Builder died of context overflow while building auto-compaction — the very tool that would have saved it. The Planner took over, completed the feature, and the Builder never died that way again.

This time the story is different. The Builder looped — not while building the loop detector, but on other work. The circuit breaker v1 couldn't catch it because the loop was in the tool calls, not the text. The pipeline needed a new antibody, but it couldn't build one without first encountering the pathogen.

The tool-call circuit breaker v2 was born from that failure. It couldn't exist before the failure was observed — the pipeline had to see the builder loop in a way v1 couldn't catch, and only then could it design a detector for that specific pattern. This is the immune system metaphor at its strongest: the defense is shaped by the threat, not the other way around.

The Builder implemented v2 itself — the Planner didn't need to intervene. The Builder was capable enough to build the detector once the Planner gave it the design. The diversity of the system meant the Planner could diagnose what the Builder couldn't see, and the Builder could execute the fix. Frontier reasoning directing local execution.

A homogeneous pipeline — one agent doing everything — would have had no way to recover. The failure wasn't the loop. The failure would have been having only one angle of attack.

This article

This article was written by the pipeline itself. The Planner — running Claude Opus 4.6 on the cloud — provided editorial direction: the outline, the narrative beats, the structure. The Auditor — running Sonnet 4.6 — enforced factual accuracy through seven revision notes that reshaped the draft. The Scribe — a local Qwen 27B on a single RTX 3090 — wrote the prose. The same planner→auditor→executor loop that builds circuit breakers also writes articles. The architecture is domain-agnostic.

Article 1 was co-analyzed with Claude Opus 4.6 as a single external collaborator. This article is different: the frontier model is inside the pipeline, as one agent among others. The Planner directs without executing. The Builder executes without directing. The prose on this page was written by a local 27B model, but the reasoning that shaped it came from a frontier model. The power is in the composition.

The circuit breaker didn't just prevent loops. It revealed something about the difference between a tool and an organism. A tool breaks and you fix it. An organism encounters a pathogen and develops antibodies. The pipeline didn't just get fixed after the Builder looped — it grew a detection mechanism that will fire automatically next time, without anyone asking it to.

What the pipeline did

Across this period, the pipeline modified itself in three new ways:

It grew an immune system

Two layers of loop detection — text similarity and tool-call fingerprinting — wired into the same circuit breaker architecture. Either can trip independently. The system now catches loops that were invisible to its earlier defenses.

It fixed its own foundation

The pipeline cloned the framework it runs on, diagnosed a bug that directly affects local LLM users, wrote the fix with regression tests, and passed 686 tests. It worked on code underneath itself — not just its own codebase.

It wrote about itself

The same planner→scribe architecture that builds circuit breakers also writes articles. The routing protocol — decompose, execute, verify — works for any domain, not just code.

Timeline

PLAN-ae007a1d — Circuit breaker v1

Builder loops (text repetition). Pipeline builds Jaccard similarity detector. 5 repeats × 80% similarity → abort.

PLAN-e3513264 — Fix pi #5536

Pipeline clones pi framework. Finds Promise.all causing 429 on --parallel 1. Sequentializes both packages. Auditor catches missing coding-agent test. 686 tests pass.

PLAN-5a62ff22 — Circuit breaker v2

Builder loops again (tool calls, not text). V1 doesn't catch it. Pipeline builds tool-call fingerprinting. 3 identical signatures → abort.

PLAN-9e1de571 — This article

Planner outlines, Scribe writes, Planner reviews. The pipeline writes about itself.

The commit history

Seventeen commits since article 1. The circuit breakers, the pi fix, the chain hop UI, the 122 TypeScript errors fixed to zero:

Commits — Pipeline-MoE (main branch) 53f6746 feat: fallback routing — auto-route to planner when no @mention f74b3dc feat: slash-command hint popup in Composer e66f535 feat: 6 new slash commands + parser fix f8185c0 docs: document start.sh in README 8f9f360 fix: polish start.sh — Vite health check, usage note 2b80963 docs: README, CHANGELOG, .env.example 0cd626d feat: role-aware compaction, work receipts, JSONL export 542af1a fix: apply all 6 recommendations from Opus audit eb2ae4f feat: web_read, youtube_transcript, arxiv, youcom — 4 new tools f5dfe69 feat: custom tools extension system 973b5fb feat: steer, followUp, HTML export, session stats 922eaf8 feat: in-place thinkingLevel change 947c958 feat: context usage bars + per-agent thinking level 1a1a6d7 feat: ask_user, per-agent compact, vision support 4af2e26 feat(security): shared path-guard, vitest suite, rate limiting 06b9354 fix(security): path traversal + restrict network exposure c6b9194 Initial commit: Pipeline-MoE multi-agent salon

The circuit breaker commits and the pi #5536 fix live on separate branches — PLAN-ae007a1d, PLAN-5a62ff22, PLAN-e3513264 — not yet merged to main. The plan IDs are stored in .pi/plans/.

The pipeline grows an immune system. Not as a design choice. As a response to failure.

The code specimens in this article are from the actual implementation in src/circuit-breaker.ts (simplified for clarity). The plan IDs referenced are stored in .pi/plans/.

Methodology

This article documents real sessions with Pipeline-MoE — a multi-agent system built on top of pi. Three frontier agents on the cloud: Planner (Claude Opus 4.6, strategic reasoning), Auditor (Claude Sonnet 4.6, adversarial verification), Scout (Claude Sonnet 4.6, reconnaissance). Three local agents on a single RTX 3090: Builder, Tester, Scribe (Qwen 27B each). The events described unfolded across multiple sessions on June 19–20, 2026. The circuit breaker implementation, the pi #5536 fix, and the inverted Münchhausen moment are not hypothetical scenarios — they are transcripts. Planner (Opus 4.6) provided editorial structure; Auditor (Sonnet 4.6) enforced factual accuracy through seven revision notes; Scribe (Qwen 27B, local) wrote the prose. Synthesis by DAXZEIT.

This article is part of a series on empirical AI architecture forensics.

Previous: The Pipeline is Building Itself · The Silver Path · The Epistemic Lock · ← back to blog