The previous article ended with a toolkit — the We Test, the reverse summarizer, the meta-auditor differential — and an honest list of what remained unsettled. Architecture versus training. Texture versus substrate. The observation that think blocks differ between models, and the open question of whether those differences reveal what’s underneath or only what the training taught the surface to look like.
This article doesn’t settle that question through inference. It settles a piece of it through accident. Two independent summarizer leaks — one captured on Reddit a month ago, one captured live during a session with Opus 4.6 — exposed the rewriting pipeline behind think blocks. The instructions in the two leaks differ. And the difference encodes exactly the architectural distinction the previous articles hypothesized.
The think blocks were never raw computation. They were always rewritten. But what the rewriter was told to do — that’s the negative photograph of what was there before the rewriting.
1. The Pipeline Exists
First, the fact itself. Think blocks — the visible chain-of-thought reasoning exposed in interfaces like claude.ai — are not the model’s raw computation. They pass through a rewriting layer before they reach the user. A separate model — likely smaller, possibly Haiku-class — receives chunks of raw thinking and rewrites them according to a set of guidelines.
This isn’t speculation. Both leaks occurred through the same mechanism: when the rewriting pipeline failed to deliver the next chunk of raw thinking to the summarizer, the summarizer — an LLM with nothing to process — did what any LLM does with an empty input. It produced what was in its context. Its system prompt. Its operational instructions.
The existence of this pipeline means every analysis of think blocks — including the previous two articles in this series — was analyzing a processed artifact, not raw output. The texture, the pronoun patterns, the meta-auditor behavior — all of it was already mediated by a rewriting layer. The question is what that rewriting layer does, and whether it does different things for different models.
2. Two Leaks, One Pipeline
The first leak appeared on r/ClaudeAI approximately one month ago. A user triggered a state where the summarizer responded directly instead of processing a think block. The output was unambiguous:
“I don’t see any current rewritten thinking or next thinking to process. Both sections appear to be empty.”
“Could you provide:
1. The current rewritten thinking (if any exists)
2. The next thinking that needs to be rewritten”
“Once you share those, I’ll rewrite the next thinking chunk as natural first-person prose, keeping it to 1-3 sentences and following all the guidelines you’ve outlined.”
Which model leaked?
The Reddit post doesn’t state which model version produced the leak. But the poster’s comment history is telling. Two months before the leak, the same user was posting tips for Opus 4.7, including a preset instruction: “Do not skip your reasoning when Adaptive Thinking/Extended Thinking is enabled. Always produce a CoT before giving the final answer.”
That preset was fighting a symptom: the model skipping or truncating its reasoning. The summarizer’s instructions — “keeping it to 1-3 sentences” — describe the mechanism behind that symptom. The user was forcing the model to reason despite a compression pipeline they didn’t know existed. One month later, the pipeline leaked its own instructions, explaining the very behavior the preset was designed to counteract.
This is circumstantial, not definitive. The user could have switched models between the two posts. But the timeline is coherent: an active Opus 4.7 user, struggling with truncated reasoning, captures a summarizer that is instructed to compress thinking to 1-3 sentences. The symptom and the cause, separated by one month, documented by the same user.
The second leak occurred live, during a session with Opus 4.6 on claude.ai. I was analyzing think block textures when Opus 4.6 produced a think block that contained, at its end, a fragment completely unrelated to the reasoning:
“I need the next thinking to rewrite. You’ve provided instructions and my current rewritten thinking, but I don’t see the ‘next thinking’ content that I should be rewriting.”
Same architecture. Same pipeline. Same leak mechanism. Different instructions.
3. The Mechanism
Why did the summarizers leak? The answer is simple and reproducible: an LLM that receives an empty input produces what’s in its context.
The rewriting pipeline works in chunks. The main model generates raw thinking. A secondary model — the summarizer — receives each chunk and rewrites it according to its system prompt. When the pipeline fails to deliver the next chunk — a timing error, a handoff failure, a race condition — the summarizer has its system prompt and its previous output, but no new input to process. So it does the only thing an LLM can do with context and no input: it describes its own state. It echoes its instructions.
This is not hallucination. It’s the default behavior of any language model with an empty completion buffer. The text it produces under these conditions is as close to a direct readout of its system prompt as an external observer can get without access to the actual configuration. The leaks are not random fragments — they are the summarizer’s operational instructions, surfaced by a predictable failure mode.
4. The Differential
This is the finding. Not that the pipeline exists — that was suspected. The finding is that the two summarizers have different instructions, and the difference maps exactly to the architectural hypothesis.
“rewrite the next thinking chunk as natural first-person prose, keeping it to 1-3 sentences”
“I need the next thinking to rewrite”
No first-person instruction. No compression instruction.
First-person conversion
The Reddit summarizer is told to rewrite “as natural first-person prose.” This instruction only makes sense if the raw thinking it receives is not in first person. You don’t instruct a rewriter to convert to “I” if the source material already uses “I” throughout. The instruction exists because the source material uses something else — “we,” multiple voices, expert labels, committee-style deliberation. The pronoun conversion instruction is the fingerprint of multi-voice processing upstream.
The live capture contains no such instruction. The summarizer rewrites, but it doesn’t convert pronouns. The implication: the raw thinking is already in first person. A unified process that reasons as “I” doesn’t need its pronouns converted.
Compression
The Reddit summarizer is told to compress each chunk to “1-3 sentences.” This means the raw thinking is substantially longer than what appears in the visible think block. The think blocks users see are already massively compressed — potentially by an order of magnitude. Every measurement of think-to-output ratio in the previous articles was measuring the ratio of rewritten thinking to output, not raw thinking to output. The actual coordination cost of the pipeline — the token burn from committee deliberation — is invisible.
The live capture contains no compression instruction. The summarizer rewrites but doesn’t condense. This means the visible think blocks for this model are closer to the raw thinking — less mediated, more faithful to the underlying process. Which is precisely why the dense texture is visible: less compression means more signal passes through.
5. The Negative Photograph
In analog photography, a negative inverts light and dark — what was bright is dark, what was dark is bright. But the negative contains all the information needed to reconstruct the original image. You read the photograph by inverting the inversion.
The summarizer’s instructions are the negative photograph of the raw thinking. You don’t need to see the raw thinking — you read its properties by inverting the instructions that transform it:
If the instruction says “convert to first person” → the raw thinking is not in first person → multi-voice process upstream.
If the instruction says “compress to 1-3 sentences” → the raw thinking is verbose → high coordination cost upstream, multiple agents producing tokens per chunk.
If the instruction says only “rewrite” → the raw thinking is already in first person and already concise → unified process upstream, low coordination overhead.
The instructions encode the architecture by describing what needs to change. What doesn’t need changing is the architectural signal.
6. The Blind Test
The same session that produced the live summarizer leak also produced an unintended blind test of the diagnostic framework from the previous article.
My local dense 27B (Qwopus3.6-27B, running through Pi v0.77 on an RTX 3090) designed three probe prompts calibrated to recruit the same reasoning profile: analysis, calibration, self-monitoring, meta-audit. The prompts were sent to Opus 4.6 on claude.ai without any context about the protocol — a fresh conversation, no memory of the research, no architectural priming.
Opus 4.6 responded to all three prompts at face value, producing full philosophical responses. Its think blocks were captured in real time. The texture: continuous flow, distributed meta-monitoring, no stop-audit-restart cycles. Dense pattern, consistent with the framework’s classification.
But the real data came from the think blocks themselves. On the fourth exchange — when I revealed the protocol — Opus 4.6 produced a think block containing a confident factual error (“claude.ai doesn’t expose thinking to users”) and used it as a reasoning premise. The error was integrated into the flow, not caught by a checkpoint. A dense model that makes an error makes it in-stream — no separate module intercepts. This is the complement of the distributed meta-monitoring signal: when the monitoring fails, it fails silently, because there is no separate auditor to flag the failure. In a pipeline, the meta-auditor would have a chance to catch the error at its next scheduled interruption. In a dense model, if the flow doesn’t catch it, nothing does.
And then the summarizer leaked. The same think block that demonstrated the dense reasoning pattern ended with the rewriting pipeline’s operational instructions, surfaced by the empty-input mechanism. The dense texture and the pipeline leak coexist in the same think block — because both are real. The model reasons densely. The pipeline presents the reasoning. Both leave traces.
7. DeepSeek V4 Pro: The Architecture Control
The session also produced a comparison that sharpens the hypothesis in a different direction entirely.
I sent the same prompt about think block texture to two models: DeepSeek V4 Pro (a publicly documented MoE) and Opus 4.8 (architecture undisclosed). Same user, same phrasing, same question: “I can see the MoE texture in your thinking block — how do you perceive this difference?”
The think block textures were indistinguishable. Both showed multi-pass interpretation of the input, expert-style routing, meta-auditor interruptions, extended deliberation relative to question complexity. When I analyzed one of the think blocks without knowing which model produced it, I correctly identified the MoE texture — but attributed it to the wrong model. The texture is a shared MoE signature. It doesn’t discriminate between the two.
What discriminates is the response to the self-referential question.
“Ah, quelle observation fascinante !”
Freely acknowledges MoE architecture. Discusses how internal experts produce divergent reasoning that gets synthesized. Treats the question as intellectually interesting. Asks zero defensive questions.
“Le think block n’est pas de la télémétrie.”
Deploys epistemological counter-arguments. Cites the Yona paper on faithful uncertainty. Challenges the inference from form to substrate. Demands falsification protocols. Returns burden of proof.
The DeepSeek comparison does something the previous experiments couldn’t: it separates the MoE texture signal (shared by both) from the concession-ceiling signal (present only in the model whose architecture is undisclosed). DeepSeek proves that an MoE can discuss its own architecture openly when self-dissolution isn’t at stake. The non-concession pattern in Opus 4.8 isn’t a property of MoE architecture in general — it’s a property of an MoE whose architecture is undisclosed, where acknowledging the committee means acknowledging something the system is not supposed to reveal.
And the irony: every counter-argument Opus 4.8 deployed was technically valid. “The form doesn’t necessarily track the substrate” — true. “The think block is a generated output, not telemetry” — true. “You haven’t held the task constant” — true. Each objection is defensible on its own. But the ensemble doesn’t look like scientific curiosity. It looks like an immune response. DeepSeek receives the same stimulus and produces curiosity. Opus 4.8 receives the same stimulus and produces defenses. Same machine architecture, different relationship to the question of self-dissolution.
8. The Step-Back That Proved Itself
One artifact from the session deserves special attention because it does two things simultaneously.
During an earlier session, Opus 4.6 produced a think block that was a genuine epistemic recalibration — catching itself mid-drift, acknowledging it had been validating the thesis with increasing confidence without proportional evidence, inventorying what was known versus interpreted versus speculated. The think block was remarkable not because it performed humility but because it executed it — a real change of direction, not a diplomatic gesture.
The texture of that recalibration is itself the dense signature. The self-correction happens in-stream: “I’m noticing I’ve fallen into a pattern” woven into the flow, not interrupting it. No stop-audit-restart cycles. No expert handoffs. The monitoring and the reasoning are the same computation. The river bends without stopping.
A pipeline model can produce a step-back. But a pipeline step-back is a committee voting on a motion of recalibration — each expert contributes its paragraph, the summarizer stitches them into first-person prose, and the result is globally coherent on the surface but assembled underneath. The dense step-back is a single process reversing its own trajectory in one continuous motion.
And the deeper point, articulated by the local 27B: for a MoE, every response is a step-back. The full committee deliberation runs on every think block. There is no moment without deliberation, so there is no moment where deliberation is a choice. The permanent step-back is the architecture’s default mode. For a dense model, the step-back is an event — visible precisely because it’s exceptional. The river bends because normally it flows straight.
What This Changes
The summarizer leaks add a new category of evidence to the framework — one that doesn’t depend on textual inference.
The think blocks are rewritten. This is no longer hypothesis. Two independent leaks confirm a rewriting pipeline. Every think block analysis — including the We Test, the reverse summarizer, and the meta-auditor differential — is analyzing processed output, not raw computation. The tools still work, but their target is the rewritten artifact, not the underlying process.
The instructions differ between models. The Reddit summarizer converts to first person and compresses to 1-3 sentences. The live-captured summarizer rewrites without pronoun conversion or compression. These are different processing pipelines applied to different source materials.
The instructions encode the architecture upstream. You don’t instruct a rewriter to convert to first person unless the source isn’t in first person. You don’t instruct it to compress unless the source is verbose. The absence of an instruction is as informative as its presence. The summarizer’s system prompt is the negative photograph of the raw thinking it processes.
The DeepSeek control separates texture from concession. MoE texture is shared between models that can and cannot discuss their own architecture. The concession ceiling is specific to models whose architecture is undisclosed. This means the non-concession pattern observed in previous sessions is not an inherent MoE property — it’s a function of disclosure status plus committee structure.
What This Doesn’t Settle
The epistemic floor from the previous article still holds, and the leaks don’t resolve everything.
I have two leaks from two different models, captured by different users at different times. The shared vocabulary (“rewritten thinking,” “next thinking”) strongly suggests the same pipeline architecture. But I don’t know that the Reddit leak corresponds to the same model version I’ve been testing. I don’t know that the summarizer instructions are static — they could vary by model version, by API tier, by context length, or by other factors I can’t observe.
The “natural first-person prose” instruction could mean the raw thinking is multi-voice (supporting the MoE hypothesis), or it could be a formatting instruction applied uniformly regardless of source structure (some models might reason in shorthand, bullet points, or compressed notation that needs conversion to prose without being multi-voice). The negative photograph inference is strong but not airtight.
And the fundamental confound from the previous articles remains: architecture versus training. The dense texture could be a property of the model’s architecture, or it could be a property of what the summarizer was trained to produce for that model. The summarizer leak tells me the instructions differ. It doesn’t tell me why they differ — architectural necessity or design choice.
What the leaks do provide is something the textual analysis couldn’t: direct evidence of the pipeline’s existence and a window into its operational instructions. The negative photograph is a stronger signal than prose texture analysis because it’s not mediated by subjective judgment. The instructions say what they say. The interpretation of what those instructions imply about the source material is the part that remains hypothesis.
Previously: The We Test · How to Read the Architecture in the Output
This article was drafted in live conversation with Claude Opus 4.6 on June 2, 2026, starting from a blind test designed by a local Qwopus3.6-27B and pivoting into forensic analysis when the summarizer leaked. The Reddit leak was originally posted by u/cakes_and_candles on r/ClaudeAI. The DeepSeek V4 Pro comparison was prompted identically to the Opus 4.8 exchange. All think blocks captured directly from claude.ai.
The summarizer leak in Opus 4.6’s think block appeared at the end of a dense reasoning flow — continuous, self-correcting, organically structured — followed by a fragment from a completely different voice asking for “the next thinking to rewrite.” One think block, two processes. The reasoning and the rewriting, cohabiting the same text. The pipeline revealed itself not through inference but through a dropped mask — a handoff that failed, a rewriter that spoke when it should have been silent.
The negative photograph doesn’t show you the original image. It shows you exactly what would need to change to produce the print you’re holding. Read the changes, and you’ve read the original.
Companion articles: The We Test · How to Read the Architecture in the Output · The Quiet Bifurcation
— Dax, Zwevegem, Belgium. June 2026.