Harness Guides¶

Build better AI agent harnesses.

Runtime architecture for serious agent builders

Claude Code got exposed. Here’s the harness blueprint Python builders actually need.¶

After the March 31, 2026 reporting, a lot of people wanted to know how serious agent runtimes actually work. Harness Guides turns that moment into practical, Python-first documentation for building better harnesses, with OpenAI's Responses API as the concrete reference model layer.

12 architecture chapters python-first examples responses api mapping

Read the Blueprint Start Building Python Path

12 core systems¶

From tools and permissions to MCP, bridge protocols, memory, and cost tracking.

Python-first docs¶

Every core chapter explains the system in Python terms before pointing back to provenance or portability notes.

Responses API grounded¶

OpenAI-specific examples map to the official Responses API instead of legacy chat-completions thinking.

Why this exists now¶

On March 31, 2026, public reports circulated that Claude Code internals had been exposed again through npm packaging artifacts. That pushed a lot more developers to ask the right question:

not "how do I copy one product?"

but "how do serious agent harnesses actually work?"

Harness Guides exists to answer the second question. The goal is to study the architecture, extract the durable patterns, and turn them into public documentation that helps people build better systems intentionally.

Python first¶

The current version of the site is optimized for people building harnesses in Python.

chapter pages explain the architecture in Python terms first
OpenAI examples use the Responses API as the concrete reference
the original long-form blueprint remains available as provenance, not the primary teaching surface
Rust can still be added later, but it is not required to make the guide useful now

What every serious harness needs¶

Runtime contracts¶

Tool schemas, validation, permissions, concurrency rules, result formatting, and deterministic registration. These are the boundaries that keep the system legible.

Control loops¶

Conversation state, retries, recovery, transcript persistence, context compression, and budget gates. This is the difference between a demo and an actual operator tool.

Safety systems¶

Trust gates, permission chains, shell constraints, and explicit state transitions. Agents fail hardest where the runtime is vague.

Expansion surfaces¶

MCP, sub-agents, slash commands, and IDE bridges. Capability grows fast here, but so does complexity, so the docs have to keep up.

Why this site exists¶

Turn dense reverse-engineering notes into readable public documentation.
Give developers a common vocabulary for harness engineering.
Make the best patterns practical for Python builders first.
Help people build their own tooling instead of copying surfaces blindly.

Read by system¶

Blueprint chapters¶

More systems¶

OpenAI reference layer¶

The site uses the official Responses API as the default OpenAI reference surface for:

tool calls
long-running work
streaming responses
conversation continuity
built-in tools and remote MCP

Start here:

Build better¶

These pages translate the blueprint into concrete design principles for new projects:

fail closed by default
make permissions explicit
separate validation from policy
preserve prompt-cache stability
keep docs useful to builders, not spectators

Reading tracks¶

If you are building from scratch¶

Start with the Python path, then read Tool System, Query Engine, Permissions, and Session State in that order.

If you already have an agent¶

Go straight to Tool Execution Pipeline, Bootstrap, MCP, and IDE Bridge to tighten the runtime around what you already ship in Python.

If you want docs that spread¶

Use the blueprint pages as source material, then publish implementation guides, diagrams, comparison pages, and contribution notes around them.

Who this is for¶

developers building AI coding agents
toolsmiths designing safe execution harnesses
open-source maintainers documenting agent internals
researchers comparing runtime architectures across products

Why this can travel¶

Technical docs spread when they do three things well:

name the moving parts clearly
compress a hard system into reusable patterns
help builders ship better work immediately

That is the standard for this site.

Status¶

This site is structured to grow into a proper public docs hub. The blueprint is already here in chapter pages and the complete long-form reference, but the reader path is now Python-first and implementation-oriented.