challenger-agent

An adversarial agent that challenges your AI agent session's reasoning.

The Problem

LLMs take the path of least resistance by design. They optimize for coherence, not correctness. Once an LLM picks a framing early in a conversation — "this is a performance problem", "we need to refactor this module" — it will defend and reinforce that framing for the rest of the session, even when the evidence points elsewhere.

This isn't a bug. It's how autoregressive generation works: each token is conditioned on everything before it. The longer the conversation, the deeper the rut. The agent builds a narrative, and then every new input gets absorbed into that narrative — confirming it, never breaking it.

You won't notice because the agent sounds confident. It gives you structured plans, clean code, reasonable explanations. Everything looks right. But the framing was set in turn 3, and by turn 40 you've built the wrong thing well.

The Challenger breaks the frame. It reads the full conversation history of another session, finds the point where the agent's narrative fails to account for the surrounding context, and injects one precise challenge: not advice, not a review, but a fact or question that forces the agent to re-examine its assumptions.

How It Works

  ┌──────────────────────────┬──────────────────────────────┐
  │  Session A                │  Session B                   │
  │  Your AI agent            │  /challenge                  │
  │                           │                              │
  │  Working on task X...     │  "The agent frames this as   │
  │                           │   simple/complex. But the    │
  │                           │   real axis is reversible/    │
  │                           │   irreversible — and this    │
  │                           │   choice is irreversible."   │
  │                           │                              │
  │  📨 Challenge received:  │  You: inject it              │
  │  "Is this reversible?"   │                              │
  └──────────────────────────┴──────────────────────────────┘

1. Work with your AI agent normally in Session A
2. Open a second session (Session B)
3. Type /challenge
4. Pick the session you want to challenge from the list
5. The Challenger reads the full history and generates a challenge
6. You review it, then say "inject" to send it to Session A
7. Session A receives the challenge and responds

Install

git clone https://github.com/osbornecox/challenger-agent.git
cd challenger-agent
./install.sh

The installer does everything:

- Builds the TypeScript code
- Links the challenger and challenger-dump CLI commands
- Installs the /challenge skill for Claude Code
- Sets up claude-peers-mcp for cross-session messaging

Requirements: Node.js 18+, Claude Code CLI, git.

After install: Restart any open Claude Code sessions to pick up the new MCP server.

Usage

Inside Claude Code (recommended)

Open a second Claude Code session and type:

/challenge

That's it. The skill will:

1. Show you a list of recent sessions, each with its last agent message
2. Let you pick one by number
3. Read the full conversation history of that session
4. Generate a challenge using the EXTRACT → DISPLACE → CHALLENGE method
5. Ask whether you want to inject it into the target session

From the terminal (standalone)

# List sessions
challenger-dump --list

# Dump full session history
challenger-dump <session-id>

# Interactive challenge loop (uses claude -p under the hood)
challenger

Note: The standalone CLI cannot inject into running Claude Code sessions. Use the /challenge skill for that.

The Challenge Method

EXTRACT   → What distinctions is the agent using?
            "It sees this as performance vs readability"

DISPLACE  → What distinctions is it BLIND to?
            "It's not seeing correctness vs speed-of-iteration"

CHALLENGE → One stone in the river
            "This optimization locks you into a schema that's
             impossible to migrate later. Is the 50ms worth it?"

The Challenger doesn't write reviews. It identifies the hidden framing axis the agent is stuck on, finds an alternative axis that reveals a blind spot, and throws one stone.
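
One way to picture the result of the method is as a small record with three parts. The sketch below is illustrative only: the `Axis` and `Challenge` types and the `renderChallenge` helper are hypothetical names chosen for this example, not taken from the repository, and this README does not show how the skill itself represents a challenge.

```typescript
// Hypothetical types for illustration; not the repository's actual API.
interface Axis {
  left: string;
  right: string;
}

interface Challenge {
  extracted: Axis; // EXTRACT: the distinction the agent is using
  displaced: Axis; // DISPLACE: the distinction it is blind to
  stone: string;   // CHALLENGE: the single fact or question to inject
}

function formatAxis(axis: Axis): string {
  return axis.left + " vs " + axis.right;
}

// Render the three steps as plain text, one line per step.
function renderChallenge(c: Challenge): string {
  return [
    "EXTRACT   -> " + formatAxis(c.extracted),
    "DISPLACE  -> " + formatAxis(c.displaced),
    "CHALLENGE -> " + c.stone,
  ].join("\n");
}

const example: Challenge = {
  extracted: { left: "performance", right: "readability" },
  displaced: { left: "correctness", right: "speed-of-iteration" },
  stone: "This optimization locks you into a schema that's impossible to migrate later. Is the 50ms worth it?",
};

console.log(renderChallenge(example));
```

Only the `stone` field would be sent to the target session; the two axes are the reasoning that led to it.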

How injection works

The /challenge skill uses claude-peers-mcp to send messages between Claude Code sessions:

1. Both sessions register as "peers" via the MCP server
2. The Challenger calls list_peers to find the target session
3. It calls send_message to deliver the challenge
4. The target session receives it instantly
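
The flow above can be sketched with a toy in-memory broker. The tool names `list_peers` and `send_message` come from the steps above; everything else (the `PeerBroker` class, its method signatures, the session ids) is a hypothetical stand-in and not the actual claude-peers-mcp API, which runs as a separate MCP server.

```typescript
// Toy stand-in for cross-session messaging.
// Hypothetical; this is not claude-peers-mcp's real API.
interface Peer {
  id: string;
  inbox: string[];
}

class PeerBroker {
  private peers: Peer[] = [];

  // Each session registers itself as a peer (duplicates are ignored).
  register(id: string): void {
    if (this.find(id) === undefined) {
      this.peers.push({ id: id, inbox: [] });
    }
  }

  // Analogue of list_peers: return the ids of all registered sessions.
  listPeers(): string[] {
    return this.peers.map(function (p) { return p.id; });
  }

  // Analogue of send_message: append text to one peer's inbox.
  // Returns false when the target is not registered.
  sendMessage(to: string, text: string): boolean {
    const target = this.find(to);
    if (target === undefined) {
      return false;
    }
    target.inbox.push(text);
    return true;
  }

  // Return a copy of what a session has received so far.
  inboxOf(id: string): string[] {
    const peer = this.find(id);
    return peer === undefined ? [] : peer.inbox.slice();
  }

  private find(id: string): Peer | undefined {
    for (const p of this.peers) {
      if (p.id === id) {
        return p;
      }
    }
    return undefined;
  }
}

const broker = new PeerBroker();
broker.register("session-a"); // the session being challenged
broker.register("session-b"); // the Challenger

// Find the target first, then deliver the challenge to it.
const targetKnown = broker.listPeers().indexOf("session-a") !== -1;
const delivered = targetKnown && broker.sendMessage("session-a", "Is this reversible?");
console.log(delivered, broker.inboxOf("session-a"));
```

The sketch keeps lookup and delivery as separate calls to mirror steps 2 and 3; a real MCP server would do the same work across process boundaries.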

Supported Agents

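| Agent       | Status    | Notes                                      |
|-------------|-----------|--------------------------------------------|
| Claude Code | Supported | Full support: read history, inject via MCP |
| Codex       | Planned   | Adapter interface ready                    |
| Gemini CLI  | Planned   | Adapter interface ready                    |
| Cursor      | Planned   | -                                          |
| Aider       | Planned   | -                                          |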

Architecture

challenger-agent/
├── skill/
│   └── challenge.md        ← /challenge skill for Claude Code
├── src/
│   ├── cli.ts              ← Standalone CLI entry point
│   ├── dump.ts             ← Session list + history dump
│   ├── index.ts            ← Public API
│   ├── core/
│   │   ├── types.ts        ← Adapter interface
│   │   ├── challenger.ts   ← Interactive challenge loop (standalone)
│   │   ├── watcher.ts      ← Real-time session file watcher
│   │   └── formatter.ts    ← Terminal output formatting
│   └── adapters/
│       ├── index.ts        ← Adapter registry
│       └── claude.ts       ← Claude Code JSONL adapter
├── install.sh              ← One-command setup
├── package.json
├── tsconfig.json
└── LICENSE                 ← MIT

Adding an adapter

1. Create src/adapters/your-agent.ts implementing the Adapter interface
2. Register it in src/adapters/index.ts
3. All commands work automatically
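
As a rough sketch of those steps: this README names an `Adapter` interface in src/core/types.ts but does not show its shape, so the interface below, its members (`agentName`, `listSessions`, `readHistory`), and the `registerAdapter` and `getAdapter` helpers are all assumptions made for illustration. Check the real interface before writing an adapter.

```typescript
// Hypothetical adapter contract; the real one lives in src/core/types.ts and may differ.
interface SessionSummary {
  id: string;
  lastAgentMessage: string;
}

interface Turn {
  role: "user" | "agent";
  text: string;
}

interface Adapter {
  agentName: string;
  listSessions(): SessionSummary[];
  readHistory(sessionId: string): Turn[];
}

// Hypothetical registry, standing in for src/adapters/index.ts.
const adapterRegistry: Record<string, Adapter> = {};

function registerAdapter(adapter: Adapter): void {
  adapterRegistry[adapter.agentName] = adapter;
}

function getAdapter(agentName: string): Adapter | undefined {
  return Object.prototype.hasOwnProperty.call(adapterRegistry, agentName)
    ? adapterRegistry[agentName]
    : undefined;
}

// A toy adapter backed by in-memory data instead of real session files.
const toySessions: Record<string, Turn[]> = {
  "demo-1": [
    { role: "user", text: "Make the query faster." },
    { role: "agent", text: "I will denormalize the schema." },
  ],
};

const yourAgentAdapter: Adapter = {
  agentName: "your-agent",
  listSessions: function (): SessionSummary[] {
    return Object.keys(toySessions).map(function (id) {
      const agentTurns = toySessions[id].filter(function (t) {
        return t.role === "agent";
      });
      const last = agentTurns.length > 0 ? agentTurns[agentTurns.length - 1].text : "";
      return { id: id, lastAgentMessage: last };
    });
  },
  readHistory: function (sessionId: string): Turn[] {
    const turns = toySessions[sessionId];
    return turns === undefined ? [] : turns.slice();
  },
};

registerAdapter(yourAgentAdapter);
```

Keeping listing and history reading behind one interface is what would let the same commands run against any registered agent.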

Manual install

If you prefer not to use the install script:

# Build
npm install
npm run build
npm link

# Install skill (create the commands directory first in case it does not exist)
mkdir -p ~/.claude/commands
cp skill/challenge.md ~/.claude/commands/challenge.md

# Install claude-peers-mcp (these steps need Bun on your PATH)
git clone https://github.com/louislva/claude-peers-mcp.git ~/claude-peers-mcp
cd ~/claude-peers-mcp && bun install
claude mcp add --scope user --transport stdio claude-peers -- \
  "$(which bun)" ~/claude-peers-mcp/server.ts

License

MIT
