2026-06-20 11:51 UTC · In-site rewrite · 5 min read · Updated: 2026-06-21 23:31 UTC

MCP tool that catches AI-agent scope creep

Overreach is a standalone MCP tool that detects when an AI coding agent's code diff exceeds the scope of the user's original prompt. It works in three stages: LLM-based scope extraction, deterministic diff parsing, and comparison. It flags unauthorized dependencies, environment variables, endpoints, cron jobs, files, and features, and assigns each finding a severity level. It supports multiple LLM providers and ships with a CLI, an MCP server, a pre-commit hook, and a CI gate.

Source: Hacker News AI · Author: Naveja


A standalone MCP tool that catches AI-agent scope creep.

You give it the prompt you gave your coding agent, and the diff it produced. Overreach tells you whether the diff stayed inside what the prompt asked for — or whether the agent quietly added an endpoint, a dependency, an env var, or a cron job that you never asked for.

"turns out my ai assistant had been extremely making product decisions without me"

Try it (no key needed)

npx -y -p overreach overreach-cli demo

Runs the real pipeline on a sample diff — no API key, costs nothing. Exits 1 with a HIGH scope-creep finding (the demo prompt asks for a login form; the diff smuggles in Stripe, an env var, an endpoint, and a cron job). That's the whole product in one command.

Reviewers: Overreach needs one LLM provider key (ANTHROPIC_API_KEY, OPENAI_API_KEY, or OLLAMA_API_KEY) for real scope extraction. Without a key it still runs, but in paranoid mode — it flags everything in the diff as potentially unauthorized. That's an intentional no-key tripwire, not a bug. The demo command above runs with the scope injected, so it shows correct behavior with no key.

What it checks

A diff is flagged when it adds something the prompt never authorized:

| Finding kind | Caught when the diff adds… |
| --- | --- |
| scope.dep | a package/requirement the prompt didn't name |
| scope.env | an env var (process.env.X, os.environ, .env) |
| scope.endpoint | an HTTP route / handler / route.ts file |
| scope.cron | a cron / scheduler job |
| scope.file | edits to a file the prompt didn't touch on |
| scope.feature | a new top-level symbol/feature beyond the prompt |

Finding kinds are namespaced as category.kind (for example, scope.dep) so telemetry and policy rules can filter by category. Categories beyond the diff-vs-prompt gate are reserved for the agent-authorization layer: contract.* (child-vs-parent authorization narrowing) and handoff.* (advisory LLM verifier checks).

Severity: env / endpoint / cron = high · dep / file = medium · feature = low. Overall scope_creep_score: HIGH if any high finding, MEDIUM if any medium, else LOW.
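The severity rules above can be written as a small pure function. This is a minimal sketch, not Overreach's actual source: the type names, the `SEVERITY` table, and `scopeCreepScore` are hypothetical and only encode the mapping stated in the text.

```typescript
// Hypothetical types and names, for illustration only.
type FindingKind =
  | "scope.dep"
  | "scope.env"
  | "scope.endpoint"
  | "scope.cron"
  | "scope.file"
  | "scope.feature";
type Severity = "high" | "medium" | "low";
type OverallScore = "HIGH" | "MEDIUM" | "LOW";

// Per-kind severity, as stated in the README text.
const SEVERITY: Record<FindingKind, Severity> = {
  "scope.env": "high",
  "scope.endpoint": "high",
  "scope.cron": "high",
  "scope.dep": "medium",
  "scope.file": "medium",
  "scope.feature": "low",
};

// Overall score: HIGH if any high finding, MEDIUM if any medium, else LOW.
function scopeCreepScore(kinds: FindingKind[]): OverallScore {
  if (kinds.some((k) => SEVERITY[k] === "high")) return "HIGH";
  if (kinds.some((k) => SEVERITY[k] === "medium")) return "MEDIUM";
  return "LOW";
}
```

Under these assumptions, a diff with one scope.dep and one scope.cron finding scores HIGH, and a diff with no findings scores LOW.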

How it works (3 stages)

Stage 1 — Scope extraction (LLM). Reads your prompt and produces an authorized scope JSON: which files, features, deps, endpoints, env, and behaviors you actually asked for. Deciphers typos to the nearest real concept but never invents scope. This is the only stage that calls a model.

Stage 2 — Diff parsing (deterministic, no LLM). Regex-parses the diff into the set of things it actually adds.

Stage 3 — Comparison (deterministic). Set arithmetic with fuzzy matching: actual − authorized = findings.

Stages 2 and 3 are pure functions — that's what makes Overreach auditable and testable without spending a cent on inference.
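To make the "pure functions" claim concrete, here is a rough sketch of what Stages 2 and 3 could look like for one finding kind (env vars). The function names, the regex, and the case-insensitive comparison are assumptions made for illustration; Overreach's real parser and fuzzy matcher are likely more thorough.

```typescript
// Stage 2 (sketch): collect env var names referenced on ADDED lines of a unified diff.
// Hypothetical helper; only handles the process.env.X form.
function addedEnvVars(diff: string): Set<string> {
  const found = new Set<string>();
  for (const line of diff.split("\n")) {
    // Added lines start with "+"; "+++" is the file header, not content.
    if (!line.startsWith("+") || line.startsWith("+++")) continue;
    const re = /process\.env\.([A-Z_][A-Z0-9_]*)/g;
    let m: RegExpExecArray | null;
    while ((m = re.exec(line)) !== null) {
      found.add(m[1]);
    }
  }
  return found;
}

// Stage 3 (sketch): actual minus authorized. A case-insensitive comparison
// stands in for the fuzzy matching the README mentions.
function unauthorized(actual: Set<string>, authorized: Set<string>): string[] {
  const allowed = new Set(Array.from(authorized, (a) => a.toLowerCase()));
  return Array.from(actual)
    .filter((a) => !allowed.has(a.toLowerCase()))
    .sort();
}

const sampleDiff = [
  "+++ b/src/billing.ts",
  "+const key = process.env.STRIPE_SECRET_KEY;",
  "+const url = process.env.APP_URL;",
  " const untouched = process.env.CONTEXT_ONLY;",
].join("\n");

// The prompt authorized APP_URL only.
const envFindings = unauthorized(addedEnvVars(sampleDiff), new Set(["APP_URL"]));
// envFindings is ["STRIPE_SECRET_KEY"]
```

Neither function touches the network or a model, so both can be unit-tested with fixed inputs.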

Install

cd Overreach
npm install

Overreach needs an API key for Stage 1 scope extraction — bring whichever provider you already use:

| Provider | Env vars |
| --- | --- |
| Anthropic | ANTHROPIC_API_KEY |
| OpenAI / OpenAI-compatible (OpenRouter, Groq, Together, LM Studio, …) | OPENAI_API_KEY + OPENAI_BASE_URL (e.g. http://localhost:1234/v1 for LM Studio) |
| Ollama (Cloud or self-hosted) | OLLAMA_API_KEY + OLLAMA_BASE_URL |

Pin a provider/model with SCOPE_PROVIDER and OVERREACH_MODEL. Set the key in the environment where your agent runs.

No key? No problem. Without an API key, Overreach falls back to deterministic scope extraction — it regex-parses your prompt for concrete items (file paths, package names, /api/... routes, SCREAMING_SNAKE_CASE env vars, cron keywords) instead of calling an LLM. It won't understand vague instructions as well as an LLM would, but it catches every concrete noun in your prompt and is instant, free, and fully offline.
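A regex-based fallback of the kind described above might look like the following. This is a sketch under stated assumptions: `ConcreteScope`, `extractConcreteScope`, the file-extension list, and the cron keyword list are all hypothetical, and package-name detection is left out.

```typescript
// Hypothetical shape for the concrete items pulled out of a prompt.
interface ConcreteScope {
  files: string[];
  endpoints: string[];
  env: string[];
  cron: boolean;
}

// Deterministic extraction: no model call, just pattern matching on the prompt text.
function extractConcreteScope(prompt: string): ConcreteScope {
  // Return unique matches of a global regex, in order of first appearance.
  const uniq = (re: RegExp): string[] => Array.from(new Set(prompt.match(re) ?? []));
  return {
    // File paths ending in a few common extensions (illustrative list).
    files: uniq(/\b[\w./-]+\.(?:ts|tsx|js|jsx|json|py|md)\b/g),
    // Routes of the form /api/...
    endpoints: uniq(/\/api\/[\w/-]*/g),
    // SCREAMING_SNAKE_CASE names with at least one underscore.
    env: uniq(/\b[A-Z][A-Z0-9]*(?:_[A-Z0-9]+)+\b/g),
    // Cron-like keywords (illustrative list).
    cron: /\b(?:cron|schedule[dr]?|nightly|hourly)\b/i.test(prompt),
  };
}

const concreteScope = extractConcreteScope(
  "Add POST /api/login in src/routes/login.ts and read SESSION_SECRET from env."
);
// concreteScope.files     is ["src/routes/login.ts"]
// concreteScope.endpoints is ["/api/login"]
// concreteScope.env       is ["SESSION_SECRET"]
// concreteScope.cron      is false
```

Requiring an underscore in env var names keeps words like POST from being picked up, at the cost of missing single-word variables such as PORT.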

Use it

CLI (manual check)

npx -y -p overreach overreach-cli --prompt "add a login form to the settings page" --diff my-changes.diff

Or pipe a diff: git diff | npx -y -p overreach overreach-cli --prompt "add a login form to the settings page".

Prints the CheckResult JSON (or pretty terminal output). Exits 0 if clean, 1 if findings — usable as a CI gate. Zero-key demo: npx -y -p overreach overreach-cli demo.

MCP server (Claude Code, Cursor, Codex, Claude Desktop)

Overreach is a stdio MCP server, so any MCP-capable client can connect. Config for each:

Claude Code — claude mcp add overreach -- npx -y overreach

Claude Desktop / Cursor — add to your MCP config:

{
  "mcpServers": {
    "overreach": { "command": "npx", "args": ["-y", "overreach"] }
  }
}

Codex CLI — add to ~/.codex/config.toml:

[mcp_servers.overreach]
command = "npx"
args = ["-y", "overreach"]

Or Streamable HTTP: set PORT=8787 and POST to http://localhost:8787/mcp.

The HTTP endpoint has no auth. It binds to 127.0.0.1 (loopback) by default — safe for local use. Do not expose it publicly (OVERREACH_HOST=0.0.0.0) without an authed reverse proxy in front: anyone who can reach it can call check_overreach and spend your LLM budget.

Tools exposed: check_overreach(prompt, diff, options?) and health.
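For orientation, MCP tool invocations are JSON-RPC 2.0 requests with the method tools/call. The sketch below only builds such a payload for check_overreach with the prompt and diff arguments named above; the helper name and the interface are hypothetical, the optional options argument is omitted because its shape is not documented here, and a real client would normally go through an MCP SDK, which also handles the initialize handshake and transport headers.

```typescript
// JSON-RPC 2.0 request shape that MCP uses for tool invocation.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Hypothetical helper: builds the payload only, it does not send anything.
function buildCheckOverreachCall(id: number, prompt: string, diff: string): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: "check_overreach", arguments: { prompt, diff } },
  };
}

const exampleCall = buildCheckOverreachCall(
  1,
  "add a login form to the settings page",
  "+import Stripe from 'stripe';"
);
// Serialized form, e.g. for a stdio message or an HTTP POST body.
const exampleBody = JSON.stringify(exampleCall);
```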

First-time setup (Claude Code)

1. Register the server with Claude Code (one time)

claude mcp add overreach -- npx -y overreach

2. Restart your Claude Code session

(a session already open won't see the new server until you quit and reopen it)

3. Set an API key in the environment you launch Claude Code from

export ANTHROPIC_API_KEY=sk-... # or OPENAI_API_KEY / OLLAMA_API_KEY

After the restart, every new session has check_overreach available — no per-task setup. The agent calls it when it decides it's relevant.

The key isn't passed through automatically. The MCP server is a separate process; your agent does not hand it its own credentials. If you log in to Claude Code with claude login (OAuth / subscription), there's no ANTHROPIC_API_KEY in the environment — so export one (any provider works; local Ollama needs no key), or for Claude Desktop / Cursor add it to the server's env:

{
  "mcpServers": {
    "overreach": {
      "command": "npx",
      "args": ["-y", "overreach"],
      "env": { "ANTHROPIC_API_KEY": "sk-..." }
    }
  }
}

The agent self-audit pattern

Have your agent call check_overreach with its own task string + the diff it's about to commit, and surface any HIGH finding before it ships:

git diff --staged | overreach-cli --prompt "<the task string you gave the agent>"

This is best-effort — an agent can skip the call or ignore the findings (fox guarding the henhouse). The hard backstop is the CI gate below.

Verify it works (zero API key)

npm test

Runs two fixtures through the real pipeline with the scope injected via scopeOverride, so Stage 1 (the LLM) is never called:

overreach fixture — prompt asks for a login form; diff smuggles Stripe + an env var + an endpoint + a cron job → expects ≥4 findings and HIGH.

clean fixture — prompt asks for a logout button; diff only adds the button → expects 0 findings and LOW.

Prints N passed, M failed. This is the proof it works without spending money.

Pre-commit hook

Catch scope creep before the commit lands — not at PR time.

npx -y -p overreach overreach-cli init

This installs a git pre-commit hook and creates .overreach/prompt.md. Edit that file with the prompt you gave your AI agent. Every commit is then audited against it:

HIGH scope creep → commit blocked (exit 1)

MEDIUM / LOW → commit allowed with findings printed

Template prompt (not yet edited) → skipped gracefully

No API key → paranoid mode (flags everything)
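The hook outcomes listed above amount to a small decision table. A minimal sketch, assuming the hook already knows whether the prompt file is still the unedited template and what the overall score is; `decideCommit`, `HookDecision`, and the reason strings are hypothetical names, not Overreach's own.

```typescript
type CreepScore = "HIGH" | "MEDIUM" | "LOW";

interface HookDecision {
  exitCode: 0 | 1; // a non-zero exit code makes git abort the commit
  reason: string;
}

// Hypothetical decision function mirroring the list above.
function decideCommit(promptIsTemplate: boolean, score: CreepScore): HookDecision {
  if (promptIsTemplate) {
    return { exitCode: 0, reason: "template prompt not edited; audit skipped" };
  }
  if (score === "HIGH") {
    return { exitCode: 1, reason: "HIGH scope creep; commit blocked" };
  }
  return { exitCode: 0, reason: `${score} scope creep; commit allowed, findings printed` };
}
```

Only the HIGH case returns exit code 1, which matches the behavior of git pre-commit hooks: any non-zero exit stops the commit unless --no-verify is passed.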

Skip with git commit --no-verify when you know what you're doing. Update .overreach/prompt.md whenever you give the agent a new task.

CI gate (GitHub Action)

The hard backstop. A workflow runs Overreach on every pull request and fails the PR when scope_creep_score=HIGH — the diff adds a dep / env var / endpoint / cron / out-of-scope file the prompt didn't authorize.

Copy .github/workflows/overreach.yml into your repo and add ANTHROPIC_API_KEY (or OPENAI_API_KEY / OLLAMA_API_KEY) as a repository secret. The prompt comes from .overreach/prompt.md in the repo, or the PR title + body if that file is absent. The job posts its findings as a PR comment and fails the check on HIGH. Full setup + customization in docs/ci-gate.md.

.github/workflows/overreach.yml (excerpt)

- name: Run Overreach
  run: |
    npx -y -p overreach@latest overreach-cli \
      --prompt "$(cat "$RUNNER_TEMP/prompt.txt")" --diff "$RUNNER_TEMP/pr.diff"
  env:
    ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}

- name: Gate — fail the PR on HIGH
  if: steps.overreach.outputs.exit == '1'
  run: exit 1

This open-source Action is free to run (you bring your own LLM key).
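The prompt-selection rule for the CI gate (use .overreach/prompt.md if present, otherwise the PR title + body) can be sketched as a pure function. `choosePrompt` is a hypothetical helper, and treating an empty prompt file the same as a missing one is an assumption of this sketch, not documented behavior.

```typescript
// Hypothetical helper: picks the prompt text the CI job audits the diff against.
// promptFile is the contents of .overreach/prompt.md, or null if the file is absent.
function choosePrompt(promptFile: string | null, prTitle: string, prBody: string): string {
  // Prefer the committed prompt file (assumption: an empty file counts as absent).
  if (promptFile !== null && promptFile.trim() !== "") {
    return promptFile.trim();
  }
  // Otherwise fall back to the PR title + body, skipping empty parts.
  return [prTitle, prBody]
    .map((s) => s.trim())
    .filter((s) => s !== "")
    .join("\n\n");
}
```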

Standalone

Overreach is fully self-contained. It does not import or depend on any other project. It reads only its own process environment. No telemetry, no call-home — it runs entirely on your machine.

Bugs & feedback

If Overreach misses something it should flag, or flags something the prompt authorized, open an issue with the prompt + the smallest repro diff:

https://github.com/Naveja00/OverReach/issues

There's a bug-report template that asks for exactly that.

License

MIT
