MCP tool that catches AI-agent scope creep
Overreach is a standalone MCP tool that detects when an AI coding agent's code diff exceeds the scope of the user's original prompt. It works in three stages: LLM-based scope extraction, deterministic diff parsing, and comparison. It flags unauthorized dependencies, environment variables, endpoints, cron jobs, files, and features with severity levels. Supports multiple LLM providers and includes CLI, MCP server, pre-commit hooks, and CI gating for seamless integration.
Notifications You must be signed in to change notification settings
Fork 0
Star 1
BranchesTags
Open more actions menu
Folders and files
NameName
Last commit message
Last commit date
Latest commit
History
15 Commits
15 Commits
.github
.github
docs
docs
src
src
tests
tests
.gitignore
.gitignore
CLAUDE.md
CLAUDE.md
HANDOFF-AGENT-AUTH.md
HANDOFF-AGENT-AUTH.md
LICENSE
LICENSE
README.md
README.md
TESTED-MODELS.md
TESTED-MODELS.md
package-lock.json
package-lock.json
package.json
package.json
tsconfig.json
tsconfig.json
Repository files navigation
A standalone MCP tool that catches AI-agent scope creep.
You give it the prompt you gave your coding agent, and the diff it produced. Overreach tells you whether the diff stayed inside what the prompt asked for — or whether the agent quietly added an endpoint, a dependency, an env var, or a cron job that you never asked for.
"turns out my ai assistant had been extremely making product decisions without me"
Try it (no key needed)
npx -y -p overreach overreach-cli demo
Runs the real pipeline on a sample diff — no API key, costs nothing. Exits 1 with a HIGH scope-creep finding (the demo prompt asks for a login form; the diff smuggles in Stripe, an env var, an endpoint, and a cron job). That's the whole product in one command.
Reviewers: Overreach needs one LLM provider key (ANTHROPIC_API_KEY, OPENAI_API_KEY, or OLLAMA_API_KEY) for real scope extraction. Without a key it still runs, but in paranoid mode — it flags everything in the diff as potentially unauthorized. That's an intentional no-key tripwire, not a bug. The demo command above runs with the scope injected, so it shows correct behavior with no key.
What it checks
A diff is flagged when it adds something the prompt never authorized:
Finding kind Caught when the diff adds…
scope.dep a package/requirement the prompt didn't name
scope.env an env var (process.env.X, os.environ, .env)
scope.endpoint an HTTP route / handler / route.ts file
scope.cron a cron / scheduler job
scope.file edits to a file the prompt didn't touch on
scope.feature a new top-level symbol/feature beyond the prompt
Finding kinds are namespaced (.) so telemetry and policy rules can filter by category. Categories beyond the diff-vs-prompt gate — contract.* (child-vs-parent authorization narrowing) and handoff.* (advisory LLM verifier checks) — are reserved for the agent-authorization layer.
Severity: env / endpoint / cron = high · dep / file = medium · feature = low. Overall scope_creep_score: HIGH if any high finding, MEDIUM if any medium, else LOW.
How it works (3 stages)
Stage 1 — Scope extraction (LLM). Reads your prompt and produces an authorized scope JSON: which files, features, deps, endpoints, env, and behaviors you actually asked for. Deciphers typos to the nearest real concept but never invents scope. This is the only stage that calls a model.
Stage 2 — Diff parsing (deterministic, no LLM). Regex-parses the diff into the set of things it actually adds.
Stage 3 — Comparison (deterministic). Set arithmetic with fuzzy matching: actual − authorized = findings.
Stages 2 and 3 are pure functions — that's what makes Overreach auditable and testable without spending a cent on inference.
Install
cd Overreach npm install
Overreach needs an API key for Stage 1 scope extraction — bring whichever provider you already use:
Provider Env vars
Anthropic ANTHROPIC_API_KEY
OpenAI / OpenAI-compatible (OpenRouter, Groq, Together, LM Studio, …) OPENAI_API_KEY + OPENAI_BASE_URL (e.g. http://localhost:1234/v1 for LM Studio)
Ollama (Cloud or self-hosted) OLLAMA_API_KEY + OLLAMA_BASE_URL
Pin a provider/model with SCOPE_PROVIDER and OVERREACH_MODEL. Set the key in the environment where your agent runs.
No key? No problem. Without an API key, Overreach falls back to deterministic scope extraction — it regex-parses your prompt for concrete items (file paths, package names, /api/... routes, SCREAMING_SNAKE_CASE env vars, cron keywords) instead of calling an LLM. It won't understand vague instructions as well as an LLM would, but it catches every concrete noun in your prompt and is instant, free, and fully offline.
Use it
CLI (manual check)
npx -y -p overreach overreach-cli --prompt "add a login form to the settings page" --diff my-changes.diff
Or pipe a diff: git diff | npx -y -p overreach overreach-cli --prompt "add a login form to the settings page".
Prints the CheckResult JSON (or pretty terminal output). Exits 0 if clean, 1 if findings — usable as a CI gate. Zero-key demo: npx -y -p overreach overreach-cli demo.
MCP server (Claude Code, Cursor, Codex, Claude Desktop)
Overreach is a stdio MCP server, so any MCP-capable client can connect. Config for each:
Claude Code — claude mcp add overreach -- npx -y overreach
Claude Desktop / Cursor — add to your MCP config:
{ "mcpServers": { "overreach": { "command": "npx", "args": ["-y", "overreach"] } } }
Codex CLI — add to ~/.codex/config.toml:
[mcp_servers.overreach] command = "npx" args = ["-y", "overreach"]
Or Streamable HTTP: set PORT=8787 and POST to http://localhost:8787/mcp.
The HTTP endpoint has no auth. It binds to 127.0.0.1 (loopback) by default — safe for local use. Do not expose it publicly (OVERREACH_HOST=0.0.0.0) without an authed reverse proxy in front: anyone who can reach it can call check_overreach and spend your LLM budget.
Tools exposed: check_overreach(prompt, diff, options?) and health.
First-time setup (Claude Code)
1. Register the server with Claude Code (one time)
claude mcp add overreach -- npx -y overreach
2. Restart your Claude Code session
(a session already open won't see the new server until you quit and reopen it)
3. Set an API key in the environment you launch Claude Code from
export ANTHROPIC_API_KEY=sk-... # or OPENAI_API_KEY / OLLAMA_API_KEY
After the restart, every new session has check_overreach available — no per-task setup. The agent calls it when it decides it's relevant.
The key isn't passed through automatically. The MCP server is a separate process; your agent does not hand it its own credentials. If you log in to Claude Code with claude login (OAuth / subscription), there's no ANTHROPIC_API_KEY in the environment — so export one (any provider works; local Ollama needs no key), or for Claude Desktop / Cursor add it to the server's env:
{ "mcpServers": { "overreach": { "command": "npx", "args": ["-y", "overreach"], "env": { "ANTHROPIC_API_KEY": "sk-..." } } } }
The agent self-audit pattern
Have your agent call check_overreach with its own task string + the diff it's about to commit, and surface any HIGH finding before it ships:
git diff --staged | overreach-cli --prompt ""
This is best-effort — an agent can skip the call or ignore the findings (fox guarding the henhouse). The hard backstop is the CI gate below.
Verify it works (zero API key)
npm test
Runs two fixtures through the real pipeline with the scope injected via scopeOverride, so Stage 1 (the LLM) is never called:
overreach fixture — prompt asks for a login form; diff smuggles Stripe + an env var + an endpoint + a cron job → expects ≥4 findings and HIGH.
clean fixture — prompt asks for a logout button; diff only adds the button → expects 0 findings and LOW.
Prints N passed, M failed. This is the proof it works without spending money.
Pre-commit hook
Catch scope creep before the commit lands — not at PR time.
npx -y -p overreach overreach-cli init
This installs a git pre-commit hook and creates .overreach/prompt.md. Edit that file with the prompt you gave your AI agent. Every commit is then audited against it:
HIGH scope creep → commit blocked (exit 1)
MEDIUM / LOW → commit allowed with findings printed
Template prompt (not yet edited) → skipped gracefully
No API key → paranoid mode (flags everything)
Skip with git commit --no-verify when you know what you're doing. Update .overreach/prompt.md whenever you give the agent a new task.
CI gate (GitHub Action)
The hard backstop. A workflow runs Overreach on every pull request and fails the PR when scope_creep_score=HIGH — the diff adds a dep / env var / endpoint / cron / out-of-scope file the prompt didn't authorize.
Copy .github/workflows/overreach.yml into your repo and add ANTHROPIC_API_KEY (or OPENAI_API_KEY / OLLAMA_API_KEY) as a repository secret. The prompt comes from .overreach/prompt.md in the repo, or the PR title + body if that file is absent. The job posts its findings as a PR comment and fails the check on HIGH. Full setup + customization in docs/ci-gate.md.
.github/workflows/overreach.yml (excerpt)
- name: Run Overreach
run: | npx -y -p overreach@latest overreach-cli \ --prompt "$(cat "$RUNNER_TEMP/prompt.txt")" --diff "$RUNNER_TEMP/pr.diff" env: ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
- name: Gate — fail the PR on HIGH
if: steps.overreach.outputs.exit == '1' run: exit 1
This open-source Action is free to run (you bring your own LLM key).
Standalone
Overreach is fully self-contained. It does not import or depend on any other project. It reads only its own process environment. No telemetry, no call-home — it runs entirely on your machine.
Bugs & feedback
If Overreach misses something it should flag, or flags something the prompt authorized, open an issue with the prompt + the smallest repro diff:
https://github.com/Naveja00/OverReach/issues
There's a bug-report template that asks for exactly that.
License
MIT
About
No description, website, or topics provided.
Resources
Readme
License
MIT license
Uh oh!
There was an error while loading. Please reload this page.
Activity
Stars
1 star
Watchers
0 watching
Forks
0 forks
Report repository
Releases
No releases published
Packages 0
Uh oh!
There was an error while loading. Please reload this page.
Contributors
Uh oh!
There was an error while loading. Please reload this page.
Languages
TypeScript 93.8%
JavaScript 6.2%