GraphPilot gives coding agents a persistent structural index of your TypeScript or JavaScript codebase so they stop burning tokens on repetitive greps. It builds a local code graph (symbols, callers, callees, blast radius) and exposes four MCP tools: gp_recall for symbol lookup, gp_callers for finding who calls a function, gp_impact for refactor blast radius, and gp_index for on-demand reindexing. Every response includes file:line evidence anchors with commit SHAs. Run graphpilot index once to build the graph, graphpilot watch to keep it fresh on file saves, then wire graphpilot mcp into Claude, Cursor, or Cline. Benchmarks show 61% token reduction on structural queries compared to file reads alone, with sub-millisecond query times even on large repos.
GRAPHPILOT_ROOTAbsolute path to the repository GraphPilot should index and serve. Optional — falls back to the MCP client's workspace root or a parent-directory walk.
Structural memory for coding agents.
A refactor-safe, branch-aware, evidence-backed code graph that runs entirely on your machine.
Quickstart · Tools · How it works · Editor setup · Limitations · Benchmarks
GraphPilot is a local CLI + MCP server that indexes your TypeScript/JavaScript repo into a structural graph (symbols, callers, callees, blast radius) and exposes it to coding agents — Claude Code, Cursor, Cline, Windsurf, Continue — so they stop re-grepping the same files every conversation.
The problem it solves: agents burn tokens, hallucinate function names, and miss structural relationships ("what calls this?", "what breaks if I rename it?") because each session starts from zero. GraphPilot is the persistent structural memory in between.
Token cost drops. Hallucinations drop. Refactors get safer.
Put a real coding agent (claude-sonnet-4-5) on 40 structural questions about fastify — a ~300-file Node.js framework — and give it nothing but file reads. Then hand it GraphPilot's four tools and ask the same 40. The agent with GraphPilot uses 61 % fewer tokens, costs $3.68 instead of $8.88 — $5.20 saved per session — and gets more of them right, not fewer (37 correct vs 33). Same model, same questions, same repo; the only change is whether the structural index is there. Reproduce it →
A separate correctness benchmark backs the savings with precision: on 10 standardized structural queries GraphPilot scores F1 0.89 vs grep's 0.42 while reading 99.9 % fewer bytes (721 B vs 528 KB), and the byte-cost win holds at scale — indexing microsoft/TypeScript (601 files, 17 k symbols, 70 k call edges in 10 s) gives sub-millisecond queries and a 99.99 % bytes-read reduction. Full methodology →
GraphPilot ships as a single npm package (@graphpilot-oss/graphpilot) with two runtime modes — most users run both.
| Mode | Command | What it does |
|---|---|---|
| CLI | graphpilot index <path> | Walks your repo, builds the structural graph, writes it to ~/.graphpilot/ |
graphpilot watch <path> | Keeps the graph fresh (~10 ms per file save) | |
graphpilot status <path> | Health probe — when the graph was last refreshed, file/symbol/edge counts | |
| MCP server | graphpilot mcp | Speaks MCP over stdio — your coding agent calls into this to query the graph |
The flow: the CLI builds the index once (and watch keeps it warm). The MCP server is what your coding agent talks to — you never invoke it yourself, you just point your agent's MCP config at graphpilot mcp once and the agent spawns it on every session.
If you only want CLI access to your code graph (no agent), run graphpilot index and then graphpilot stats / inspect graph.json directly. If you only want the agent integration, you still need to run graphpilot index once — the MCP server is read-only against the on-disk graph.
Other code-graph tools treat your repo as a static blob: index once, query forever, no branch awareness, no proof of where an answer came from. GraphPilot is built around three properties none of them ship:
file:line @ sha on every symbol and call site. The agent can quote the anchor verbatim and you can verify it instantly — hallucinations get exposed the moment you jump to the line.since: <commit|branch> to gp_impact and the result is filtered to files your branch actually touches. PR-scoped refactor analysis in one call instead of git diff | xargs grep.git worktree add-ed branches naturally produce two separate indexes — no manual config. Run graphpilot index ./src/feature from a subdir and it transparently re-roots to the worktree top. Opt out with --no-worktree.Add to that: local-first (no telemetry, no remote calls, enforced by an ESLint policy on src/ itself), deterministic (same repo → same graph), sub-second incremental updates via watch mode.
Prerequisites
node --version to check)End-to-end time: ~3 minutes.
1. Install the CLI
npm install -g @graphpilot-oss/graphpilot
# or: pnpm add -g @graphpilot-oss/graphpilot
# or one-shot, no install: npx @graphpilot-oss/graphpilot <command>
Verify it landed on your PATH:
graphpilot --version
# → 0.1.0
If you see command not found: graphpilot, your global npm bin is not on PATH. Run npm config get prefix and add <prefix>/bin to your shell's PATH, or use the npx form above.
2. Build the structural index for your repo
Run this once per project. It walks your source tree, parses each TS/JS file with tree-sitter, extracts symbols + call edges, and writes the graph to ~/.graphpilot/<repo-id>/graph.json.
graphpilot index ~/code/my-app
Expect a one-line summary like indexed 412 files · 3,981 symbols · 7,204 edges · 1.8s.
3. Wire it into your coding agent
GraphPilot speaks MCP over stdio. Add this server entry to your agent's MCP config — every supported client uses the same two-line shape:
{
"mcpServers": {
"graphpilot": {
"command": "graphpilot",
"args": ["mcp"]
}
}
}
Where this file lives depends on the client (~/.cursor/mcp.json, ~/.claude.json, Cline's settings panel, etc.). Pre-made configs with the exact file path for each agent are in examples/ — copy the one for your client.
Restart the agent. It now has four new tools: gp_recall, gp_callers, gp_impact, gp_index — see The four tools below for what each one does and when the agent should reach for it.
4. Try it
Ask your agent a structural question instead of letting it grep:
"Use gp_impact to show me everything that breaks if I rename
parseToken."
You should see a response with file:line @ sha anchors you can click straight to. If the agent doesn't reach for the tool, prompt explicitly: "use the gp_ MCP tools." If it can't see them at all, the MCP config wasn't picked up — restart the agent and re-check the config path for your client in examples/.
5. Keep the index fresh as you edit (optional but recommended)
graphpilot watch ~/code/my-app
Sub-10 ms incremental updates on each file save. Leave it running in a terminal tab.
6. Drop per-editor routing rules into your repo (optional)
graphpilot init
Auto-detects which editors you have installed (Cursor, Claude Code, Cline, Windsurf, Continue) and writes the matching rules file (.cursorrules, CLAUDE.md, etc.) to the current directory. The rules teach the agent to reach for gp_* tools before grep.
graphpilot init --all # write rules for every supported editor
graphpilot init --client cursor # one editor only
graphpilot init --dry-run # preview without writing
Full 5-minute walkthrough with screenshots: docs/quickstart.md.
GraphPilot exposes four MCP tools. Each one answers a structural question your agent would otherwise solve by grepping and reading files.
gp_recall — find a symbol by nameUse this when the agent asks "where is X defined?" or needs to locate a function before reasoning about it.
{ query, limit?, substring?, path? }substring: true for partial matches), each with file:line @ sha.grep -rn "function X" plus reading each hit to find the real definition.Agent: gp_recall({ query: "parseToken" })
→ parseToken (function) — src/auth.ts:42 @ a1b2c3d
export function parseToken(raw: string): Token | null
gp_callers — list callers (or callees)Use this when the agent needs to know "who calls X?" or "what does X call?" — the two fundamental questions of refactoring.
{ symbol, direction?: 'callers' | 'callees', limit?, includeUnresolved?, path? }grep -rn "X(" followed by manual filtering of comments, strings, and renamed shadows.Agent: gp_callers({ symbol: "authenticate", direction: "callers" })
→ login → authenticate — src/routes/login.ts:18 @ a1b2c3d
→ refreshSession → authenticate — src/session.ts:64 @ a1b2c3d
gp_impact — blast radius in one callUse this when the agent asks "what breaks if I rename X?" or "what depends on this?" — the single most expensive question an agent normally solves.
{ symbol, depth? (1–5, default 3), since?, path? }since: 'main' and the result is scoped to files your branch actually touches — PR-scoped refactor review without git diff gymnastics.Agent: gp_impact({ symbol: "extractSymbols", depth: 2, since: "main" })
→ Direct callers (2): indexDirectory, applyUpdate
→ Depth-2 callers (1): cmdIndex
→ Tests affected (3): tests/indexer.test.ts, tests/symbols.test.ts, tests/cli.test.ts
→ Public API: no
gp_index — refresh from inside the agentUse this after the agent (or the user) has made a batch of structural edits and wants the graph to reflect them without dropping to a shell.
{ path? }graphpilot watch for sub-10 ms incremental updates between explicit re-indexes.
Data flow is one-way: source → tree → symbols + edges → JSON → query → agent. GraphPilot never modifies your code.
Full pipeline writeup with file references: docs/architecture.md.
| If the agent is about to… | Reach for… | Why |
|---|---|---|
grep for a function by name | gp_recall | One call, no false positives from comments or strings |
| Read 20 files looking for "who calls X" | gp_callers | Pre-computed reverse index, sub-millisecond |
| Plan a rename or signature change | gp_impact | Direct + transitive + tests + public-API in one call |
| Review a PR's structural blast radius | gp_impact({ since: 'main' }) | Differential — only callers your branch touches |
| Re-grep after editing several files | gp_index | Incremental: lets the next call see your edits |
For string literals, error messages, config values, or anything in a language other than TS/JS: stay with grep. GraphPilot indexes code structure, not text.
GraphPilot speaks MCP over stdio, so it works with any MCP-capable client. Ready-to-paste configs live in examples/:
| Client | Folder |
|---|---|
| Claude Code (Anthropic) | examples/claude-code/ |
| Cursor | examples/cursor/ |
| Cline (VS Code extension) | examples/cline/ |
| Windsurf (Codeium) | examples/windsurf/ |
| Continue.dev | examples/continue/ |
| Any other MCP client | See docs/mcp-setup.md |
Each folder contains: a README.md walkthrough, a sample config file with the exact JSON to paste, and (where the client supports it) a routing template so the agent automatically reaches for GraphPilot on structural questions.
GraphPilot is local-first by promise and by build gate.
src/ has zero http, fetch, axios, or analytics imports — enforced by an ESLint rule plus a meta-test that proves the rule fires on every banned import.child_process, no exec, no spawn. Git facts are read directly from .git/ via pure-JS helpers.~/.graphpilot/.sk-, GitHub ghp_/ghs_, AWS AKIA, JWTs, PEM headers, Slack/Stripe tokens) before they're written to disk.0o700, files 0o600.graph.json falls back to "no index" rather than poisoning the agent.Threat model and per-defence test references live in docs/architecture.md. Report security issues per SECURITY.md.
GraphPilot v0.1 makes deliberate trade-offs to ship small and sharp:
gp_recall is name-only. "Find code similar to this snippet" is deferred until 30+ users ask for it..graphpilotignore yet (defaults skip node_modules, dist, build, .git, coverage, .next, .nuxt, .cache, out, *.d.ts).Full list with mitigations: docs/limitations.md.
Does it send my code anywhere?
No. There is no network code in src/, no telemetry, no update check. An ESLint rule blocks adding any of those at the build gate.
Will it slow down my editor? The MCP server is idle until your agent calls a tool. Tool calls are sub-millisecond after the first lazy load. Watch mode adds ~3–10 ms per file save.
What happens to the graph when I switch branches?
If you use git worktree, you automatically get a separate graph per worktree. On a single working copy that you switch with git checkout, the graph reflects the last gp_index (or watch-mode updates). Run gp_index after a branch switch to refresh.
Do I need to re-index every session?
No. The graph persists at ~/.graphpilot/<repo-id>/graph.json. Re-index after sweeping changes; otherwise, watch mode keeps it fresh incrementally.
Why TypeScript/JavaScript first? That's where the maintainer's pain was, and tree-sitter-typescript covers TS, TSX, JSX, and JS in a single grammar. Python is the next likely addition; vote with a GitHub Discussion.
How does this compare to LSP? LSPs are scoped to one editor and one buffer at a time, and they re-compute on each query. GraphPilot is editor-agnostic, persists across sessions, and answers structural questions (who-calls, blast-radius) that LSPs don't expose uniformly.
docs/quickstart.md — 5-minute walkthroughdocs/mcp-setup.md — per-client config referencedocs/architecture.md — pipeline writeup with file refsdocs/limitations.md — v1 caveats (read this)bench/README.md — benchmark methodology + resultsexamples/ — ready-to-paste configs for every supported clientGraphPilot is small, opinionated, and accepting contributions. Start with CONTRIBUTING.md — especially the "What we are NOT doing in v1" section before you propose a feature.
Found a security issue? Please follow SECURITY.md instead of opening a public issue.
Apache-2.0. Copyright 2026 Akshay Sharma — codewithakki@gmail.com
io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage
io.github.mikerawsonnz/llm-orchestration-agent
io.github.mikerawsonnz/authenticated-llm-agent
labforgedev/copilot-memory-mcp
csoai-org/agent-prompt-injection-firewall-mcp
io.github.mikerawsonnz/authenticated-multi-llm-agent