преди 1 месец · f5bbc26c60
--- a/.cursor/rules/codegraph.mdc
+++ b/.cursor/rules/codegraph.mdc
@@ -19,16 +19,17 @@ Use codegraph for **structural** questions — what calls what, what would break
 
															 | "What would break if I changed Z?" | `codegraph_impact` |
														
 
															 | "Show me Y's signature / source / docstring" | `codegraph_node` |
														
 
															 | "Give me focused context for a task/area" | `codegraph_context` |
														
 
															-| "Survey an unfamiliar module/topic" | `codegraph_explore` |
														
 
															+| "See several related symbols' source at once" | `codegraph_explore` |
														
 
															 | "What files exist under path/" | `codegraph_files` |
														
 
															 | "Is the index healthy?" | `codegraph_status` |
														
 
															 ### Rules of thumb
														
 
															+- **Answer directly — don't delegate exploration.** For "how does X work" / architecture / trace questions, answer with 2-3 codegraph calls: `codegraph_context` first, then ONE `codegraph_explore` for the source of the symbols it surfaces. Codegraph IS the pre-built index, so spawning a separate file-reading sub-task/agent — or running a grep + read loop — repeats work codegraph already did and costs more for the same answer.
														
 
															 - **Trust codegraph results.** They come from a full AST parse. Do NOT re-verify them with grep — that's slower, less accurate, and wastes context.
														
 
															 - **Don't grep first** when looking up a symbol by name. `codegraph_search` is faster and returns kind + location + signature in one call.
														
 
															 - **Don't chain `codegraph_search` + `codegraph_node`** when you just want context — `codegraph_context` is one call.
														
 
															-- **`codegraph_explore` is the heavy hitter** for unfamiliar areas — it returns full source from all relevant files in one call, but is token-heavy. If your harness supports parallel subagents (e.g., Claude Code's Task tool), spawn one for explore-class questions to keep main session context clean.
														
 
															+- **Don't loop `codegraph_node` over many symbols** — one `codegraph_explore` call returns several symbols' source grouped in a single capped call, while each separate node/Read call re-reads the whole context and costs far more.
														
 
															 - **Index lag**: the file watcher debounces ~500ms behind writes; don't re-query immediately after editing a file in the same turn.
														
 
															 ### If `.codegraph/` doesn't exist
														
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -33,6 +33,25 @@ and adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
															   setup is actually fast. `codegraph uninit` removes any hooks it installed.
														
 
															 ### Changed
														
 
															+- **MCP / agent guidance**: CodeGraph now tells agents to answer "how does X
														
 
															+  work" / architecture questions *directly* — `codegraph_context`, then one
														
 
															+  `codegraph_explore` for the surfaced symbols — instead of delegating to a
														
 
															+  file-reading sub-agent or a grep+read loop. The server instructions and the
														
 
															+  installed instruction files (`CLAUDE.md`, `.cursor/rules/codegraph.mdc`,
														
 
															+  `AGENTS.md`) previously suggested *spawning a sub-agent* for explore-class
														
 
															+  questions, which produced the opposite, more expensive behavior: the
														
 
															+  sub-agent reads files regardless of the index, so CodeGraph became overhead
														
 
															+  stacked on top of the reads. In rigorous N≥4-per-arm benchmarks this cut the
														
 
															+  cost of an architecture question by ~42–47% versus a no-CodeGraph agent on
														
 
															+  medium and large repos (Excalidraw ~600 files, VS Code ~10k), with
														
 
															+  equal-or-better, `file:line`-cited answers and ~6× fewer tool calls; on a
														
 
															+  tiny repo (~25 files) it's a wash, since native grep is already trivially
														
 
															+  cheap there.
														
 
															+- **MCP / codegraph_node**: `includeCode=true` on a class/interface/struct/enum
														
 
															+  now returns a compact member outline (fields + method signatures + line
														
 
															+  numbers) instead of the entire class body — which could be thousands of
														
 
															+  characters and was rarely needed in full. Functions and methods still return
														
 
															+  their full body; request a specific member for its source.
														
 
															 - **Minimum Node.js is now 20** (was 18). Node 18 is end-of-life and the
														
 
															   native SQLite binding (`better-sqlite3` 12.x) no longer ships a Node 18
														
 
															   prebuilt binary. Node 22 LTS and Node 24 get the native backend out of the
														
@@ -48,7 +67,7 @@ and adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
															   now scales with indexed file count: small projects (<500 files) cap at
														
 
															   ~18KB and skip the "Additional relevant files" / completeness / explore-
														
 
															   budget reminders that earn their keep on bigger codebases; medium
														
 
															-  (<5,000) caps at ~28KB; large (<15,000) keeps the historical ~35KB; very
														
 
															+  (<5,000) caps at ~13KB; large (<15,000) keeps the historical ~35KB; very
														
 
															   large goes up to ~38KB. A new per-file char cap also prevents a single
														
 
															   file with many adjacent symbols from collapsing into one whole-file dump
														
 
															   (the Alamofire `Session.swift` case from #185). Per-file cluster
														
@@ -63,6 +82,11 @@ and adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
															   Thanks to [@essopsp](https://github.com/essopsp) for the repro.
														
 
															 ### Fixed
														
 
															+- **MCP / explore**: `codegraph_explore` output is now hard-capped to its
														
 
															+  adaptive size budget. It could previously overrun (e.g. ~30K against a 28K
														
 
															+  cap) once the relationship map and trailer sections were appended; the
														
 
															+  oversized payload then sat in the agent's context and was re-read on every
														
 
															+  later turn.
														
 
															 - **Sync / status**: git-untracked files are no longer reported as pending
														
 
															   "Added" forever. After `codegraph sync` indexed a newly-created untracked
														
 
															   source file, `codegraph status` kept listing it under Pending Changes and
														
--- a/README.md
+++ b/README.md
@@ -4,7 +4,7 @@
 
															 ### Supercharge Claude Code, Cursor, Codex, and OpenCode with Semantic Code Intelligence
														
 
															-**94% fewer tool calls · 77% faster exploration · 100% local**
														
 
															+**~35% cheaper · ~70% fewer tool calls · 100% local**
														
 
															 [![npm version](https://img.shields.io/npm/v/@colbymchenry/codegraph.svg)](https://www.npmjs.com/package/@colbymchenry/codegraph)
														
 
															 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
														
@@ -50,61 +50,50 @@ When Claude Code explores a codebase, it spawns **Explore agents** that scan fil
 
															 ### Benchmark Results
														
 
															-Tested across 6 real-world codebases comparing Claude Code's Explore agent **with** and **without** CodeGraph:
														
 
															+Tested across **7 real-world open-source codebases** spanning 7 languages, comparing an agent (Claude Code, headless) answering one architecture question **with** and **without** CodeGraph. Each cell is the savings at the **median of 4 runs per arm**.
														
 
															-> **Average: 92% fewer tool calls · 71% faster**
														
 
															+> **Average: 35% cheaper · 59% fewer tokens · 49% faster · 70% fewer tool calls**
														
 
															-| Codebase | With CG | Without CG | Improvement |
														
 
															-|----------|---------|------------|-------------|
														
 
															-| **VS Code** · TypeScript | 3 calls, 17s | 52 calls, 1m 37s | **94% fewer · 82% faster** |
														
 
															-| **Excalidraw** · TypeScript | 3 calls, 29s | 47 calls, 1m 45s | **94% fewer · 72% faster** |
														
 
															-| **Claude Code** · Python + Rust | 3 calls, 39s | 40 calls, 1m 8s | **93% fewer · 43% faster** |
														
 
															-| **Claude Code** · Java | 1 call, 19s | 26 calls, 1m 22s | **96% fewer · 77% faster** |
														
 
															-| **Alamofire** · Swift | 3 calls, 22s | 32 calls, 1m 39s | **91% fewer · 78% faster** |
														
 
															-| **Swift Compiler** · Swift/C++ | 6 calls, 35s | 37 calls, 2m 8s | **84% fewer · 73% faster** |
														
 
															+| Codebase | Language | Cost | Tokens | Time | Tool calls |
														
 
															+|----------|----------|------|--------|------|------------|
														
 
															+| **VS Code** | TypeScript · ~10k files | 35% cheaper | 73% fewer | 41% faster | 72% fewer |
														
 
															+| **Excalidraw** | TypeScript · ~600 | 47% cheaper | 73% fewer | 60% faster | 86% fewer |
														
 
															+| **Django** | Python · ~2.7k | 34% cheaper | 64% fewer | 59% faster | 81% fewer |
														
 
															+| **Tokio** | Rust · ~700 | 52% cheaper | 81% fewer | 63% faster | 89% fewer |
														
 
															+| **OkHttp** | Java · ~640 | 17% cheaper | 41% fewer | 36% faster | 64% fewer |
														
 
															+| **Gin** | Go · ~150 | 22% cheaper | 23% fewer | 34% faster | 19% fewer |
														
 
															+| **Alamofire** | Swift · ~100 | 38% cheaper | 59% fewer | 51% faster | 77% fewer |
														
 
															+
														
 
															+The gains scale with codebase size: on large repos the agent answers from the index in a handful of calls with **zero file reads**, while the no-CodeGraph agent fans out across grep/find/Read (and the sub-agents it spawns). On a small repo like Gin (~150 files) native search is already cheap, so the margin narrows.
														
 
															 <details>
														
 
															 <summary><strong>Full benchmark details</strong></summary>
														
 
															-All tests used Claude Opus 4.6 (1M context) with Claude Code v2.1.91. Each test spawned a single Explore agent with the same question.
														
 
															+**Methodology.** Each arm is `claude -p` (Claude Opus 4.7, Claude Code v2.1.145) run headlessly against the repo with `--strict-mcp-config`: **WITH** = CodeGraph's MCP server enabled, **WITHOUT** = an empty MCP config. Built-in Read/Grep/Bash stay available to both. Same question per repo, **4 runs per arm, median reported**. Cost = the run's `total_cost_usd`; Tokens = total tokens processed (input incl. cached + output); Time = wall-clock; Tool calls = every tool invocation, including those inside any sub-agents the model spawns. Repos cloned at `--depth 1` and indexed by the same CodeGraph build that served them.
														
 
															-**Queries used:**
														
 
															+**Queries:**
														
 
															 | Codebase | Query |
														
 
															 |----------|-------|
														
 
															 | VS Code | "How does the extension host communicate with the main process?" |
														
 
															-| Excalidraw | "How does collaborative editing and real-time sync work?" |
														
 
															-| Claude Code (Python+Rust) | "How does tool execution work end to end?" |
														
 
															-| Claude Code (Java) | "How does tool execution work end to end?" |
														
 
															-| Alamofire | "Trace how a request flows from Session.request() through to the URLSession layer" |
														
 
															-| Swift Compiler | "How does the Swift compiler handle error diagnostics?" |
														
 
															-
														
 
															-**With CodeGraph — the agent uses `codegraph_explore` and stops:**
														
 
															-| Codebase | Files Indexed | Nodes | Tool Uses | Tokens | Time | File Reads |
														
 
															-|----------|--------------|-------|-----------|--------|------|------------|
														
 
															-| VS Code (TypeScript) | 4,002 | 59,377 | 3 | 56.6k | 17s | 0 |
														
 
															-| Excalidraw (TypeScript) | 626 | 9,859 | 3 | 57.1k | 29s | 0 |
														
 
															-| Claude Code (Python+Rust) | 115 | 3,080 | 3 | 67.1k | 39s | 0 |
														
 
															-| Claude Code (Java) | — | — | 1 | 40.8k | 19s | 0 |
														
 
															-| Alamofire (Swift) | 102 | 2,624 | 3 | 57.3k | 22s | 0 |
														
 
															-| Swift Compiler (Swift/C++) | 25,874 | 272,898 | 6 | 77.4k | 35s | 0 |
														
 
															-
														
 
															-**Without CodeGraph — the agent uses grep, find, ls, and Read extensively:**
														
 
															-| Codebase | Tool Uses | Tokens | Time | File Reads |
														
 
															-|----------|-----------|--------|------|------------|
														
 
															-| VS Code (TypeScript) | 52 | 89.4k | 1m 37s | ~15 |
														
 
															-| Excalidraw (TypeScript) | 47 | 77.9k | 1m 45s | ~20 |
														
 
															-| Claude Code (Python+Rust) | 40 | 69.3k | 1m 8s | ~15 |
														
 
															-| Claude Code (Java) | 26 | 73.3k | 1m 22s | ~15 |
														
 
															-| Alamofire (Swift) | 32 | 52.4k | 1m 39s | ~10 |
														
 
															-| Swift Compiler (Swift/C++) | 37 | 99.1k | 2m 8s | ~20 |
														
 
															-
														
 
															-**Key observations:**
														
 
															-- With CodeGraph, the agent **never fell back to reading files** — it trusted the codegraph_explore results completely
														
 
															-- Without CodeGraph, agents spent most of their time on discovery (find, ls, grep) before they could even start reading relevant code
														
 
															-- The Java codebase needed only **1 codegraph_explore call** to answer the entire question
														
 
															-- Cross-language queries (Python+Rust) worked seamlessly — CodeGraph's graph traversal found connections across language boundaries
														
 
															-- The Swift benchmark (Alamofire) traced a **9-step call chain** from `Session.request()` to `URLSession.dataTask()` — CodeGraph's graph traversal at depth 3 captured the full chain in one explore call
														
 
															-- The **Swift Compiler** benchmark is the largest codebase tested (**25,874 files, 272,898 nodes**) — CodeGraph indexed it in under 4 minutes and the agent answered a complex cross-cutting question with **6 explore calls and zero file reads** in 35 seconds
														
 
															+| Excalidraw | "How does Excalidraw render and update canvas elements?" |
														
 
															+| Django | "How does Django's ORM build and execute a query from a QuerySet?" |
														
 
															+| Tokio | "How does tokio schedule and run async tasks on its runtime?" |
														
 
															+| OkHttp | "How does OkHttp process a request through its interceptor chain?" |
														
 
															+| Gin | "How does gin route requests through its middleware chain?" |
														
 
															+| Alamofire | "How does Alamofire build, send, and validate a request?" |
														
 
															+
														
 
															+**Raw medians — WITH → WITHOUT:**
														
 
															+| Codebase | Cost | Tokens | Time | Tool calls |
														
 
															+|----------|------|--------|------|------------|
														
 
															+| VS Code | $0.42 → $0.64 | 393k → 1.4M | 1m 0s → 1m 43s | 7 → 23 |
														
 
															+| Excalidraw | $0.54 → $1.02 | 851k → 3.2M | 1m 17s → 3m 14s | 12 → 83 |
														
 
															+| Django | $0.41 → $0.62 | 499k → 1.4M | 1m 0s → 2m 25s | 9 → 48 |
														
 
															+| Tokio | $0.50 → $1.04 | 657k → 3.4M | 1m 5s → 2m 56s | 9 → 75 |
														
 
															+| OkHttp | $0.36 → $0.44 | 352k → 596k | 45s → 1m 11s | 5 → 14 |
														
 
															+| Gin | $0.36 → $0.46 | 431k → 562k | 47s → 1m 11s | 7 → 8 |
														
 
															+| Alamofire | $0.61 → $0.99 | 1.1M → 2.6M | 1m 19s → 2m 41s | 15 → 64 |
														
 
															+
														
 
															+**Why CodeGraph wins:** with the index available, the agent answers directly — `codegraph_context` to map the area, then one `codegraph_explore` for the relevant source — and stops, usually with zero file reads. Without it, the agent (and the Explore sub-agents it spawns) spends most of its budget on discovery (find/ls/grep) before reading the right code. CodeGraph only helps when queried *directly*, so its instructions steer agents to answer directly rather than delegate exploration to file-reading sub-agents — otherwise a sub-agent reads files regardless and CodeGraph becomes overhead.
														
 
															 </details>
														
--- a/src/installer/instructions-template.ts
+++ b/src/installer/instructions-template.ts
@@ -37,16 +37,17 @@ Use codegraph for **structural** questions — what calls what, what would break
 
															 | "What would break if I changed Z?" | \`codegraph_impact\` |
														
 
															 | "Show me Y's signature / source / docstring" | \`codegraph_node\` |
														
 
															 | "Give me focused context for a task/area" | \`codegraph_context\` |
														
 
															-| "Survey an unfamiliar module/topic" | \`codegraph_explore\` |
														
 
															+| "See several related symbols' source at once" | \`codegraph_explore\` |
														
 
															 | "What files exist under path/" | \`codegraph_files\` |
														
 
															 | "Is the index healthy?" | \`codegraph_status\` |
														
 
															 ### Rules of thumb
														
 
															+- **Answer directly — don't delegate exploration.** For "how does X work" / architecture / trace questions, answer with 2-3 codegraph calls: \`codegraph_context\` first, then ONE \`codegraph_explore\` for the source of the symbols it surfaces. Codegraph IS the pre-built index, so spawning a separate file-reading sub-task/agent — or running a grep + read loop — repeats work codegraph already did and costs more for the same answer.
														
 
															 - **Trust codegraph results.** They come from a full AST parse. Do NOT re-verify them with grep — that's slower, less accurate, and wastes context.
														
 
															 - **Don't grep first** when looking up a symbol by name. \`codegraph_search\` is faster and returns kind + location + signature in one call.
														
 
															 - **Don't chain \`codegraph_search\` + \`codegraph_node\`** when you just want context — \`codegraph_context\` is one call.
														
 
															-- **\`codegraph_explore\` is the heavy hitter** for unfamiliar areas — it returns full source from all relevant files in one call, but is token-heavy. If your harness supports parallel subagents (e.g., Claude Code's Task tool), spawn one for explore-class questions to keep main session context clean.
														
 
															+- **Don't loop \`codegraph_node\` over many symbols** — one \`codegraph_explore\` call returns several symbols' source grouped in a single capped call, while each separate node/Read call re-reads the whole context and costs far more.
														
 
															 - **Index lag**: the file watcher debounces ~500ms behind writes; don't re-query immediately after editing a file in the same turn.
														
 
															 ### If \`.codegraph/\` doesn't exist
														
--- a/src/mcp/server-instructions.ts
+++ b/src/mcp/server-instructions.ts
@@ -22,6 +22,18 @@ in the workspace. Reads are sub-millisecond; the index lags writes by
 
															 about a second through the file watcher. Consult it BEFORE writing or
														
 
															 editing code, not during.
														
 
															+## Answer directly — don't delegate exploration
														
 
															+
														
 
															+For "how does X work", architecture, trace, or where-is-X questions,
														
 
															+answer DIRECTLY using 2-3 codegraph calls: \`codegraph_context\` first,
														
 
															+then ONE \`codegraph_explore\` for the source of the symbols it surfaces.
														
 
															+Codegraph IS the pre-built search index — so delegating the lookup to a
														
 
															+separate file-reading sub-task/agent, or running your own grep + read
														
 
															+loop, repeats work codegraph already did and costs more for the same
														
 
															+answer. Reach for raw Read/Grep only to confirm a specific detail
														
 
															+codegraph didn't cover. A direct codegraph answer is typically a handful
														
 
															+of calls; a grep/read exploration is dozens.
														
 
															+
														
 
															 ## Tool selection by intent
														
 
															 - **"What is the symbol named X?"** → \`codegraph_search\`
														
@@ -30,7 +42,7 @@ editing code, not during.
 
															 - **"What does this call?"** → \`codegraph_callees\`
														
 
															 - **"What would changing this break?"** → \`codegraph_impact\`
														
 
															 - **"Show me this symbol's source / signature / docstring."** → \`codegraph_node\`
														
 
															-- **"Survey an unfamiliar topic / pattern / module."** → \`codegraph_explore\` (heavier; deep dive)
														
 
															+- **"Show me several related symbols' source / survey an area."** → \`codegraph_explore\` (ONE capped call; prefer over many codegraph_node/Read)
														
 
															 - **"What's in directory X?"** → \`codegraph_files\`
														
 
															 - **"Is the index ready / what's its size?"** → \`codegraph_status\`
														
@@ -44,7 +56,7 @@ editing code, not during.
 
															 - **Don't grep first** when looking up a symbol by name — \`codegraph_search\` is faster and returns kind + location + signature.
														
 
															 - **Don't chain \`codegraph_search\` + \`codegraph_node\`** when you just want context — \`codegraph_context\` is one round-trip.
														
 
															-- **Don't use \`codegraph_explore\` for narrow questions** — it's a multi-call deep dive, expensive in tokens. Save it for genuine "I'm new here" surveys.
														
 
															+- **Don't loop \`codegraph_node\` over many symbols** — one \`codegraph_explore\` call returns them all grouped by file, while each separate call re-reads the whole context and costs far more. Use \`codegraph_node\` for a single symbol.
														
 
															 - **Don't query the index immediately after editing a file** — the watcher needs ~500ms to debounce + sync. Wait for the next turn.
														
 
															 ## Limitations
														
--- a/src/mcp/tools.ts
+++ b/src/mcp/tools.ts
@@ -25,6 +25,16 @@ const MAX_OUTPUT_LENGTH = 15000;
 
															  */
														
 
															 const RUST_PATH_PREFIXES = new Set(['crate', 'super', 'self']);
														
 
															+/**
														
 
															+ * Node kinds that contain other symbols. For these, `codegraph_node` with
														
 
															+ * `includeCode=true` returns a structural outline (member names + signatures
														
 
															+ * + line numbers) instead of the full body, which for a large class is a
														
 
															+ * multi-thousand-character wall of source that bloats the agent's context.
														
 
															+ */
														
 
															+const CONTAINER_NODE_KINDS = new Set<NodeKind>([
														
 
															+  'class', 'struct', 'interface', 'trait', 'protocol', 'enum', 'namespace', 'module',
														
 
															+]);
														
 
															+
														
 
															 /** Last `::` / `.` / `/`-separated segment of a qualified symbol. */
														
 
															 function lastQualifierPart(symbol: string): string {
														
 
															   const parts = symbol.split(/::|[./]/).filter((p) => p.length > 0);
														
@@ -102,12 +112,12 @@ export function getExploreOutputBudget(fileCount: number): ExploreOutputBudget {
 
															   }
														
 
															   if (fileCount < 5000) {
														
 
															     return {
														
 
															-      maxOutputChars: 28000,
														
 
															-      defaultMaxFiles: 9,
														
 
															-      maxCharsPerFile: 5000,
														
 
															-      gapThreshold: 12,
														
 
															-      maxSymbolsInFileHeader: 10,
														
 
															-      maxEdgesPerRelationshipKind: 10,
														
 
															+      maxOutputChars: 13000,
														
 
															+      defaultMaxFiles: 6,
														
 
															+      maxCharsPerFile: 2500,
														
 
															+      gapThreshold: 10,
														
 
															+      maxSymbolsInFileHeader: 8,
														
 
															+      maxEdgesPerRelationshipKind: 8,
														
 
															       includeRelationships: true,
														
 
															       includeAdditionalFiles: true,
														
 
															       includeCompletenessSignal: true,
														
@@ -263,7 +273,7 @@ export const tools: ToolDefinition[] = [
 
															   },
														
 
															   {
														
 
															     name: 'codegraph_context',
														
 
															-    description: 'PRIMARY TOOL: Build comprehensive context for a task. Returns entry points, related symbols, and key code - often enough to understand the codebase without additional tool calls. NOTE: This provides CODE context, not product requirements. For new features, still clarify UX/behavior questions with the user before implementing.',
														
 
															+    description: 'PRIMARY TOOL — call this FIRST for any "how does X work", architecture, feature, or bug-context question. Composes search + node + callers + callees and returns entry points, related symbols, and key code in ONE call — usually enough to answer with no further search/Read/Grep. Prefer this over chaining codegraph_search + codegraph_node, and over codegraph_explore. NOTE: provides CODE context, not product requirements; for new features still clarify UX/edge cases with the user.',
														
 
															     inputSchema: {
														
 
															       type: 'object',
														
 
															       properties: {
														
@@ -348,7 +358,7 @@ export const tools: ToolDefinition[] = [
 
															   },
														
 
															   {
														
 
															     name: 'codegraph_node',
														
 
															-    description: 'Get detailed information about a specific code symbol. Use includeCode=true only when you need the full source code - otherwise just get location and signature to minimize context usage.',
														
 
															+    description: 'Get detailed info about ONE symbol (location, signature, docstring). Pass includeCode=true for source: a function/method returns its body; a class/interface/struct/enum returns a compact member OUTLINE (fields + method signatures + line numbers), not every method body — Read or codegraph_node a specific member for its body. Keep includeCode=false to minimize context. For SEVERAL related symbols, make ONE codegraph_explore (or codegraph_context) call instead of many node calls — repeated node calls each re-read the whole context and cost far more.',
														
 
															     inputSchema: {
														
 
															       type: 'object',
														
 
															       properties: {
														
@@ -368,7 +378,7 @@ export const tools: ToolDefinition[] = [
 
															   },
														
 
															   {
														
 
															     name: 'codegraph_explore',
														
 
															-    description: 'Deep exploration tool — returns comprehensive context for a topic in a SINGLE call. Groups all relevant source code by file (contiguous sections, not snippets), includes a relationship map, and uses deeper graph traversal. Designed to replace multiple codegraph_node + file Read calls. Use this instead of codegraph_context when you need thorough understanding. IMPORTANT: Use specific symbol names, file names, or short code terms in your query — NOT natural language sentences. Before calling this, use codegraph_search to discover relevant symbol names, then include those names in your query. Bad: "how are agent prompts loaded and passed to the CLI". Good: "readAgentsFromDirectory createClaudeSession chat-manager agents.ts".',
														
 
															+    description: 'Returns source for SEVERAL related symbols grouped by file, plus a relationship map, in ONE capped call. This is the efficient way to inspect many related symbols at once — strongly prefer it over a series of codegraph_node or Read calls (each separate call re-reads the whole context, so 8 node calls cost far more than 1 explore). Use it after codegraph_context when you need to see the actual source of several symbols. Query with specific symbol/file/code terms, NOT natural-language sentences — run codegraph_search first to find names. Bad: "how are agent prompts loaded and passed to the CLI". Good: "renderStaticScene drawElementOnCanvas ShapeCache renderElement.ts".',
														
 
															     inputSchema: {
														
 
															       type: 'object',
														
 
															       properties: {
														
@@ -1241,7 +1251,20 @@ export class ToolHandler {
 
															       }
														
 
															     }
														
 
															-    return this.textResult(lines.join('\n'));
														
 
															+    // Hard-cap to the adaptive budget. The per-file loop bounds the source
														
 
															+    // sections, but the relationship map, additional-files list, and
														
 
															+    // completeness/budget notes can still push the assembled output past
														
 
															+    // maxOutputChars (observed 30k against a 28k tier cap). A fat explore
														
 
															+    // payload persists in the agent's context and is re-read as cache-input
														
 
															+    // on every subsequent turn, so the overrun is paid many times over.
														
 
															+    const output = lines.join('\n');
														
 
															+    if (output.length > budget.maxOutputChars) {
														
 
															+      const cut = output.slice(0, budget.maxOutputChars);
														
 
															+      const lastNewline = cut.lastIndexOf('\n');
														
 
															+      const safe = lastNewline > budget.maxOutputChars * 0.8 ? cut.slice(0, lastNewline) : cut;
														
 
															+      return this.textResult(safe + '\n\n... (explore output truncated to budget — use codegraph_node or Read for more)');
														
 
															+    }
														
 
															+    return this.textResult(output);
														
 
															   }
														
 
															   /**
														
@@ -1261,12 +1284,24 @@ export class ToolHandler {
 
															     }
														
 
															     let code: string | null = null;
														
 
															+    let outline: string | null = null;
														
 
															     if (includeCode) {
														
 
															-      code = await cg.getCode(match.node.id);
														
 
															+      // For container symbols (class/interface/struct/…), the full body is the
														
 
															+      // sum of every method body — a wall of source (e.g. a 10k-char class)
														
 
															+      // that bloats context and is rarely needed in full. Return a structural
														
 
															+      // outline (members + signatures + line numbers) instead; the agent can
														
 
															+      // Read or codegraph_node a specific method for its body. Leaf symbols
														
 
															+      // (function/method/etc.) return their full body as before.
														
 
															+      if (CONTAINER_NODE_KINDS.has(match.node.kind)) {
														
 
															+        outline = this.buildContainerOutline(cg, match.node);
														
 
															+      }
														
 
															+      if (!outline) {
														
 
															+        code = await cg.getCode(match.node.id);
														
 
															+      }
														
 
															     }
														
 
															-    const formatted = this.formatNodeDetails(match.node, code) + match.note;
														
 
															+    const formatted = this.formatNodeDetails(match.node, code, outline) + match.note;
														
 
															     return this.textResult(this.truncateOutput(formatted));
														
 
															   }
														
@@ -1716,7 +1751,29 @@ export class ToolHandler {
 
															     return lines.join('\n');
														
 
															   }
														
 
															-  private formatNodeDetails(node: Node, code: string | null): string {
														
 
															+  /**
														
 
															+   * Build a compact structural outline of a container symbol from its
														
 
															+   * indexed children (methods, fields, properties, …) — name, kind,
														
 
															+   * line number, and signature — so the agent gets the shape of a class
														
 
															+   * without the full source of every method. Returns '' when the container
														
 
															+   * has no indexed children, so the caller can fall back to full source.
														
 
															+   */
														
 
															+  private buildContainerOutline(cg: CodeGraph, node: Node): string {
														
 
															+    const children = cg.getChildren(node.id)
														
 
															+      .filter(c => c.kind !== 'import' && c.kind !== 'export')
														
 
															+      .sort((a, b) => (a.startLine ?? 0) - (b.startLine ?? 0));
														
 
															+    if (children.length === 0) return '';
														
 
															+
														
 
															+    const lines = [`**Members (${children.length}):**`, ''];
														
 
															+    for (const c of children) {
														
 
															+      const loc = c.startLine ? `:${c.startLine}` : '';
														
 
															+      const sig = c.signature ? ` — \`${c.signature}\`` : '';
														
 
															+      lines.push(`- ${c.name} (${c.kind})${loc}${sig}`);
														
 
															+    }
														
 
															+    return lines.join('\n');
														
 
															+  }
														
 
															+
														
 
															+  private formatNodeDetails(node: Node, code: string | null, outline?: string | null): string {
														
 
															     const location = node.startLine ? `:${node.startLine}` : '';
														
 
															     const lines: string[] = [
														
 
															       `## ${node.name} (${node.kind})`,
														
@@ -1733,7 +1790,10 @@ export class ToolHandler {
 
															       lines.push('', node.docstring);
														
 
															     }
														
 
															-    if (code) {
														
 
															+    if (outline) {
														
 
															+      lines.push('', outline, '',
														
 
															+        `> Structural outline only. Read \`${node.filePath}\` or call codegraph_node on a specific member for its body.`);
														
 
															+    } else if (code) {
														
 
															       lines.push('', '```' + node.language, code, '```');
														
 
															     }