|
|
@@ -7,13 +7,15 @@
|
|
|
* before it sees individual tool descriptions.
|
|
|
*
|
|
|
* Goals when editing this:
|
|
|
- * - Tool selection by intent (which tool for which question)
|
|
|
- * - Common chains (refactor planning = X then Y)
|
|
|
- * - Anti-patterns (don't grep when codegraph_search is faster)
|
|
|
+ * - Lead the agent to codegraph_explore for any structural/flow question
|
|
|
+ * - Reinforce "explore instead of Read/Grep" for indexed code
|
|
|
+ * - Anti-patterns (don't re-verify with grep; don't hand-reconstruct flows)
|
|
|
*
|
|
|
* Keep it tight. The agent reads this every session — long instructions
|
|
|
- * burn tokens. Reference only tools that exist on `main`; gate any
|
|
|
- * conditional tools behind feature checks if/when they ship.
|
|
|
+ * burn tokens. The DEFAULT MCP surface is `codegraph_explore` ALONE (see
|
|
|
+ * DEFAULT_MCP_TOOLS in tools.ts) — reference only that tool here. The other
|
|
|
+ * tools (node/search/callers/…) stay defined and are re-enablable via
|
|
|
+ * CODEGRAPH_MCP_TOOLS, but they are NOT listed to agents, so don't name them.
|
|
|
*/
|
|
|
export const SERVER_INSTRUCTIONS = `# Codegraph — code intelligence over an indexed knowledge graph
|
|
|
|
|
|
@@ -27,45 +29,36 @@ verbatim source PLUS who calls it and what it affects, so you edit with the
|
|
|
blast radius in view. More accurate context, in far fewer tokens and
|
|
|
round-trips than reading files yourself.
|
|
|
|
|
|
-## Use codegraph instead of reading files — for questions AND edits
|
|
|
+## One tool: codegraph_explore — use it instead of reading files
|
|
|
|
|
|
-Whether you're answering "how does X work" or implementing a change (fixing
|
|
|
-a bug, adding a feature), reach for codegraph before you Read. For
|
|
|
-understanding, answer DIRECTLY — usually with ONE \`codegraph_explore\` call.
|
|
|
-\`codegraph_explore\` takes either a natural-language question or a bag of
|
|
|
-symbol/file names and returns the verbatim source of the relevant symbols
|
|
|
-grouped by file, so it is Read-equivalent and most often the ONLY
|
|
|
-codegraph call you need. Codegraph IS the pre-built search index — so
|
|
|
-delegating the lookup to a separate file-reading sub-task/agent, or
|
|
|
-running your own grep + read loop, repeats work codegraph already did and
|
|
|
-costs more for the same answer. Reach for raw Read/Grep only to confirm a
|
|
|
-specific detail codegraph didn't cover. A direct codegraph answer is
|
|
|
-typically one to a few calls; a grep/read exploration is dozens.
|
|
|
+There is a single tool, \`codegraph_explore\`, and it is Read-equivalent. It
|
|
|
+takes either a natural-language question or a bag of symbol/file names and
|
|
|
+returns the **verbatim, line-numbered source** of the relevant symbols
|
|
|
+grouped by file — the same \`<n>\\t<line>\` shape \`Read\` gives you, safe to
|
|
|
+\`Edit\` from — PLUS the call path among them (including dynamic-dispatch hops
|
|
|
+like callbacks, React re-render, and JSX children that grep can't follow) and
|
|
|
+a blast-radius summary of what depends on them.
|
|
|
|
|
|
-## Tool selection by intent
|
|
|
+Whether you're answering "how does X work" or implementing a change (fixing a
|
|
|
+bug, adding a feature), call \`codegraph_explore\` before you Read. ONE call
|
|
|
+usually answers the whole question. Codegraph IS the pre-built search index —
|
|
|
+so running your own grep + read loop, or delegating the lookup to a separate
|
|
|
+file-reading sub-task/agent, repeats work codegraph already did and costs more
|
|
|
+for the same answer. A direct codegraph answer is typically one to a few
|
|
|
+calls; a grep/read exploration is dozens.
|
|
|
|
|
|
-- **Almost any question — "how does X work", architecture, a bug, "what/where is X", or surveying an area** → \`codegraph_explore\` (PRIMARY — call FIRST; ONE capped call returns the verbatim source of the relevant symbols grouped by file; most often the ONLY call you need)
|
|
|
-- **"How does X reach/become Y? / the flow / the path from X to Y"** → \`codegraph_explore\`, naming the symbols that span the flow (e.g. \`mutateElement renderScene\`) — it surfaces the call path among them, including dynamic-dispatch hops (callbacks, React re-render, JSX children) grep can't follow
|
|
|
-- **"What is the symbol named X?" (just its location)** → \`codegraph_search\`
|
|
|
-- **"What calls this?" / "What would changing this break?"** → \`codegraph_callers\` — EVERY call site with file:line, including where a function is **registered as a callback** (passed as an argument, assigned to a function pointer/field, listed in a handler table) — labeled "via callback registration" — so a function with no direct calls is NOT dead if it's wired up somewhere. When several UNRELATED symbols share a name (one \`UserService\` per monorepo app), it reports **one section per definition** (never a merged list) — pass \`file\` to focus the definition you mean. The wider blast radius arrives automatically on \`codegraph_explore\` (its "Blast radius" section) and \`codegraph_node\` (the dependents note)
|
|
|
-- **"What does this call?"** → \`codegraph_node\` with that symbol and \`includeCode: true\` — the body IS the callee list, and the caller/callee trail comes with it
|
|
|
-- **Reading a source FILE (any time you'd use the \`Read\` tool)** → \`codegraph_node\` with a \`file\` path and no \`symbol\`. It returns the file's **current source with line numbers — the same \`<n>\\t<line>\` shape \`Read\` gives you, safe to \`Edit\` from** — narrowable with \`offset\`/\`limit\` exactly like \`Read\`, PLUS a one-line note of which files depend on it. Same bytes as \`Read\`, faster (served from the index), with the blast radius attached. Use it **instead of \`Read\`** for indexed source files; fall back to \`Read\` only for what codegraph doesn't index (configs, docs). Pass \`symbolsOnly: true\` for just the file's structure.
|
|
|
-- **About to read or edit a symbol you can name** → \`codegraph_node\` with that \`symbol\` (SECONDARY — the after-explore depth tool): the verbatim source (\`includeCode: true\`) PLUS its caller/callee trail, so before changing it you see what calls it and what your edit would break. For an OVERLOADED name it returns EVERY matching definition's body in one call, so you never Read a file to find the right overload
|
|
|
+## How to query
|
|
|
|
|
|
-## Common chains
|
|
|
-
|
|
|
-- **Flow / "how does X reach Y"**: ONE \`codegraph_explore\` with the symbol names spanning the flow — it surfaces the call path among them (riding dynamic-dispatch hops) AND returns their source. No need to reconstruct the path with \`codegraph_search\` + \`codegraph_callers\`.
|
|
|
-- **Onboarding / understanding any area**: ONE \`codegraph_explore\` is usually the whole answer. Only follow up — \`codegraph_node\` for a specific symbol — if something is still unclear.
|
|
|
-- **Refactor planning**: \`codegraph_callers\` for the complete call-site list to update; the wider blast radius is already attached to \`codegraph_explore\` / \`codegraph_node\` output.
|
|
|
-- **Debugging a regression**: \`codegraph_callers\` of the suspected symbol; \`codegraph_node\` on anything unexpected that appears.
|
|
|
+- **Almost any question — "how does X work", architecture, a bug, "what/where is X", or surveying an area** → \`codegraph_explore\` with a natural-language question or the relevant names. ONE capped call returns the verbatim source grouped by file; most often the ONLY call you need.
|
|
|
+- **"How does X reach/become Y? / the flow / the path from X to Y"** → \`codegraph_explore\`, naming the symbols that span the flow (e.g. \`mutateElement renderScene\`) — it surfaces the call path among them, riding dynamic-dispatch hops, and returns their source.
|
|
|
+- **Reading or editing a file/symbol you can name** → put its name or file path in the \`codegraph_explore\` query — it returns that current line-numbered source (safe to \`Edit\` from) with the call path and blast radius attached, so you don't Read it separately. For an overloaded name it returns every matching definition's body in one call.
|
|
|
+- **Need more?** Call \`codegraph_explore\` again with more specific names — treat the source it returns as already Read.
|
|
|
|
|
|
## Anti-patterns
|
|
|
|
|
|
- **Trust codegraph's results — don't re-verify them with grep.** They come from a full AST parse; re-checking with grep is slower, less accurate, and wastes context.
|
|
|
-- **Don't grep first** when looking up a symbol by name — \`codegraph_search\` is faster and returns kind + location + signature.
|
|
|
-- **Don't chain \`codegraph_search\` + \`codegraph_node\`** to understand an area — ONE \`codegraph_explore\` returns the relevant symbols' source together in a single round-trip.
|
|
|
-- **Don't loop \`codegraph_node\` over many symbols** — one \`codegraph_explore\` call returns them all grouped by file, while each separate call re-reads the whole context and costs far more. Use \`codegraph_node\` for a single symbol.
|
|
|
-- **Don't reach for the \`Read\` tool on an indexed source file** — \`codegraph_node\` with a \`file\` reads it for you (same \`<n>\\t<line>\` source, \`offset\`/\`limit\` like Read, faster, with its blast radius), and with a \`symbol\` it returns the source plus the caller/callee trail. Reach for raw \`Read\` only for what codegraph doesn't index (configs, docs) or when the staleness banner flags a file as pending re-index.
|
|
|
+- **Don't grep or Read first** to find or understand indexed code — ONE \`codegraph_explore\` returns the relevant symbols' source together in a single round-trip. Reach for raw \`Read\`/\`Grep\` only to confirm a specific detail codegraph didn't cover, or for what codegraph doesn't index (configs, docs).
|
|
|
+- **Don't reconstruct a flow by hand** — name the endpoints in one \`codegraph_explore\` and it surfaces the path between them, dynamic-dispatch hops included.
|
|
|
- **After editing, check the staleness banner.** When a tool response starts with "⚠️ Some files referenced below were edited since the last index sync…", the listed files are pending re-index — Read those specific files for accurate content. Every file NOT in that banner is fresh, so still trust codegraph. A different, rarer banner — "⚠️ CodeGraph auto-sync is DISABLED…" — means live watching stopped entirely (the whole index is frozen, not just a few files); until it's resolved, Read files directly to confirm anything that may have changed.
|
|
|
|
|
|
## Limitations
|