Audience: a Claude agent continuing this work. Mission: systematically close static-extraction coverage holes for dynamic dispatch across every language and framework codegraph supports, and validate each one the same way, so cross-symbol flows exist in the graph everywhere.
This is the top-level playbook. The deep design for one mechanism (the callback synthesizer) is in
callback-edge-synthesis.md. Full investigation context + findings: auto-memoryproject_codegraph_read_displacement.
codegraph's value is being the map — answering structural/flow questions
(trace, impact, callers, "how does X reach Y") that grep/Read cannot. Agents
will use codegraph instead of Read only when it is sufficient. We proved
empirically (see memory) that the lever for sufficiency is coverage, not
prompting/hooks/new-tools: when a flow is missing from the graph, the agent reads
the files to reconstruct it; when the flow is in the graph, the agent can answer
completely without reading.
Validated end-to-end on excalidraw: after closing the update-flow hole, 2/3 headless agent runs answered the "how does an update reach the screen" question with Read 0 and a complete answer — impossible before, because the key edge wasn't in the graph. (Caveat: coverage enables the no-read path; agent confirm-by-reading variance means it doesn't force it. Completeness improves unconditionally.)
The mission is to make that true for all languages/frameworks.
Static tree-sitter extraction captures explicit calls (foo(), this.bar()). It
misses any call whose target is computed/indirect. Four recurring shapes, with a
difficulty gradient (do the cheap ones first):
| # | Shape | Example | Fix mechanism | Cost |
|---|---|---|---|---|
| 1 | Named attribute / descriptor | django self._iterable_class(self) |
framework resolver (claimsReference + resolve()) |
cheap |
| 2 | Field-backed observer | onUpdate(cb) + for(cb of cbs)cb() |
callback synthesizer (whole-graph pass) | medium |
| 3 | String-keyed EventEmitter | on('e',fn) / emit('e') |
callback synthesizer (event-keyed) | medium |
| 4 | Inline callback handler | on('e', function h(){}) / () => {} |
extraction (named) + synthesizer link-through-body (anon) | named: cheap · anon: hard |
Key distinction driving the mechanism choice:
_iterable_class is an attribute name) → resolver.cb() is anonymous; needs registrar↔dispatcher correlation) → synthesizer.QuerySet._fetch_all calls self._iterable_class(self) (a runtime-chosen
iterable, default ModelIterable), whose __iter__ runs the SQL compiler. Static
parsing can't resolve the attribute-as-callable → _fetch_all's only callee was
_prefetch_related_objects; trace(_fetch_all, execute_sql) returned no path.djangoResolver claims the unresolved _iterable_class ref through the
name-exists pre-filter, then resolves it to ModelIterable.__iter__.src/resolution/types.ts (claimsReference? on FrameworkResolver),
src/resolution/index.ts (pre-filter in resolveOne consults claimsReference),
src/resolution/frameworks/python.ts (djangoResolver.resolve + claimsReference +
resolveModelIterableIter).trace(_fetch_all, execute_sql) → _fetch_all → __iter__ → execute_sql (3 hops).Scene.triggerUpdate does for (cb of this.callbacks) cb(); triggerRender
is registered via scene.onUpdate(this.triggerRender). The triggerUpdate →
triggerRender edge is dynamic → trace returned no path; the whole update flow broke.dispatcher → callback edges. Plus extraction of
named inline callbacks so handlers like express's function onmount(){} are nodes.src/resolution/callback-synthesizer.ts (the pass — field observers +
EventEmitter), src/resolution/index.ts (calls synthesizeCallbackEdges() at the end
of resolveAndPersistBatched), src/extraction/tree-sitter.ts (visitFunctionBody
extracts named nested functions).trace(mutateElement, triggerRender) → 3 hops; express use → onmount.Every framework has a signature data/control flow. Pick the "how does X reach/become Y"
question and a real repo (add to .claude/skills/agent-eval/corpus.json). Examples:
@Controller→servicerm -rf <repo>/.codegraph && ( cd <repo> && codegraph init -i )
node scripts/agent-eval/probe-trace.mjs <repo> <from-symbol> <to-symbol> # does the flow break? where?
node scripts/agent-eval/probe-node.mjs <repo> <break-symbol> # trail: is the next hop missing?
A "No direct call path … breaks at dynamic dispatch" + a sparse trail at the break
point locates the hole (this is exactly how _iterable_class and triggerUpdate
were found). Confirm it's dynamic by reading the break symbol's body.
self.<attr>(...) / descriptor / metaclass → resolver (§3a).for(cb of store)cb() / store.forEach(cb=>cb()) → field-observer synthesizer (§3b).on('e',fn) + emit('e') → EventEmitter synthesizer (§3b).tree-sitter.ts); anonymous: synthesizer link-through-body (not yet built).src/resolution/frameworks/<lang>.ts — a resolve() branch +
claimsReference(name) if the ref name isn't a declared symbol. Copy djangoResolver.src/resolution/callback-synthesizer.ts — add the
framework's registrar/dispatcher name patterns and body patterns (e.g. signals
use .connect()/.emit(); Rx uses .subscribe()/.next()).probe-trace — the flow should now connect.probe-trace(from,to) finds the path; probe-node shows the
bridged hop. The previously-broken hop is closed.Precision: count + spot-check synthesized/resolved edges — no explosion, correct targets:
sqlite3 <repo>/.codegraph/codegraph.db \
"select s.name||' → '||t.name||' '||coalesce(e.metadata,'') from edges e \
join nodes s on e.source=s.id join nodes t on e.target=t.id where e.provenance='heuristic';"
(Resolver edges aren't heuristic; verify via the trace + callees instead.)
Regression: node count stable (select count(*) from nodes; before/after — a big
jump means an extraction change over-fired); existing traces on a control repo intact.
End-to-end agent eval: run the flow question with codegraph and measure reads / answer-completeness / cost vs a pre-fix baseline:
# headless (exact cost + clean tool sequence)
bash scripts/agent-eval/run-agent.sh <repo> with "<flow question>"
# or the full A/B + interactive Explore-subagent path:
scripts/agent-eval/audit.sh local <name> <url> "<flow question>" all
Then parse: Read count, codegraph-tool count, cost, and whether the answer now
contains the glue symbols (the ones that previously required a read).
trace finds the canonical flow end-to-end (no dynamic-dispatch break).| Tool | Purpose |
|---|---|
scripts/agent-eval/probe-trace.mjs <repo> <from> <to> |
call-path between two symbols (the hole detector) |
scripts/agent-eval/probe-node.mjs <repo> <sym> [code] |
symbol + trail (callers/callees); code adds the body |
scripts/agent-eval/probe-context.mjs <repo> "<task>" |
context output incl. call-paths |
scripts/agent-eval/probe-explore.mjs <repo> "<query>" |
explore output |
scripts/agent-eval/{audit,run-agent,itrun}.sh |
agent A/B (headless + interactive); also the /agent-eval skill |
sqlite3 <repo>/.codegraph/codegraph.db |
direct edge/node inspection (provenance, metadata, counts) |
Probe scripts use the built dist/ — run npm run build first. Reindex after any
extraction or resolution change (rm -rf <repo>/.codegraph && codegraph init -i) — the
synthesizer/resolvers run at index time. Test fixtures: keep a tiny per-pattern fixture
(see /tmp/cb-fixture/bus.js; move into __tests__/ when shipping).
Status legend: ✅ done+validated · 🔬 hole identified · ⬜ not started.
Mechanism: R = resolver, S = synthesizer channel, X = extraction.
| Language | Framework(s) | Canonical flow to test | Mechanism | Status |
|---|---|---|---|---|
| TypeScript/JS | React / observer / EventEmitter | state→render; dispatch→callback | S + X | ✅ (excalidraw) |
| TypeScript/JS | Vue / Nuxt | template events (@click→handler); component composition; reactive→render | S + X | ✅ events + composition (vitepress S / vben M / element-plus L); 🔬 reactive→render (vue-core Proxy runtime — frontier, deferred) |
| TypeScript/JS | Svelte / SvelteKit | template calls/composition; SvelteKit action→api; store→DOM | X | ✅ already strong (realworld S / skeleton M / shadcn L): template {fn()} calls, <Pascal/> composition, import * as api namespace, load→api all work out of the box. + exported-const object-of-functions extraction (SvelteKit actions). 🔬 $lib-namespace-from-action + store/reactive frontier |
| TypeScript/JS | Express / Koa | request → route → handler → service | R + X | ✅ named handlers + middleware + controller/service (resolver) + inline arrow handlers → service body calls (realworld S 19 / parse M / ghost L 65 edges). 🔬 custom routers (payload had 0 routes — not app.get-style) |
| TypeScript/JS | NestJS | request → controller → provider | ? | ⬜ |
| TypeScript/JS | RxJS / signals | subscribe → operator → observer | S | ⬜ |
| Python | Django ORM | QuerySet → SQL compiler | R | ✅ |
| Python | Django (views/signals) | url → view; signal → receiver | R/S | 🔬 (routes done; signals ⬜) |
| Python | Flask / FastAPI | request → route → dependency | R | 🔬 (routes done) |
| Go | Gin / net/http | request → handler chain | ? | ⬜ |
| Rust | Axum / Cargo workspace | request → handler; trait dispatch | R | 🔬 (workspaces done) |
| Java | Spring | request → @Controller → service; DI | ? | ⬜ |
| Kotlin | (coroutines / DI) | flow/callback dispatch | ? | ⬜ |
| Swift | Vapor | request → route → controller | ? | ⬜ |
| C# | ASP.NET | request → controller; DI | ? | ⬜ |
| Ruby | Rails / Sinatra | request → controller → view; callbacks | ? | ⬜ |
| PHP | Laravel / Drupal | request → controller; events | ? | ⬜ |
| C/C++ | (callback structs / vtables) | function-pointer dispatch | ? | ⬜ |
| Dart | Flutter | setState → build | S | ⬜ |
| Lua / Luau | (Neovim / Roblox) | event/callback dispatch | S | ⬜ |
| Scala | (Akka / Play) | actor message → handler | ? | ⬜ |
(Verify the exact supported set against src/extraction/languages/ and
src/resolution/frameworks/ before starting — this table is a starting point.)
<template>
is unparsed by the extractor, so template usage needs synthesis (vueTemplateEdges):
@click="fn" → handler, kebab <el-button> → ElButton. PascalCase <Child/> is
already covered by the JSX channel (the SFC component node spans the template). Result:
agent reads drop in every size (vben login 1–3 vs 4–11), strongest where handlers are
local functions (vben handleLogin/handleSubmit).
Composable-destructure handlers RESOLVED: @click="closeSidebar" where
const { close: closeSidebar } = useSidebarControl() now follows alias → composable →
the returned close fn (when it's defined in the composable's file). vitepress sidebar
flow dropped 6 → 0 reads (best case). Precise-only — no fallback to the composable
itself (the static useX() call edge already covers that), so it adds nothing where the
returned fn can't be located (e.g. re-exported / external composable). Remaining limits:
prefix-convention kebab — element-plus el-button → button.vue (component named
button, not ElButton), so kebab stays unresolved there; and reactive→render
(vue-core Proxy runtime) — the deep framework-internal frontier, deferred..svelte extractor already parses the template: extractTemplateCalls ({fn()}),
extractTemplateComponents (<Pascal/> composition — skeleton 956 / shadcn 1610 reference edges),
plus import * as api namespace + load→api resolution all work. Agent A/B (realworld login): with
codegraph 1 read vs without 4 — codegraph already wins out of the box. The one extraction gap
was object-of-functions (export const actions = { default: async () => {} }; the walker
deliberately skips object-literal functions to avoid inline-object noise). Fixed for EXPORTED consts
(general — Redux/Express handler maps too); extractFunction nameOverride keeps inline-object arrows
skipped. Residual: a $lib-alias namespace call (api.post) from an extracted action node doesn't
resolve even though the same alias resolves for load — a deeper resolver interaction, deferred
(local/relative calls from actions connect). Lesson: measure before assuming a hole — modern Svelte
barely uses on:click={fn} (form actions / callback props instead), so the assumed event-handler hole
wasn't the real one; Svelte needed far less than Vue.XController.method/XService.method.
The real hole was inline arrow route handlers (router.post('/x', async (req,res) => {...}) — the
dominant modern pattern): the handler regex [^)]+ broke on the arrow's ), so the route connected to
NOTHING and the anonymous handler's body (the request→service flow) was lost. The entire inline-handler
API was unreachable (realworld POST /users/login → 0 edges). Fixed (frameworks/express.ts): span the
call with a string-aware balanced scan; for inline arrows, extract the body's calls (RESERVED-filtered to
drop res/req/builtins) and attribute them to the route node → realworld 19 / ghost 65 precise
route→service edges (POST /users/login→login, POST /articles→createArticle, …), no node explosion,
framework-scoped (zero blast radius off Express). Deterministic win is clear; the agent A/B is muddied
by repo characteristics — realworld (39 files) is below the size where codegraph beats reading, and
Ghost's layered custom-API architecture makes both arms thrash. Residual: custom routers — payload's
6.4k-file codebase had 0 routes (its router abstraction isn't app.get-style, so undetected). Lesson
inverse of Svelte: Express's dominant pattern WAS the uncovered one, so it needed real work like Vue.tree-sitter.ts walker — re-check node counts across
several languages after any extraction change (it held at +3 on excalidraw because
anonymous arrows are skipped).error/change). Receiver-type
matching (via type_of edges) is the planned precision upgrade — deferred.provenance:'heuristic' +
metadata.synthesizedBy (the enum has no 'callback-synthesis'). See the design doc.resolveAndPersistBatched (full index) — wire into
resolveAndPersist for incremental sync before shipping.trace: common names (render, execute_sql) match many
nodes; trace picks among them and may start from the wrong one. Trace from the specific
method, not a class name.For each language × framework: the canonical flow traces end-to-end, an agent can
answer the flow question with Read 0 in at least some runs with the glue present, no node
explosion, no regression — recorded in the matrix (§6) with the validating repo + numbers.
Then ship-prep: tests per mechanism, CHANGELOG, wire incremental, commit.