mirror of
https://github.com/garrytan/gstack.git
synced 2026-05-19 10:52:28 +08:00
Merge branch 'main' into garrytan/team-supabase-store
Resolved 15 conflicts: - session-runner.ts: kept both costs array (ours) and model/timing telemetry (main), renamed loop var to avoid shadowing model param - skill-e2e.test.ts: accepted deletion (main split into separate files) - gen-skill-docs.ts: kept our codex review flow with $PROJECTS_DIR paths - plan review templates: took main + re-applied $PROJECTS_DIR paths, replaced gstack-review-log with inline approach - TODOS.md: took main's shipped status for E2E model pinning - Generated SKILL.md + .agents/ files: regenerated from resolved templates Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -145,7 +145,7 @@ Spawns `claude -p` as a subprocess with `--output-format stream-json --verbose`,
|
||||
|
||||
```bash
|
||||
# Must run from a plain terminal — can't nest inside Claude Code or Conductor
|
||||
EVALS=1 bun test test/skill-e2e.test.ts
|
||||
EVALS=1 bun test test/skill-e2e-*.test.ts
|
||||
```
|
||||
|
||||
- Gated by `EVALS=1` env var (prevents accidental expensive runs)
|
||||
@@ -153,7 +153,7 @@ EVALS=1 bun test test/skill-e2e.test.ts
|
||||
- API connectivity pre-check — fails fast on ConnectionRefused before burning budget
|
||||
- Real-time progress to stderr: `[Ns] turn T tool #C: Name(...)`
|
||||
- Saves full NDJSON transcripts and failure JSON for debugging
|
||||
- Tests live in `test/skill-e2e.test.ts`, runner logic in `test/helpers/session-runner.ts`
|
||||
- Tests live in `test/skill-e2e-*.test.ts` (split by category), runner logic in `test/helpers/session-runner.ts`
|
||||
|
||||
### E2E observability
|
||||
|
||||
|
||||
Reference in New Issue
Block a user