mirror of
https://github.com/garrytan/gstack.git
synced 2026-05-21 20:28:24 +08:00
Merge remote-tracking branch 'origin/main' into garrytan/design
# Conflicts: # CLAUDE.md
This commit is contained in:
15
CLAUDE.md
15
CLAUDE.md
@@ -126,6 +126,21 @@ Contributors can store long-range vision docs and design documents in `~/.gstack
|
||||
These are local-only (not checked in). When reviewing TODOS.md, check `plans/` for candidates
|
||||
that may be ready to promote to TODOs or implement.
|
||||
|
||||
## E2E eval failure blame protocol
|
||||
|
||||
When an E2E eval fails during `/ship` or any other workflow, **never claim "not
|
||||
related to our changes" without proving it.** These systems have invisible couplings —
|
||||
a preamble text change affects agent behavior, a new helper changes timing, a
|
||||
regenerated SKILL.md shifts prompt context.
|
||||
|
||||
**Required before attributing a failure to "pre-existing":**
|
||||
1. Run the same eval on main (or base branch) and show it fails there too
|
||||
2. If it passes on main but fails on the branch — it IS your change. Trace the blame.
|
||||
3. If you can't run on main, say "unverified — may or may not be related" and flag it
|
||||
as a risk in the PR body
|
||||
|
||||
"Pre-existing" without receipts is a lazy claim. Prove it or don't say it.
|
||||
|
||||
## Deploying to the active skill
|
||||
|
||||
The active skill lives at `~/.claude/skills/gstack/`. After making changes:
|
||||
|
||||
Reference in New Issue
Block a user