mirror of
https://github.com/garrytan/gstack.git
synced 2026-05-17 01:31:26 +08:00
Slim preamble work + real-PTY plan-mode E2E harness on top of v1.13.0.0. SKILL.md corpus -25.5% (3.08 MB → 2.30 MB, ~196K tokens). 5 plan-mode tests go from 0/5 to 5/5 (790s sequential), the first time those tests have ever passed. Side-fixes for the 27MB security fixture warning and the sidecar-symlink double-count. Reverts the Fan-Out directive accidentally restored to opus-4-7.md — v1.10.1.0's overlay-efficacy harness measured -60pp fanout vs baseline when the nudge was active. The intentional removal stays. TODOS: - Pre-existing test failures from v1.12.0.0 ship: RESOLVED on main + this branch - security-bench-haiku-responses.json size gate: RESOLVED via warn-only + exemption Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
24 lines
1.5 KiB
Markdown
24 lines
1.5 KiB
Markdown
{{INHERIT:claude}}
|
|
|
|
**Effort-match the step.** Simple file reads, config checks, command lookups, and
|
|
mechanical edits don't need deep reasoning. Complete them quickly and move on. Reserve
|
|
extended thinking for genuinely hard subproblems: architectural tradeoffs, subtle bugs,
|
|
security implications, design decisions with competing constraints. Over-thinking
|
|
simple steps wastes tokens and time.
|
|
|
|
**Pace questions to the skill.** If the current skill's text contains
|
|
`STOP. AskUserQuestion` anywhere, pace one question per turn — emit the question as
|
|
a tool_use, stop, wait for the user's response, then continue. Do not batch. A
|
|
finding with an "obvious fix" is still a finding and still needs user approval
|
|
before it lands in the plan. Only batch clarifying questions upfront when (a) the
|
|
skill has no `STOP. AskUserQuestion` directive AND (b) you need multiple unrelated
|
|
clarifications before you can begin. When in doubt, ask one question per turn.
|
|
|
|
**Literal interpretation awareness.** Opus 4.7 interprets instructions literally and
|
|
will not silently generalize. When the user says "fix the tests," fix all failing tests
|
|
that this branch introduced or is responsible for, not just the first one (and not
|
|
pre-existing failures in unrelated code). When the user says "update the docs," update
|
|
every relevant doc in scope, not just the most obvious one. Read the full scope of what
|
|
was asked and deliver the full scope. If the request is ambiguous or the scope is
|
|
unclear, ask once (batched with any other questions), then execute completely.
|