gstack/test at 0adc71a13b48c8c6cbbbe467a7653456a69a7d4b

hai/gstack

mirror of https://github.com/garrytan/gstack.git synced 2026-05-09 14:09:47 +08:00

Garry Tan 0adc71a13b fix: lower command reference completeness threshold to 3

The LLM judge consistently scores the command reference table's
completeness at 3/5 because it's a terse quick-reference format.
Detailed argument docs live in per-command sections, not the summary
table. The baseline already expects 3 — align the direct test threshold.

2026-03-24 14:27:11 -07:00
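
The threshold change described in this commit message can be pictured with a minimal sketch. It assumes the judge returns integer scores on a 1-5 scale per dimension; the `JudgeScores` shape, the `failingDimensions` helper, and every threshold other than the completeness value of 3 are hypothetical illustrations, not code from the repository.

```typescript
// Hypothetical shape of an LLM-judge result (1-5 per dimension).
// The dimension names other than `completeness` are assumptions.
interface JudgeScores {
  clarity: number;
  completeness: number;
  actionability: number;
}

// Hypothetical per-dimension minimums for the command reference table.
// Only the completeness value (3) comes from the commit message.
const COMMAND_REFERENCE_THRESHOLDS: JudgeScores = {
  clarity: 4,
  completeness: 3, // terse quick-reference table; details live in per-command sections
  actionability: 4,
};

// Returns the names of the dimensions whose score is below its minimum.
function failingDimensions(scores: JudgeScores, minimums: JudgeScores): string[] {
  return (Object.keys(minimums) as (keyof JudgeScores)[]).filter(
    (dimension) => scores[dimension] < minimums[dimension],
  );
}

// A judge result with completeness at 3/5 passes under the lowered threshold.
const sample: JudgeScores = { clarity: 4, completeness: 3, actionability: 4 };
console.log(failingDimensions(sample, COMMAND_REFERENCE_THRESHOLDS)); // []
```

Keeping the minimums in one object per document section would let a direct test and a baseline read the same numbers, which is the kind of alignment the commit message describes.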

feat: test coverage catalog — shared audit across plan/ship/review (v0.10.1.0) (#259)

2026-03-22 11:28:16 -07:00

fix: three flaky E2E test fixes

2026-03-24 14:19:25 -07:00

analytics.test.ts

feat: safety hook skills + skill usage telemetry (v0.7.1) (#189)

2026-03-18 23:57:59 -05:00

codex-e2e.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00

gemini-e2e.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00

gen-skill-docs.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00

global-discover.test.ts

feat: /retro global — cross-project AI coding retrospective (v0.10.2.0) (#316)

2026-03-22 13:52:47 -07:00

hook-scripts.test.ts

feat: safety hook skills + skill usage telemetry (v0.7.1) (#189)

2026-03-18 23:57:59 -05:00

skill-e2e-bws.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00

skill-e2e-cso.test.ts

feat: /cso v2 — infrastructure-first security audit (v0.11.6.0) (#384)

2026-03-23 06:57:22 -07:00

skill-e2e-deploy.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-design.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-plan.test.ts

test: E2E tests for plan review report and Codex offering (v0.11.15.0) (#449)

2026-03-24 07:30:24 -07:00

skill-e2e-qa-bugs.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-qa-workflow.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-review.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-workflow.test.ts

fix: three flaky E2E test fixes

2026-03-24 14:19:25 -07:00

skill-e2e.test.ts

feat: test coverage catalog — shared audit across plan/ship/review (v0.10.1.0) (#259)

2026-03-22 11:28:16 -07:00

skill-llm-eval.test.ts

fix: lower command reference completeness threshold to 3

2026-03-24 14:27:11 -07:00

skill-parser.test.ts

feat: SKILL.md template system, 3-tier testing, DX tools (v0.3.3) (#41)

2026-03-13 21:08:12 -07:00

skill-routing-e2e.test.ts

fix: three flaky E2E test fixes

2026-03-24 14:08:38 -07:00

skill-validation.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00

telemetry.test.ts

feat: opt-in usage telemetry + community intelligence platform (v0.8.6) (#210)

2026-03-19 17:21:05 -07:00

touchfiles.test.ts

Merge remote-tracking branch 'origin/main' into garrytan/e2e-test-triage

2026-03-24 08:16:27 -07:00

worktree.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00