gstack/test at 3074e6773ab1b663b0233dc1baf791cdd744b9ce

hai/gstack

mirror of https://github.com/garrytan/gstack.git synced 2026-05-09 05:59:48 +08:00

Garry Tan f458f18f42 fix: broaden session-awareness E2E assertion to accept more LLM phrasings

The test checked for exact keywords like "RECOMMENDATION", "option a",
"which approach" but the model sometimes phrases options as "A)" or
references "Checkout" vs "Elements" directly without using the word
"recommend". Added: "option b", regex for "a)"/"b)", and the actual
decision terms (checkout, elements, hosted, embedded).

Failed 3/3 retries in CI because the assertion was too narrow for
non-deterministic LLM output.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-29 15:45:26 -07:00
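The broadened check described in the commit message above could be sketched roughly as follows. This is a minimal illustration, not the code in `skill-e2e-bws.test.ts`: the names `KEYWORDS`, `LETTERED_OPTION`, and `mentionsDecision` are hypothetical, and the keyword list is only what the commit message mentions.

```typescript
// Hypothetical sketch: the real assertion is not shown on this page.
// Keywords taken from the commit message, lowercased for comparison.
const KEYWORDS: string[] = [
  "recommendation",
  "option a",
  "option b",
  "which approach",
  "checkout",
  "elements",
  "hosted",
  "embedded",
];

// Matches a lettered choice such as "A)" or "b)" that starts at a word boundary.
const LETTERED_OPTION = /\b[ab]\)/i;

// Returns true if the model output appears to present or discuss the decision.
function mentionsDecision(output: string): boolean {
  const text = output.toLowerCase();
  return KEYWORDS.some((k) => text.includes(k)) || LETTERED_OPTION.test(text);
}
```

A loose matcher like this trades precision for stability: it tolerates varied LLM phrasings, but a common word such as "elements" can also match unrelated output, so a test would presumably pair it with other checks.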

feat: test coverage catalog — shared audit across plan/ship/review (v0.10.1.0) (#259)

2026-03-22 11:28:16 -07:00

merge: incorporate origin/main into community-mode branch

2026-03-27 19:52:08 -07:00

analytics.test.ts

feat: safety hook skills + skill usage telemetry (v0.7.1) (#189)

2026-03-18 23:57:59 -05:00

audit-compliance.test.ts

fix: security audit compliance — credentials, telemetry, bun pin, untrusted warning (v0.12.12.0) (#574)

2026-03-27 12:06:58 -06:00

codex-e2e.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00

community-tier.test.ts

feat: PR screenshots in /ship template + upload/auth tests

2026-03-24 20:05:30 -07:00

gemini-e2e.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00

gen-skill-docs.test.ts

merge: incorporate origin/main into community-mode branch

2026-03-29 13:21:04 -07:00

global-discover.test.ts

feat: /retro global — cross-project AI coding retrospective (v0.10.2.0) (#316)

2026-03-22 13:52:47 -07:00

hook-scripts.test.ts

feat: safety hook skills + skill usage telemetry (v0.7.1) (#189)

2026-03-18 23:57:59 -05:00

review-log.test.ts

fix: community PRs + security hardening + E2E stability (v0.12.7.0) (#552)

2026-03-26 23:21:27 -06:00

skill-e2e-bws.test.ts

fix: broaden session-awareness E2E assertion to accept more LLM phrasings

2026-03-29 15:45:26 -07:00

skill-e2e-cso.test.ts

feat: /cso v2 — infrastructure-first security audit (v0.11.6.0) (#384)

2026-03-23 06:57:22 -07:00

skill-e2e-deploy.test.ts

feat: /land-and-deploy first-run dry run + staging-first + trust ladder (v0.12.2.0) (#518)

2026-03-26 11:08:31 -07:00

skill-e2e-design.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-plan.test.ts

test: E2E tests for plan review report and Codex offering (v0.11.15.0) (#449)

2026-03-24 07:30:24 -07:00

skill-e2e-qa-bugs.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-qa-workflow.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-review.test.ts

fix: community PRs + security hardening + E2E stability (v0.12.7.0) (#552)

2026-03-26 23:21:27 -06:00

skill-e2e-sidebar.test.ts

fix: sidebar agent uses real tab URL instead of stale Playwright URL (v0.12.6.0) (#544)

2026-03-26 22:07:03 -06:00

skill-e2e-workflow.test.ts

feat: 2-tier E2E test system — granular touchfiles + gate/periodic split (v0.11.16.0) (#450)

2026-03-24 15:24:00 -07:00

skill-e2e.test.ts

feat: test coverage catalog — shared audit across plan/ship/review (v0.10.1.0) (#259)

2026-03-22 11:28:16 -07:00

skill-llm-eval.test.ts

feat: voice directive for all skills (v0.12.3.0) (#520)

2026-03-26 17:31:53 -06:00

skill-parser.test.ts

feat: SKILL.md template system, 3-tier testing, DX tools (v0.3.3) (#41)

2026-03-13 21:08:12 -07:00

skill-routing-e2e.test.ts

fix: community PRs + security hardening + E2E stability (v0.12.7.0) (#552)

2026-03-26 23:21:27 -06:00

skill-validation.test.ts

fix: Codex hang fixes — plan visibility, stdout buffering, reasoning effort (v0.12.4.0) (#536)

2026-03-26 18:19:26 -06:00

telemetry.test.ts

merge: incorporate origin/main into community-mode branch

2026-03-28 07:38:15 -07:00

touchfiles.test.ts

feat: 2-tier E2E test system — granular touchfiles + gate/periodic split (v0.11.16.0) (#450)

2026-03-24 15:24:00 -07:00

uninstall.test.ts

feat: community PRs — faster install, skill namespacing, uninstall, Codex fallback, Windows fix, Python patterns (v0.12.9.0) (#561)

2026-03-27 00:44:37 -06:00

worktree.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00