fix: reduce E2E test flakiness — pre-warm browse, simplify ship, accept multi-skill routing

Browse E2E: pre-warm Chromium in beforeAll so agent doesn't waste turns on cold
startup. Reduce maxTurns 10→3. Add CI-aware MAX_START_WAIT (8s→30s when CI=true).

Ship E2E: simplify prompt from full /ship workflow to focused VERSION bump +
CHANGELOG + commit + push. Reduce maxTurns 15→8.

Routing E2E: accept multiple valid skills for ambiguous prompts.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
Garry Tan
2026-03-23 20:24:05 -07:00
parent 9dc04020a4
commit b0c7739c93
4 changed files with 18 additions and 35 deletions

View File

@@ -15,7 +15,7 @@ import { resolveConfig, ensureStateDir, readVersionHash } from './config';
const config = resolveConfig();
const IS_WINDOWS = process.platform === 'win32';
const MAX_START_WAIT = IS_WINDOWS ? 15000 : 8000; // Node+Chromium takes longer on Windows
const MAX_START_WAIT = IS_WINDOWS ? 15000 : (process.env.CI ? 30000 : 8000); // Node+Chromium takes longer on Windows
export function resolveServerScript(
env: Record<string, string | undefined> = process.env,