callJudge<T> returns { result: T, meta } but three call sites were
accessing properties directly on the wrapper object instead of
destructuring result first. This caused "Expected and actual values
must be numbers or bigints" in all workflow judge tests (10 failures).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>