Rework test262-runner.py to support --workers N parallel shards, each running a long-lived sx_server session. Replace thread-per-readline with a select-based raw-fd line buffer.

On 2-core machines, 1 worker still beats 2 (OCaml eval is CPU-bound and starves when shared). Auto-defaults n_workers=1 on <=2 CPU, nproc-1 (up to 8) otherwise.

Throughput baseline: ~1.1 Math tests/s serial on 2-core (unchanged; the evaluator dominates). The runner framework is now ready to scale on bigger machines without further code changes.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
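The worker default and the select-based line buffer described in the commit message can be sketched roughly as follows. This is an illustration only, not the code in test262-runner.py: the names `default_workers` and `FdLineReader`, the 64 KiB read size, and the choice to return `None` on both timeout and EOF are assumptions. It also assumes a POSIX system, where `select` works on pipe file descriptors.

```python
import os
import select
import time


def default_workers(cpu_count=None):
    """Worker count when --workers is not given (rule from the commit message)."""
    n = cpu_count if cpu_count is not None else (os.cpu_count() or 1)
    if n <= 2:
        return 1  # the CPU-bound evaluator starves when two workers share 2 cores
    return min(n - 1, 8)  # nproc-1, capped at 8


class FdLineReader:
    """Hypothetical line reader over a raw fd; select() provides the timeout."""

    def __init__(self, fd):
        self.fd = fd
        self.buf = b""  # bytes received but not yet returned as a line

    def readline(self, timeout):
        """Return one line without its newline, or None on timeout or EOF."""
        deadline = time.monotonic() + timeout
        while b"\n" not in self.buf:
            remaining = deadline - time.monotonic()
            if remaining <= 0:
                return None
            ready, _, _ = select.select([self.fd], [], [], remaining)
            if not ready:
                return None  # timed out; any partial line stays buffered
            chunk = os.read(self.fd, 65536)
            if not chunk:
                return None  # EOF: the peer closed its end
            self.buf += chunk
        line, _, self.buf = self.buf.partition(b"\n")
        return line.decode("utf-8", errors="replace")
```

Because the buffer lives in the reader object, a timeout leaves any partial line in place for the next call, and no helper thread is needed per read. In a runner like this one, the fd would presumably be the stdout pipe of a long-lived sx_server child process.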
test262 scoreboard
Pinned commit: d5e73fc8d2c663554fb72e2380a8c2bc1a318a33
Wall time: 426.2s
Total: 67/288 runnable passed (23.3%). Raw: pass=67 fail=204 skip=39 timeout=17 total=327.
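The totals on the line above fit together as shown in this small sketch. It assumes that "runnable" means the total minus skipped tests, which matches the reported numbers. The function name `summarize` is hypothetical and not taken from the runner.

```python
def summarize(passed, failed, skipped, timed_out):
    """Derive total, runnable count, and pass percentage from raw counts."""
    total = passed + failed + skipped + timed_out
    runnable = total - skipped  # assumption: skipped tests are excluded
    pct = 100.0 * passed / runnable if runnable else 0.0
    return total, runnable, round(pct, 1)


print(summarize(67, 204, 39, 17))  # (327, 288, 23.3)
```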
Top failure modes
- 94x ReferenceError (undefined symbol)
- 79x Test262Error (assertion failed)
- 30x TypeError: not a function
- 17x Timeout
- 1x Unhandled: Not callable: {:random <js-math-random()> :trunc <js-math-tr
Categories (worst pass-rate first, min 10 runnable)
| Category | Pass | Fail | Skip | Timeout | Total | Pass % |
|---|---|---|---|---|---|---|
| built-ins/Math | 67 | 204 | 39 | 17 | 327 | 23.3% |
Per-category top failures (min 10 runnable, worst first)
built-ins/Math (67/288 — 23.3%)
- 94x ReferenceError (undefined symbol)
- 79x Test262Error (assertion failed)
- 30x TypeError: not a function
- 17x Timeout
- 1x Unhandled: Not callable: {:random <js-math-random()> :trunc <js-math-tr