Rework test262-runner.py to support --workers N parallel shards, each running a long-lived sx_server session. Replace thread-per-readline with a select-based raw-fd line buffer. On 2-core machines, 1 worker still beats 2 (OCaml eval is CPU-bound and starves when shared). Auto-defaults n_workers=1 on <=2 CPU, nproc-1 (up to 8) otherwise. Throughput baseline: ~1.1 Math tests/s serial on 2-core (unchanged; the evaluator dominates). The runner framework is now ready to scale on bigger machines without further code changes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
31 lines
969 B
Markdown
31 lines
969 B
Markdown
# test262 scoreboard
|
|
|
|
Pinned commit: `d5e73fc8d2c663554fb72e2380a8c2bc1a318a33`
|
|
Wall time: 426.2s
|
|
|
|
**Total:** 67/288 runnable passed (23.3%). Raw: pass=67 fail=204 skip=39 timeout=17 total=327.
|
|
|
|
## Top failure modes
|
|
|
|
- **94x** ReferenceError (undefined symbol)
|
|
- **79x** Test262Error (assertion failed)
|
|
- **30x** TypeError: not a function
|
|
- **17x** Timeout
|
|
- **1x** Unhandled: Not callable: {:random <js-math-random()> :trunc <js-math-tr
|
|
|
|
## Categories (worst pass-rate first, min 10 runnable)
|
|
|
|
| Category | Pass | Fail | Skip | Timeout | Total | Pass % |
|
|
|---|---:|---:|---:|---:|---:|---:|
|
|
| built-ins/Math | 67 | 204 | 39 | 17 | 327 | 23.3% |
|
|
|
|
## Per-category top failures (min 10 runnable, worst first)
|
|
|
|
### built-ins/Math (67/288 — 23.3%)
|
|
|
|
- **94x** ReferenceError (undefined symbol)
|
|
- **79x** Test262Error (assertion failed)
|
|
- **30x** TypeError: not a function
|
|
- **17x** Timeout
|
|
- **1x** Unhandled: Not callable: {:random <js-math-random()> :trunc <js-math-tr
|