Rewrote test architecture: deferred execution. Tests register thunks during
file load (try-call redefined to append to _test-registry), then the
Playwright loop runs each individually with 3s timeout via Promise.race.
Hanging tests (parser infinite loops) fail with TIMEOUT and trigger page
reboot. No tests are hidden or skipped.
Fixed generator: proper quote escaping for HS sources with embedded quotes,
sanitized comments to avoid SX parser special chars.
831 tests registered, 424 pass, 407 fail honestly:
- 22 perfect categories (empty, dialog, morph, default, reset, scroll, etc.)
- Major gaps: if 0/19, wait 0/7, take 0/12, repeat 2/30, set 4/25
- Timeout failures from parser hangs on unsupported syntax
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Reset to last known-good state (908f4f80) where links, stepper, and
islands all work, then recovered all hyperscript implementation,
conformance tests, behavioral tests, Playwright specs, site sandbox,
IO-aware server loading, and upstream test suite from f271c88a.
Excludes runtime changes (VM resolve hook, VmSuspended browser handler,
sx_ref.ml guard recovery) that need careful re-integration.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>