Make DAP protocol tests deterministic #1033

lionel- · 2026-02-09T16:44:52Z

Branched from #1032 (Merge CommManager into Shell and IOPub)
Closes #1026

This PR is the payoff for the IOPub/Comm unification of #1032. It leverages deterministic message ordering to replace fuzzy message matching with strict sequential assertions in the test infrastructure. It also fixes several bugs made obvious by the new assertions, and removes emission of debug events during auto-stepping so that code injected for breakpoints is invisible to the frontend.

Replaces fuzzy matching with strict sequential assertions, making tests both more readable and more accurate. Before:

frontend.recv_iopub_until(|acc| {
    acc.has_comm_method_count("start_debug", 2) &&
        acc.has_comm_method_count("stop_debug", 2) &&
        acc.saw_idle()
});

After (the different counts are due to other fixes, see below):

frontend.recv_iopub_start_debug();
frontend.assert_stream_stdout_contains("Called from:");
frontend.assert_stream_debug_at(&file);
frontend.recv_iopub_idle();

Eliminates workarounds like recv_iopub_busy_skip_streams, recv_iopub_execute_input_skip_streams, recv_iopub_async, and the entire MessageAccumulator / recv_iopub_until pattern

As part of a complete test review I also fixed:

Off-by-one line number issue in sourced code (missing initial #line directive)
Missing source file assertions on stack frames
Superfluous DAP events from auto-stepping. Code injected for breakpoints no longer emits start_debug/stop_debug cycles

Stream buffering

The one remaining source of non-determinism is R's stream output. R may batch or split stdout/stderr messages arbitrarily, independently of the ordering of non-stream messages. The infrastructure around stream assertions was reworked to robustly prevent test flakiness caused by stream buffering.

All recv_iopub_* methods now automatically buffer Stream messages and ignore them. The accumulated streams are now tested/asserted separately via assert_stream_stdout_contains() / assert_stream_stderr_contains(). Stream assertions are still order-sensitive: each match drains the buffer up to that point. But a stray stream can no longer cause other assertions to fail.

recv_iopub_idle() acts as a synchronization point: at that point, we know for sure R has finished streaming all output emitted during the request handling. The method panics if streams were received but not asserted, then clears the buffers.

There must be at least one stream assertion per request handling if output is emitted. Unfortunately this setup still allows unexpected stream output to go unnoticed as long as there is one stream assertion in the test. Due to arbitrary splitting though, I don't think we can do better. Unless we go for full snapshots, but this approach has issues of its own (scrubbing of stateful output, and stability across platforms and versions of R).

Auto-stepping before `debug_start()`

Previously, auto-stepping through injected breakpoint wrappers happened after debug_start() had already been sent to the frontend. Every breakpoint hit produced visible intermediate events (start_debug, Stopped, Continued, Continued, start_debug, Stopped). Tests had to account for all of these with recv_auto_step_through() and multiple start/stop cycles.

Now maybe_auto_step() is called before debug_start(). If we detect we're at injected code, we immediately return n to R without emitting any debug notifications. The frontend only sees a single start_debug + Stopped at the user expression.

Similarly, when pending_inputs is non-empty (multiple expressions queued), we skip debug_start() for intermediate browser prompts.

The main insight is that we never start a new debug session unless we're ready to close the active execution request, and we never close the active execution request until R has stopped at the final user-visible location.

DavisVaughan

I didn't actually build locally and try out any debugging since this was mostly concerned with test improvements

DavisVaughan · 2026-02-10T18:58:58Z

...ts/ark__console_annotate__tests__annotate_source_breakpoint_on_function_definition_line.snap

+#line 1 "file:///test.R"
 base::.ark_auto_step(base::.ark_breakpoint(browser(), "file:///test.R", "1"))
 #line 1 "file:///test.R"


It's supposed to be on line 6 and 8?

DavisVaughan · 2026-02-10T19:02:45Z

crates/ark_test/Cargo.toml

 serde_json = "1.0.94"
 stdext = { path = "../stdext" }
 tempfile = "3"
+regex = "1"


alphabetical order?

DavisVaughan · 2026-02-10T19:05:13Z