Fix flaky tracing test in cudf-polars by TomAugspurger · Pull Request #22012 · rapidsai/cudf

TomAugspurger · 2026-04-03T21:43:36Z

This test was failing under some conditions. At a minimum, this previously hit an error:

pytest --executor in-memory \
  python/cudf_polars/tests/test_tracing.py::test_import_without_structlog \
  python/cudf_polars/tests/test_scan.py::test_scan[csv-no_row_index-all_rows-all-no_mask-no_slice]

consistent with the error we occasionally saw in CI.

By avoiding monkeypatching, we seem to avoid the issue with our singledispatch implementation not being found.

copy-pr-bot · 2026-04-03T21:43:40Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

TomAugspurger · 2026-04-03T21:43:46Z

/ok to test 9f9cbe2

TomAugspurger · 2026-04-03T21:57:20Z

/ok to test 74e2bab

TomAugspurger · 2026-04-20T15:46:23Z

/ok to test 944e962

vyasr · 2026-04-20T21:55:59Z

I dug into this issue a bit further and made sense of what's happening. tl;dr I recommended the subprocess solution and I stand by it now, I think it's the only really safe choice here; monkeypatching sys.modules and then performing operations that lead to other imports (which happens during cudf_polars execution) is fundamentally unsound and can lead to inconsistent states after the monkeypatching concludes.

The best way to see the problem is by applying this diff to the problematic test:

❯ git diff
diff --git a/python/cudf_polars/tests/test_tracing.py b/python/cudf_polars/tests/test_tracing.py
index dff1c61e94..09237ee6b5 100644
--- a/python/cudf_polars/tests/test_tracing.py
+++ b/python/cudf_polars/tests/test_tracing.py
@@ -64,6 +64,8 @@ def test_trace_basic(
 def test_import_without_structlog(monkeypatch: pytest.MonkeyPatch) -> None:
     modules = list(sys.modules)

+    original_keys = set(sys.modules.keys())
+    print("The original len is ", len(modules))
     for module in modules:
         if module.startswith("cudf_polars"):
             monkeypatch.delitem(sys.modules, module)
@@ -77,6 +79,11 @@ def test_import_without_structlog(monkeypatch: pytest.MonkeyPatch) -> None:
     q = pl.DataFrame({"a": [1, 2, 3]}).lazy().select(pl.col("a").sum())
     q.collect(engine="gpu")

+    print("The length before undo is ", len(sys.modules))
+    monkeypatch.undo()
+    print("The length after undo is ", len(sys.modules))
+    print("The new modules are ", set(sys.modules.keys()) - original_keys)
+

When this runs, you see something like

python/cudf_polars/tests/test_tracing.py .The original len is  1085
The length before undo is  1101
The length after undo is  1103
The new modules are  {'cudf_polars.experimental.parallel', 'cudf_polars.experimental.utils', 'cudf_polars.experimental.scheduler', 'cudf_polars.experimental.groupby', 'cudf_polars.experimental.dispatch', 'cudf_polars.experimental.shuffle', 'cudf_polars.experimental.repartition', '_statistics', 'cudf_polars.experimental.base', 'cudf_polars.experimental', 'cudf_polars.experimental.expressions', 'cudf_polars.experimental.join', 'cudf_polars.experimental.statistics', 'statistics', 'cudf_polars.experimental.distinct', 'cudf_polars.experimental.sort', 'cudf_polars.experimental.io', 'cudf_polars.experimental.select'}

Here's what's happening in this example. The collect call inside this test triggers the import of the cudf_polars.experimental modules, since streaming is the default executor. If test_tracing.py is the first test to run in the current process (or if every test that ran before it used an executor other than streaming), then the imports that occur during cudf_polars's execution of the collect call add modules to sys.modules that were not present before we monkeypatched it. That is a problem because pytest.MonkeyPatch doesn't know that it should delete these modules; the modifications are outside its scope. That is the root of the issue.
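For anyone who wants to reproduce the leftover-module effect without cudf_polars, here is a minimal sketch. It assumes a throwaway package (`demo_pkg`, with `core` and a lazily imported `extra`; all of these names are hypothetical) and mimics what `monkeypatch.delitem` and `monkeypatch.undo` do to `sys.modules` with a plain dict of saved entries, rather than using pytest itself.

```python
import importlib
import pathlib
import sys
import tempfile

# Build a throwaway package: core.py imports extra.py lazily inside run().
# (demo_pkg, core, and extra are hypothetical stand-ins, not cudf_polars names.)
root = pathlib.Path(tempfile.mkdtemp())
pkg = root / "demo_pkg"
pkg.mkdir()
(pkg / "__init__.py").write_text("")
(pkg / "core.py").write_text(
    "class Node:\n"
    "    pass\n"
    "\n"
    "def run():\n"
    "    from demo_pkg import extra  # lazy import, first triggered at call time\n"
    "    return extra\n"
)
(pkg / "extra.py").write_text("from demo_pkg import core\nNODE = core.Node\n")
sys.path.insert(0, str(root))
importlib.invalidate_caches()

original_core = importlib.import_module("demo_pkg.core")
keys_before = set(sys.modules)

# Mimic monkeypatch.delitem: remember each deleted entry so it can be restored.
saved = {
    name: sys.modules.pop(name)
    for name in list(sys.modules)
    if name.startswith("demo_pkg")
}

# Re-import and trigger the lazy import while the originals are missing.
fresh_core = importlib.import_module("demo_pkg.core")
fresh_core.run()

# Mimic monkeypatch.undo: only the remembered entries are put back.
sys.modules.update(saved)

leftover = {
    name for name in set(sys.modules) - keys_before if name.startswith("demo_pkg")
}
extra = sys.modules["demo_pkg.extra"]

print(leftover)                                      # {'demo_pkg.extra'}
print(sys.modules["demo_pkg.core"] is original_core)  # True
print(extra.NODE is original_core.Node)               # False
```

The last line is the mixed state: `demo_pkg.extra` survives the cleanup and still points at the `Node` class from the re-imported `core`, not the restored one.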

After the test completes, we therefore wind up in a mixed state. All of the original non-experimental cudf_polars modules are restored into sys.modules, but the cudf_polars.experimental modules stay in there. Most importantly, they hold references to the non-experimental cudf_polars modules that were created in the middle of the structlog test, which are not the same objects as the contents of sys.modules after the test. In particular, that means that we have the following situation:

# Before the test
original_ir = sys.modules['cudf_polars.dsl.ir']
...
# In the middle of the structlog test after we delete modules and reimport
ir_in_structlog_test = sys.modules['cudf_polars.dsl.ir']
evaluate_streaming_in_structlog_test = sys.modules['cudf_polars.experimental.parallel'].evaluate_streaming
lower_ir_graph_in_structlog_test = sys.modules['cudf_polars.experimental.dispatch'].lower_ir_node
...
# After monkeypatch cleans up
ir_after_cleanup = sys.modules['cudf_polars.dsl.ir']  # Note: `ir_after_cleanup is original_ir`
evaluate_streaming_after_cleanup = sys.modules['cudf_polars.experimental.parallel'].evaluate_streaming
lower_ir_graph_after_cleanup = sys.modules['cudf_polars.experimental.dispatch'].lower_ir_node

Then, when a subsequent test calls evaluate_streaming, we have

return evaluate_streaming( # This is `evaluate_streaming_in_structlog_test`, not `evaluate_streaming_after_cleanup`
    ir, # BUT this is `ir_after_cleanup`, not `ir_in_structlog_test`
    config_options
)

Therefore, the evaluate_streaming call eventually tries to look up instances of original_ir.* inside the registry of lower_ir_graph_in_structlog_test, which don't match because that singledispatch was created with references to elements of ir_in_structlog_test.
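The dispatch failure itself can be sketched in isolation. This assumes nothing about the real cudf_polars registry: `load_ir_module`, `Select`, and the string return value are illustrative stand-ins, and calling the factory twice plays the role of importing the same module twice.

```python
from functools import singledispatch


def load_ir_module():
    """Stand-in for importing an IR module: each call defines a new Select class."""

    class Select:
        pass

    return Select


SelectOriginal = load_ir_module()    # plays the role of the class in original_ir
SelectReimported = load_ir_module()  # plays the role of the class in ir_in_structlog_test


@singledispatch
def lower_ir_node(node):
    # Fallback used when no registered class matches type(node).
    raise NotImplementedError(f"no implementation for {type(node).__qualname__}")


@lower_ir_node.register(SelectReimported)
def _(node):
    return "lowered"


# Same class name, different class object: only the re-imported one is registered.
print(lower_ir_node(SelectReimported()))  # lowered

try:
    lower_ir_node(SelectOriginal())
except NotImplementedError as exc:
    error = exc
    print("dispatch failed:", exc)
```

Both classes have the same name and source, which is why the resulting error message looks so confusing: the registry is keyed on class identity, not on the qualified name.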

vyasr

Thanks for the fix!

vyasr · 2026-04-20T21:58:40Z

+def test_import_without_structlog() -> None:
+    code = textwrap.dedent("""\
+    import sys
+    sys.modules["structlog"] = None


Is this necessary? Will structlog get imported by default somehow if it is installed?

wence- · 2026-04-21

We have to fake that structlog is not available so that cudf_polars.dsl.tracing._HAS_STRUCTLOG will return false, I suspect.
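As a rough illustration of the subprocess approach combined with the `sys.modules[...] = None` trick, here is a sketch that uses the standard-library `json` module as a stand-in for structlog (that substitution is an assumption made so the snippet runs anywhere; the actual test body is not reproduced here). Setting a `sys.modules` entry to `None` makes a later `import` of that name raise `ModuleNotFoundError`, and because it happens in a child interpreter, the parent's `sys.modules` is never touched.

```python
import subprocess
import sys
import textwrap

# Code executed in a fresh interpreter. "json" is a stand-in for structlog.
code = textwrap.dedent("""\
    import sys
    sys.modules["json"] = None  # a None entry makes `import json` fail
    try:
        import json
    except ImportError:
        print("blocked")
    else:
        print("imported")
""")

result = subprocess.run(
    [sys.executable, "-c", code],
    capture_output=True,
    text=True,
    check=True,
)
child_output = result.stdout.strip()
print(child_output)  # blocked

# The parent process is unaffected: json still imports normally here.
import json

print(json.dumps({"parent": "ok"}))  # {"parent": "ok"}
```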

A more principled way to do this would be to install a module finder hook to raise import error when trying to find structlog, but that seems like too much effort.
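For reference, such a finder hook might look roughly like the sketch below. `BlockedImportFinder` and `hypothetical_optional_dep` are made-up names, and this is not code from the PR. Note that meta path finders are only consulted when the name is not already in `sys.modules`, so in a long-lived test process this would still need a cache entry to be removed first.

```python
import importlib
import importlib.abc
import sys


class BlockedImportFinder(importlib.abc.MetaPathFinder):
    """Meta path finder that refuses to locate the given top-level packages."""

    def __init__(self, *blocked):
        self.blocked = set(blocked)

    def find_spec(self, fullname, path, target=None):
        if fullname.partition(".")[0] in self.blocked:
            raise ModuleNotFoundError(
                f"import of {fullname!r} is blocked", name=fullname
            )
        return None  # not ours: defer to the remaining finders


finder = BlockedImportFinder("hypothetical_optional_dep")
sys.meta_path.insert(0, finder)  # must come first to pre-empt the real finders
try:
    try:
        importlib.import_module("hypothetical_optional_dep")
    except ModuleNotFoundError as exc:
        blocked_error = exc
        print("blocked:", exc.name)
finally:
    # Always remove the hook so later imports are unaffected.
    sys.meta_path.remove(finder)
```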

TomAugspurger · 2026-04-21T11:49:17Z

/merge

Authors:
- Tom Augspurger (https://github.com/TomAugspurger)

Approvers:
- Matthew Murray (https://github.com/Matt711)
- Vyas Ramasubramani (https://github.com/vyasr)

URL: rapidsai#22012

github-actions Bot assigned TomAugspurger Apr 3, 2026

github-actions Bot added Python Affects Python cuDF API. cudf-polars Issues specific to cudf-polars labels Apr 3, 2026

github-project-automation Bot added this to cuDF Python Apr 3, 2026

GPUtester moved this to In Progress in cuDF Python Apr 3, 2026

TomAugspurger force-pushed the tom/debug-ci branch from 944e962 to 4b66b6f Compare April 20, 2026 18:57

TomAugspurger changed the title ~~[WIP]: Debug cudf-polars failures~~ Fix flaky tracing test in cudf-polars Apr 20, 2026

TomAugspurger marked this pull request as ready for review April 20, 2026 18:58

TomAugspurger requested a review from a team as a code owner April 20, 2026 18:58

TomAugspurger requested a review from vyasr April 20, 2026 18:58

Matt711 approved these changes Apr 20, 2026

Matt711 added non-breaking Non-breaking change bug Something isn't working labels Apr 20, 2026

TomAugspurger added improvement Improvement / enhancement to an existing function and removed improvement Improvement / enhancement to an existing function labels Apr 20, 2026

vyasr approved these changes Apr 20, 2026

TomAugspurger added 2 commits April 21, 2026 04:46

Merge remote-tracking branch 'upstream/main' into tom/debug-ci

9d08ec3

reference

39b428c

Merge branch 'main' into tom/debug-ci

bba6870

rapids-bot Bot merged commit c209410 into rapidsai:main Apr 22, 2026
180 of 184 checks passed

github-project-automation Bot moved this from In Progress to Done in cuDF Python Apr 22, 2026

TomAugspurger deleted the tom/debug-ci branch April 22, 2026 20:25
