
Fix stdout/stderr redirects #25

Open · wants to merge 14 commits into master from kenodegard:fix-outerr-redirects

Conversation

@kenodegard (Contributor) commented Feb 13, 2024

Resolves #24

Changes include:

  • Bump the Ruff version and enable some additional lints (happy to remove these changes if undesired)
  • Add a TDD test showcasing the problematic behavior
  • Call item.ihook.pytest_runtest_call(item=item) instead of directly calling item.runtest() to benefit from pytest's automatic handling of sys.last_type, sys.last_value, and sys.last_traceback; apparently skipping this handling causes stdout/stderr to stay redirected and inaccessible within tests (see https://github.com/pytest-dev/pytest/blob/3b798e54221f1895a983000c7e5bc8afdacd5011/src/_pytest/runner.py#L165-L182). A sketch of this change follows the list.
  • Switch from the pytest_runtest_protocol hook to the pytest_pyfunc_call hook (deeper in the call stack: pytest_pyfunc_call is invoked from runtest, which in turn is called from pytest_runtest_call)
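A minimal sketch of the call change from the third bullet, with the surrounding plugin code elided:

# Sketch: route the test through pytest's hook instead of calling it
# directly, so pytest's handling of sys.last_type / sys.last_value /
# sys.last_traceback applies and stdout/stderr don't stay redirected
# (per the linked runner.py lines).
# Before:
#     item.runtest()
# After:
item.ihook.pytest_runtest_call(item=item)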

@kenodegard marked this pull request as ready for review on February 13, 2024 03:30
@jezdez commented Feb 13, 2024

Nice, thanks @kenodegard!

@codspeed-hq (bot) commented Feb 13, 2024

CodSpeed Performance Report

Merging #25 will not alter performance

Comparing kenodegard:fix-outerr-redirects (26b5d09) with master (90e639b)

Summary

✅ 5 untouched benchmarks

@art049 (Member) left a comment


Thank you, agree entirely, we need this :)
No problem with the ruff bump and config improvement; love it!

LMK what you think about the details

    plugin.lib,
    item.nodeid,
    item.config,
    lambda: ihook.pytest_runtest_call(item=item),
@art049 (Member):

In the current state, this might introduce variance in the code we measure, since we'll also measure the pytest hook.
I think it would be better to do it the other way around: have pytest_runtest_call call _run_with_instrumentation.

A way to do it might be to override runtest, but don't hesitate if you have another idea!
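A rough sketch of that inversion, assuming the plugin's existing _run_with_instrumentation helper and the argument shape from the snippet above:

# Hypothetical: replace item.runtest with an instrumented version so the
# pytest_runtest_call hook machinery stays outside the measured region.
original_runtest = item.runtest

def instrumented_runtest() -> None:
    _run_with_instrumentation(
        plugin.lib, item.nodeid, item.config, original_runtest
    )

item.runtest = instrumented_runtest  # type: ignore[method-assign]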

@kenodegard (Contributor, Author):

After digging some more, I found we can use pytest_pyfunc_call instead of pytest_runtest_protocol (runtest invokes this hook; see https://github.com/pytest-dev/pytest/blob/5bb1363435a8cb3e2010505dbeb1e015c36beed6/src/_pytest/python.py#L1762-L1764).

I got the tests to pass locally, but I may be missing something, so let me know if you don't think this is the right approach.

(Outdated review threads on pyproject.toml and tests/benchmarks/test_print.py were resolved.)
@kenodegard (Contributor, Author) commented

@art049 thanks for running the tests.

Pushed 59ea7ad to resolve the typing failure on Python < 3.10 and 2b026dd to resolve the perf trampoline test failure on Python 3.12.

@kenodegard (Contributor, Author) commented

@art049 anything more to do here?

Commit co-authored by: Edgar Ramírez Mondragón <16805946+edgarrmondragon@users.noreply.github.com>
@art049 (Member) commented Mar 20, 2024

It's a bit annoying that formatter changes and logic changes ended up in the same PR.
I'll maybe try to split things out.

@art049 (Member) left a comment

Since you're touching the core of the plugin, I have some additional feedback, just to make sure the behavior stays the same.

Really appreciate you refactoring the test protocol thing! 🔥

Edit: thanks for splitting the formatting changes into #29!

Comment on lines +241 to +251
if (
    plugin.is_codspeed_enabled
    and plugin.lib is not None
    and plugin.should_measure
):
    return wrap_pyfunc_with_instrumentation(
        plugin.lib,
        self._request.node.nodeid,
        config,
        func,
    )(*args, **kwargs)
@art049 (Member):

When used with the fixture, the fixture payload shouldn't be wrapped again within pyfunc_call, since it's already wrapped by the test itself.

For example, this would probably fail because of the warmup:

# This doesn't have any pytest marker but defines a bench since it's using the fixture
def test_bench(benchmark):
    # ... some preprocessing
    called_once = False

    @benchmark
    def _():
        nonlocal called_once
        if not called_once:
            called_once = True
        else:
            raise Exception("bench code was called twice but actual bench context only once")

@art049 (Member):

Don't hesitate to add that as an additional test.

@kenodegard (Contributor, Author):

I started to get really confused about the intention here, since the above example also fails on master; see #30.

I suspect we need to do something similar to what pytest-rerunfailures does to clear the cached results between the cache-priming run and the instrumented run: https://github.com/pytest-dev/pytest-rerunfailures/blob/a53b9344c0d7a491a3cc53d91c7319696651d21b/src/pytest_rerunfailures.py#L565-L567
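A hypothetical sketch of that idea; _initrequest() is a private pytest internal (pytest-rerunfailures leans on similar internals), so this is illustrative rather than a drop-in fix:

def _reset_cached_fixtures(item: pytest.Function) -> None:
    # Rebuild the item's fixture request so the instrumented run re-executes
    # fixture setup instead of reusing results cached by the priming run.
    item._initrequest()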

Comment on lines +210 to +226
@pytest.hookimpl(hookwrapper=True)
def pytest_pyfunc_call(pyfuncitem: pytest.Function) -> Iterator[None]:
    plugin = get_plugin(pyfuncitem.config)
    if (
        plugin.is_codspeed_enabled
        and should_benchmark_item(pyfuncitem)
        and not has_benchmark_fixture(pyfuncitem)
    ):
        plugin.benchmark_count += 1
        if plugin.lib is not None and plugin.should_measure:
            pyfuncitem.obj = wrap_pyfunc_with_instrumentation(
                plugin.lib,
                pyfuncitem.nodeid,
                pyfuncitem.config,
                pyfuncitem.obj,
            )
    yield
@art049 (Member):

Love this refactor!
However, when a benchmark is defined from a fixture, we should still perform the warmup at this level and not in the benchmark fixture (see the other comment I left on it).

Comment on lines +191 to +193
if SUPPORTS_PERF_TRAMPOLINE:
    # Warmup CPython performance map cache
    __codspeed_root_frame__()
@art049 (Member):

I think the warmup shouldn't be included in the wrapper but handled at the protocol level
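A rough sketch of that suggestion, reusing names from the diff above; _call_test_body is hypothetical shorthand for invoking the test function with its resolved fixture arguments:

@pytest.hookimpl(hookwrapper=True)
def pytest_pyfunc_call(pyfuncitem: pytest.Function) -> Iterator[None]:
    if SUPPORTS_PERF_TRAMPOLINE and should_benchmark_item(pyfuncitem):
        # Warm up CPython's performance map cache at the protocol level,
        # so the instrumented wrapper measures only the real run.
        _call_test_body(pyfuncitem)  # hypothetical helper
    yield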

@kenodegard (Contributor, Author):

Ah, this also explains the issue we ran into ourselves when trying to use tmp_path in benchmark tests, e.g.:

@pytest.mark.benchmark
def test_tmp_path_benchmark(tmp_path: Path):
    tmp_path.mkdir()

Comment on lines +195 to +202
lib.zero_stats()
lib.start_instrumentation()
try:
    return __codspeed_root_frame__()
finally:
    lib.stop_instrumentation()
    uri = get_git_relative_uri(nodeid, config.rootpath)
    lib.dump_stats_at(uri.encode("ascii"))
@art049 (Member):

Since you introduced this try block, can you add a test bench raising an exception?
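A minimal sketch of such a test using pytest's built-in pytester fixture; the exact harness in this repo's suite may differ:

def test_bench_raising_exception(pytester: pytest.Pytester) -> None:
    pytester.makepyfile(
        """
        import pytest

        @pytest.mark.benchmark
        def test_raises():
            raise RuntimeError("boom")
        """
    )
    result = pytester.runpytest("--codspeed")
    # The exception should surface as a normal failure; the finally block
    # above should still stop instrumentation and dump stats.
    result.assert_outcomes(failed=1)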

Merging this pull request may close: "pytest-codspeed doesn't play nice with stdout/stderr" (#24)

4 participants