pytest test sharding #493

vinnybod · 2025-01-21T03:20:47Z

Adds test sharding support to the pytest runner.

I took some inspiration from https://github.com/caseyduquettesc/rules_python_pytest. In particular, I used it as a reference for which Bazel environment variables need to be pulled into the shim.
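
For readers unfamiliar with the Bazel side, here is a minimal sketch of what reading those variables in a shim might look like. `TEST_TOTAL_SHARDS`, `TEST_SHARD_INDEX`, and `TEST_SHARD_STATUS_FILE` are the variables Bazel sets for sharded tests; the function name `read_shard_env` and its return shape are assumptions made for illustration, not the code in this PR.

```python
import os
from pathlib import Path


def read_shard_env(environ=os.environ):
    """Return (shard_index, total_shards); (0, 1) means sharding is off.

    Hypothetical helper: the name and return shape are illustrative only.
    """
    total = int(environ.get("TEST_TOTAL_SHARDS", "1"))
    index = int(environ.get("TEST_SHARD_INDEX", "0"))
    status_file = environ.get("TEST_SHARD_STATUS_FILE")
    if total > 1 and status_file:
        # Bazel expects the runner to touch this file to signal that it
        # actually supports sharding.
        Path(status_file).touch()
    return index, total
```

Passing a plain dict instead of `os.environ` keeps the helper easy to exercise outside of Bazel.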

The pytest plugin is a derivative work of https://github.com/AdamGleave/pytest-shard. That dependency hasn't been updated in a long time, and its sharding is based on hashing, which often results in unbalanced shards or errors from empty shards. I vendored the code into the repo, modified it to use a plain round-robin strategy for selection, and converted it to be class-based so it can be used with pytest.main().
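
As a rough illustration of the shape described above, a class-based plugin with round-robin selection could look like the sketch below. The class name `RoundRobinShardPlugin` and the helper `round_robin_select` are hypothetical and are not taken from the vendored code; only the `pytest_collection_modifyitems` and `pytest_deselected` hooks are real pytest hooks.

```python
def round_robin_select(items, shard_index, total_shards):
    """Keep every item whose position modulo total_shards equals shard_index."""
    return [
        item
        for position, item in enumerate(items)
        if position % total_shards == shard_index
    ]


class RoundRobinShardPlugin:
    """Hypothetical plugin object; an instance is registered with pytest."""

    def __init__(self, shard_index, total_shards):
        self.shard_index = shard_index
        self.total_shards = total_shards

    def pytest_collection_modifyitems(self, config, items):
        selected = round_robin_select(items, self.shard_index, self.total_shards)
        kept = {id(item) for item in selected}
        deselected = [item for item in items if id(item) not in kept]
        if deselected:
            # Report the dropped items so pytest's summary counts them.
            config.hook.pytest_deselected(items=deselected)
        # pytest requires the list to be modified in place.
        items[:] = selected
```

Because the plugin is an object rather than an installed entry point, it could be handed to pytest directly, for example `pytest.main(args, plugins=[RoundRobinShardPlugin(index, total)])`.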

Test plan

I added an example to the examples directory and tested it with my own repository here: https://github.com/vinnybod/bazel-examples/blob/python-test-sharding/test-shard-python/BUILD.bazel

alexeagle · 2025-01-21T21:53:12Z

I haven't had time to study this yet. Note that I had #401, which may or may not overlap.

vinnybod · 2025-01-21T22:27:55Z

I did see #401, but there hadn't been activity on it in a while. I think this could still be merged in the meantime.

If you do want to commit it with #401 instead, I think it would still be worthwhile to vendor in the pytest-shard plugin with the round-robin patch and always load it, since it's a better UX for developers.

alexeagle · 2025-01-23T03:38:36Z

Round robin is not the best strategy, because it tends to mean each shard has to run many different setup routines.

Take this example (invented from what I remember of JUnit, but I think it applies):

file 1
  test class A
    setup A
    test case i
    test case ii
  test class B
    setup B
    test case iii
    test case iv
file 2
  test class C
    setup C
    test case v
    test case vi

If this is meant to have --shard_count=2, you'd rather split the test cases into [i, ii, iii], [iv, v, vi] so that the first shard can skip setup C and the second shard can skip setup A.

IIUC, round robin means you'll have both shards having to run all three setups.
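
The difference can be checked with a small sketch over the example tree above. The data layout (test case paired with the class whose setup it needs) and the helper name `setups_needed` are assumptions made for illustration.

```python
# Test cases from the example, paired with the class whose setup they need.
TESTS = [
    ("i", "A"), ("ii", "A"),
    ("iii", "B"), ("iv", "B"),
    ("v", "C"), ("vi", "C"),
]


def setups_needed(shard):
    """Return the sorted set of setup routines a shard has to run."""
    return sorted({cls for _, cls in shard})


# Two shards, built with each strategy.
round_robin = [TESTS[0::2], TESTS[1::2]]
contiguous = [TESTS[:3], TESTS[3:]]

for name, shards in (("round robin", round_robin), ("contiguous", contiguous)):
    for index, shard in enumerate(shards):
        cases = [case for case, _ in shard]
        print(name, "shard", index, cases, setups_needed(shard))
```

With round robin both shards need setups A, B, and C; with the contiguous split the first shard needs only A and B and the second only B and C.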

We did test sharding in Jasmine (https://github.com/aspect-build/rules_jasmine/blob/main/jasmine/private/runner.cjs#L32-L53), where we sort the tests and then partition them.

py/private/pytest_shard/LICENSE

alexeagle

Nice, thanks for bringing this in

vinnybod · 2025-01-23T04:13:37Z

@alexeagle I think that is less of a problem when you only have one test file per py_test target. But I suppose if we want to make this more optimal for use cases where developers are globbing a bunch of *_test.py into a single target, the change could help.

If I'm understanding your explanation correctly, instead of a round robin, we should just take the full list [i, ii, iii, iv, v, vi] and partition it by the number of shards, keeping the default ordering (n=3, [i, ii], [iii, iv], [v, vi]). Does that sound right?
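
A minimal sketch of that partitioning, keeping collection order and spreading any remainder over the first shards. The function name `contiguous_shard` is hypothetical, and the remainder handling is one possible choice rather than what the PR necessarily does.

```python
def contiguous_shard(items, shard_index, total_shards):
    """Return the shard_index-th of total_shards contiguous slices of items."""
    base, extra = divmod(len(items), total_shards)
    # The first `extra` shards each take one additional item.
    start = shard_index * base + min(shard_index, extra)
    end = start + base + (1 if shard_index < extra else 0)
    return items[start:end]


cases = ["i", "ii", "iii", "iv", "v", "vi"]
for index in range(3):
    print(index, contiguous_shard(cases, index, 3))
```

Note that when there are more shards than test cases, the trailing shards come back empty, so the runner still has to tolerate a shard with nothing to run.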

I did look at rules_jvm's implementation and they use hashes, so they don't take into account any ordering.
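
For contrast, hash-based selection can be sketched as follows. This is a generic illustration using SHA-256 over a test identifier; it is not the rules_jvm or pytest-shard implementation, and the function names are hypothetical.

```python
import hashlib


def hash_shard(test_id, total_shards):
    """Map a test identifier to a shard using a stable hash."""
    digest = hashlib.sha256(test_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % total_shards


def select_by_hash(test_ids, shard_index, total_shards):
    """Keep the tests whose hash lands on this shard, in collection order."""
    return [t for t in test_ids if hash_shard(t, total_shards) == shard_index]
```

Each test's shard depends only on its own identifier, so neighbouring tests that share a fixture can land on different shards, and nothing guarantees that shard sizes are even or that every shard is non-empty.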

alexeagle · 2025-01-23T07:01:00Z

Yes, that's exactly right. This makes a difference for tests with heavy fixtures: with a larger number of test cases and a larger shard count, the more we can "group by fixture", the less duplicate work is performed. I think rules_jvm probably has this wrong too, but I'm pretty sure the google3 implementation gets it right.

fzakaria · 2025-01-23T18:30:40Z

my 2c:
The discussion about sharding strategy seems like an implementation detail that can be further refined.
Adding sharding support, even with round robin, might still benefit some users; the strategy can be refined continuously once support is enabled.

alexeagle · 2025-01-24T17:17:16Z

The discussion about sharding strategy seems like an implementation detail that can be further refined

Sure, it's always possible to leave a TODO - but you have to appreciate that typically on OSS projects you have a contributor who's on the hook to land the change, and later is likely to disappear. As the maintainer you then end up with that tech debt on your own head. So we're usually a bit defensive about making it right at the beginning. I could trust you guys to come back to this optimization later though.

vinnybod · 2025-01-24T17:41:50Z

Looks like a new change in https://github.com/aspect-build/rules_py/tree/main/examples/virtual_deps broke these changes. Investigating.

aspect-workflows · 2025-01-24T17:46:06Z

Test

6 test targets passed

Targets

//examples/multi_version:py_version_default_test [k8-fastbuild]                 1s
//examples/multi_version:py_version_test [k8-fastbuild-ST-494921797612]         1s
//examples/pytest:nested/pytest [k8-fastbuild]                                  2s
//examples/pytest:pytest_test [k8-fastbuild]                                    1s
//examples/pytest:sharding_test [k8-fastbuild]                                  2s
//examples/virtual_deps:pytest_test [k8-fastbuild]                              1s

Total test execution time was 9s. 23 tests (79.3%) were fully cached saving 41s.

vinnybod added 4 commits January 20, 2025 15:30

Initial implementation of test sharding for the pytest runner

e3d2f1f

change reference

b9ce906

change imports

7194656

apply ruff formatting

db4eeb5

vinnybod changed the title ~~Test sharding~~ pytest test sharding Jan 21, 2025

fzakaria reviewed Jan 21, 2025

py/private/pytest_shard/pytest_shard.py (resolved)

alexeagle reviewed Jan 23, 2025

py/private/py_pytest_main.bzl (outdated, resolved)

alexeagle reviewed Jan 23, 2025

py/private/pytest_shard/LICENSE (resolved)

alexeagle reviewed Jan 23, 2025

py/private/pytest_shard/pytest_shard.py (resolved)

alexeagle reviewed Jan 24, 2025

py/private/pytest_shard/pytest_shard.py (resolved)

alexeagle reviewed Jan 24, 2025

py/private/pytest.py.tmpl (outdated, resolved)

vinnybod added 2 commits January 24, 2025 10:11

code review

77ce3d1

also check total shards > 1

5a17cb0

vinnybod added 2 commits January 24, 2025 10:20

format README

b61b9e5

Merge branch 'main' into test-sharding

1787c2c

add :__test__ as a dep in virtual_deps example

4e716cd

vinnybod requested a review from alexeagle January 24, 2025 17:53

alexeagle approved these changes Jan 28, 2025

alexeagle merged commit a23ffaa into aspect-build:main Jan 28, 2025
17 checks passed
