[experiment] ci: xdist increase number of processes #681

deependujha · 2025-08-08T06:40:09Z

Before submitting

Was this discussed/agreed via a GitHub issue? (no need for typos and docs improvements)
Did you read the contributor guideline, Pull Request section?
Did you make sure to update the docs?
Did you write any new necessary tests?

What does this PR do?

Fixes # (issue).

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

codecov · 2025-08-08T06:59:46Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 84%. Comparing base (a543725) to head (17772f5).
⚠️ Report is 4 commits behind head on main.

Additional details and impacted files

@@         Coverage Diff         @@
##           main   #681   +/-   ##
===================================
  Coverage    84%    84%           
===================================
  Files        52     52           
  Lines      7021   7021           
===================================
+ Hits       5900   5902    +2     
+ Misses     1121   1119    -2

deependujha · 2025-08-08T07:35:06Z

Result with `-n $CORES`

action: https://github.com/Lightning-AI/litData/actions/runs/16823796244/job/47655723453?pr=681

Ignoring the failing tests on Windows for now.

mac (3 cpu in CI)

finishes roughly 6-7 min earlier (5m 47s to 6m 43s saved per job)

| OS & Python Version | main: `-n 2` | PR: `-n 3` |
| --- | --- | --- |
| macos-14: `py: 3.9` | 26m 02s | 19m 19s |
| macos-14: `py: 3.10` | 29m 33s | 23m 46s |
| macos-14: `py: 3.11` | 28m 51s | 22m 24s |

ubuntu (4 cpu in CI)

very similar, no significant difference

| OS & Python Version | main: `-n 2` | PR: `-n 3` |
| --- | --- | --- |
| ubuntu-22.04: `py: 3.9` | 27m 50s | 27m 53s |
| ubuntu-22.04: `py: 3.10` | 26m 48s | 26m 38s |
| ubuntu-22.04: `py: 3.11` | 27m 16s | 27m 19s |
| ubuntu-22.04: `py: 3.12` | 31m 26s | 33m 23s |
| ubuntu-22.04: `py: 3.13` | 26m 34s | 26m 53s |
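
For anyone who wants to re-check the "faster / no difference" reading, here is a minimal sketch that turns the job durations from the two tables into per-job deltas. The durations are copied from the tables above; the helper names (`to_seconds`, `RUNS`, `savings`) are made up for this illustration and are not part of the repo or the workflow.

```python
import re


def to_seconds(text):
    """Convert a duration such as '26m 02s' into seconds."""
    match = re.fullmatch(r"(\d+)m (\d+)s", text)
    if match is None:
        raise ValueError(f"unexpected duration format: {text!r}")
    return int(match.group(1)) * 60 + int(match.group(2))


# (runner, python version) -> (main with `-n 2`, PR with `-n 3`),
# copied from the tables in this comment.
RUNS = {
    ("macos-14", "3.9"): ("26m 02s", "19m 19s"),
    ("macos-14", "3.10"): ("29m 33s", "23m 46s"),
    ("macos-14", "3.11"): ("28m 51s", "22m 24s"),
    ("ubuntu-22.04", "3.9"): ("27m 50s", "27m 53s"),
    ("ubuntu-22.04", "3.10"): ("26m 48s", "26m 38s"),
    ("ubuntu-22.04", "3.11"): ("27m 16s", "27m 19s"),
    ("ubuntu-22.04", "3.12"): ("31m 26s", "33m 23s"),
    ("ubuntu-22.04", "3.13"): ("26m 34s", "26m 53s"),
}

# Positive value: the PR run finished earlier than main.
savings = {
    key: to_seconds(main) - to_seconds(pr) for key, (main, pr) in RUNS.items()
}

for (runner, py), saved in savings.items():
    print(f"{runner} py{py}: {saved:+d}s")
```

On these numbers macOS saves between 347s and 403s per job, while Ubuntu stays within a few seconds except for `py: 3.12`, which is 117s slower. These are single runs, so normal CI noise may account for part of the deltas.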

My bad: earlier I said GitHub CI has 2 CPUs, but my logic was:

CORES=$(python -c "import os; print(max(1, os.cpu_count() // 2))")
echo "Using $CORES cores for pytest"

I forgot about the `// 2`, which is why I said GitHub CI has 2 CPUs. :)

ubuntu & windows have 4 cpus (cores)
mac: 3 cpus
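
To make the mix-up concrete, here is a small sketch of what the `max(1, os.cpu_count() // 2)` expression from the one-liner above yields for the CPU counts listed here. The function name `halved_worker_count` and the `None` fallback are assumptions added for illustration; the original one-liner does not guard against `os.cpu_count()` returning `None`.

```python
import os


def halved_worker_count(cpu_count):
    """Mirror the one-liner's expression: max(1, cpu_count // 2)."""
    # os.cpu_count() can return None when the count is undetermined;
    # treating that as 1 is an assumption made for this sketch.
    return max(1, (cpu_count or 1) // 2)


# CPU counts reported in this thread for the CI runners.
reported_cpus = {"ubuntu": 4, "windows": 4, "mac": 3}
workers = {name: halved_worker_count(n) for name, n in reported_cpus.items()}

print(workers)  # {'ubuntu': 2, 'windows': 2, 'mac': 1}
print("this machine:", halved_worker_count(os.cpu_count()))
```

So a printed value of 2 on a 4-CPU runner is the halved worker count, not the CPU count itself.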

cc: @Borda

.github/workflows/ci-testing.yml

Borda · 2025-08-08T15:22:22Z

btw, seems to be a bit flaky on Windows:

>               raise AssertionError(f"Test left zombie thread: {thread}")
E               AssertionError: Test left zombie thread: <Timer(pytest_timeout tests/streaming/test_dataset.py::test_streaming_dataset_distributed_full_shuffle_even[zstd-True], started 1296)>
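
For context, a check like this usually compares the live threads before and after a test. The sketch below is not the project's fixture, just a minimal illustration of the idea under that assumption: a `threading.Timer` that is still pending (similar in kind to the `pytest_timeout` timer named in the message) shows up as a leftover thread until it is cancelled and joined. The names `snapshot_threads`, `leaked_threads` and `hypothetical_timeout_timer` are invented for the example.

```python
import threading


def snapshot_threads():
    """Record the threads that are alive right now."""
    return set(threading.enumerate())


def leaked_threads(before):
    """Return threads that are alive now but were not in the snapshot."""
    return [t for t in threading.enumerate() if t not in before and t.is_alive()]


before = snapshot_threads()

# A pending timer keeps a background thread alive until it fires or is cancelled.
timer = threading.Timer(60.0, lambda: None)
timer.name = "hypothetical_timeout_timer"
timer.start()

leaked_names = [t.name for t in leaked_threads(before)]
print("while pending:", leaked_names)

# Cancelling wakes the timer thread so it can exit; join waits for that.
timer.cancel()
timer.join()
after_cleanup = leaked_threads(before)
print("after cancel + join:", after_cleanup)
```

If the check runs before the timer has been cancelled, the timer thread would be reported exactly like a thread the test itself forgot to stop, which could explain intermittent failures; whether that is what happens on Windows here is not confirmed.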

bhimrazy · 2025-08-10T18:03:26Z

Changing to n=3 also doesn’t seem to help much at the moment.

Next, I think we might need to look into the ignored tests for parallel execution and also improve performance at the test level, especially based on the latest top 100 test timing results. On Windows, some tests take over 200 seconds, and other runtimes have several tests above 60 seconds.
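
One way to act on the timing results is to filter the slowest-tests report by the thresholds mentioned above (60 s and 200 s). The sketch below assumes lines in the style of pytest's `--durations` report (`<seconds>s <phase> <test id>`); the sample lines, test names and timings are hypothetical placeholders, not real results from this repo.

```python
import re

# Matches lines such as "12.34s call     tests/test_example.py::test_name".
DURATION_LINE = re.compile(
    r"^\s*(?P<secs>\d+(?:\.\d+)?)s\s+(?P<phase>setup|call|teardown)\s+(?P<test>\S+)"
)


def slow_tests(report_lines, threshold_s):
    """Return (test id, seconds) pairs above the threshold, slowest first."""
    slow = []
    for line in report_lines:
        match = DURATION_LINE.match(line)
        if match and float(match["secs"]) > threshold_s:
            slow.append((match["test"], float(match["secs"])))
    return sorted(slow, key=lambda item: item[1], reverse=True)


# Hypothetical report lines, for illustration only.
sample_report = [
    "75.20s call     tests/hypothetical/test_b.py::test_medium",
    "210.50s call     tests/hypothetical/test_a.py::test_slow_on_windows",
    "3.10s setup    tests/hypothetical/test_c.py::test_fast",
]

over_60 = slow_tests(sample_report, 60)
over_200 = slow_tests(sample_report, 200)
print(over_60)
print(over_200)
```

Running this per OS on the real report would give a short list of candidates to optimize or to re-enable for parallel execution.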

Borda · 2025-08-12T19:34:25Z

> Changing to n=3 also doesn’t seem to help much at the moment.

so let's close this experiment

github ci xdist increase number of processes

0cf80ab

deependujha requested review from Borda, lantiga and tchaton as code owners August 8, 2025 06:40

deependujha changed the title ~~ci: xdist increase number of processes~~ [experiment] ci: xdist increase number of processes Aug 8, 2025

deependujha commented Aug 8, 2025

.github/workflows/ci-testing.yml (outdated, resolved)

Update .github/workflows/ci-testing.yml

b2cde61

Borda reviewed Aug 8, 2025

.github/workflows/ci-testing.yml (outdated, resolved)

Apply suggestions from code review

17772f5

Borda approved these changes Aug 8, 2025

Borda requested a review from bhimrazy August 8, 2025 11:37

Borda closed this Aug 12, 2025
