ci: avoid deadsnakes in GPU workflows#77

Closed
ivanbasov wants to merge 5 commits into main from codex/fix-gpu-ci-python-setup

Conversation

@ivanbasov
Collaborator

@ivanbasov ivanbasov commented May 1, 2026

Summary

  • remove the GPU workflows' dependency on ppa:deadsnakes/ppa
  • install Python 3.11/3.12/3.13 with actions/setup-python@v5 instead
  • keep apt setup limited to stable system packages (git, git-lfs, gcc)
  • add the missing PR skip gate for multi-gpu-tests
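A minimal sketch of what the Python setup change could look like in a GPU job. This is an illustration under stated assumptions, not the actual diff: the job name `gpu-tests`, the runner labels, the matrix layout, and the exact step order are hypothetical; only `actions/setup-python@v5`, the three Python versions, and the apt package list come from the summary above.

```yaml
# Hypothetical fragment of .github/workflows/ci-gpu.yml.
# Job name, runner labels, and matrix layout are assumptions for illustration.
jobs:
  gpu-tests:
    runs-on: [self-hosted, gpu]          # assumed runner labels
    strategy:
      matrix:
        python-version: ["3.11", "3.12", "3.13"]
    steps:
      # apt is limited to stable system packages; no add-apt-repository call,
      # so the step no longer depends on reaching Launchpad.
      - name: Install system dependencies
        run: |
          sudo apt-get update
          sudo apt-get install -y git git-lfs gcc

      - uses: actions/checkout@v4

      # Python comes from setup-python instead of ppa:deadsnakes/ppa.
      - uses: actions/setup-python@v5
        with:
          python-version: ${{ matrix.python-version }}
```

Quoting the versions as strings keeps YAML from parsing them as floats.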

Context

PR #75 and the test-only PR #76 both failed before checkout/tests in CI / GPU.
The failing step was Install system dependencies, where add-apt-repository -y ppa:deadsnakes/ppa
timed out while contacting Launchpad:

TimeoutError: [Errno 110] Connection timed out

Because PR #76 only changes README text and reproduced the same setup failure, this looks
independent of PR #75's code changes.

After the setup fix, multi-gpu-tests also became reachable on the PR path and queued on
the 2-GPU runner. Its comment says it is main-only, so this PR adds the missing
github.ref == 'refs/heads/main' gate.
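The gate described above can be sketched as a job-level `if:` condition. The runner labels and the placeholder step are assumptions; the job name and the condition itself are taken from the text. On `pull_request` events `github.ref` has the form `refs/pull/<number>/merge`, so the condition evaluates to false and the job is skipped rather than queued.

```yaml
# Hypothetical fragment; runner labels and the step body are placeholders.
jobs:
  multi-gpu-tests:
    # Main-only: skipped on the PR path, so it never waits on the 2-GPU runner.
    if: github.ref == 'refs/heads/main'
    runs-on: [self-hosted, multi-gpu]    # assumed runner labels
    steps:
      - uses: actions/checkout@v4
      - name: Run multi-GPU tests
        run: echo "multi-GPU test command goes here"   # placeholder
```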

Validation

  • Parsed .github/workflows/ci-gpu.yml and .github/workflows/long-running-tests.yml with PyYAML
  • Ran git diff --check

actionlint is not installed in the local environment.

ivanbasov and others added 5 commits March 30, 2026 11:54
…fault

torch.compile=on combined with DataLoader spawn workers during LER
validation causes a segfault (20 leaked semaphores, core dumped).
Set PREDECODER_TORCH_COMPILE=0 for the Train all orientations step.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@ivanbasov
Collaborator Author

Superseded by #78. I rebased the CI fix onto current main so the branch contains only the two workflow commits. Repository rules block force-pushing to this branch, so I opened a clean replacement PR instead.

@ivanbasov ivanbasov closed this May 1, 2026
@bmhowe23 bmhowe23 deleted the codex/fix-gpu-ci-python-setup branch May 5, 2026 17:19