docs: note CUDA_VISIBLE_DEVICES workaround for multi-GPU systems by tvogels · Pull Request #75 · microsoft/skala

tvogels · 2026-05-26T12:57:37Z

Adds a short note to the GPU getting-started section: importing gpu4pyscf allocates memory on every visible CUDA device, which conflicts with PyTorch and with other processes sharing those GPUs (e.g. in MPI-parallel workloads).

Documents the CUDA_VISIBLE_DEVICES=0 workaround (and the MPI per-local-rank variant) and links to the upstream tracking issue pyscf/gpu4pyscf#435.
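For reference, a minimal Python sketch of both variants of the workaround. It assumes the variable has to be set before gpu4pyscf (or anything else that initializes CUDA) is imported, and that the launcher exposes the per-node rank through one of the environment variables listed below. The helper names `pick_visible_device` and `restrict_visible_devices` are hypothetical and are not part of skala or gpu4pyscf.

```python
import os

# Environment variables that some launchers set to the rank of a process
# within its node. Which one is present depends on the launcher, so this
# list is an assumption and may need adjusting for a given cluster.
_LOCAL_RANK_VARS = (
    "OMPI_COMM_WORLD_LOCAL_RANK",  # Open MPI
    "MV2_COMM_WORLD_LOCAL_RANK",   # MVAPICH2
    "SLURM_LOCALID",               # Slurm (srun)
)


def pick_visible_device(environ=os.environ, default="0"):
    """Return the GPU index this process should see, as a string.

    Uses the first local-rank variable found in `environ`; falls back to
    `default` (GPU 0) for a plain single-process run.
    """
    for name in _LOCAL_RANK_VARS:
        value = environ.get(name)
        if value is not None:
            return value
    return default


def restrict_visible_devices(environ=os.environ):
    """Set CUDA_VISIBLE_DEVICES to a single device and return its index."""
    device = pick_visible_device(environ)
    environ["CUDA_VISIBLE_DEVICES"] = device
    return device


# Intended use, at the very top of the entry-point script:
#
#     restrict_visible_devices()
#     import gpu4pyscf  # imported only after the variable is set
```

This sketch maps one local rank to one GPU, so it assumes a node never runs more ranks than it has GPUs. It also overwrites any `CUDA_VISIBLE_DEVICES` value a scheduler may already have set, which may not be what you want on a shared node.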

Importing gpu4pyscf allocates memory on every visible CUDA device, which conflicts with PyTorch and with other processes sharing those GPUs (e.g. in MPI-parallel workloads). Document the CUDA_VISIBLE_DEVICES workaround and link to the upstream tracking issue.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

tvogels marked this pull request as ready for review May 26, 2026 12:59

tvogels self-assigned this May 26, 2026

tvogels requested a review from awvwgk May 26, 2026 12:59

awvwgk approved these changes May 26, 2026

awvwgk merged commit c07796c into microsoft:main May 26, 2026
24 checks passed

awvwgk merged 1 commit into microsoft:main from tvogels:docs/multi-gpu-note
2 participants