docs: note CUDA_VISIBLE_DEVICES workaround for multi-GPU systems #75

Merged: awvwgk merged 1 commit into microsoft:main from tvogels:docs/multi-gpu-note on May 26, 2026

Conversation

@tvogels (Collaborator) commented May 26, 2026

Adds a short note to the GPU getting-started section: importing gpu4pyscf allocates memory on every visible CUDA device, which conflicts with PyTorch and with other processes sharing those GPUs (e.g. in MPI-parallel workloads).

Documents the CUDA_VISIBLE_DEVICES=0 workaround (and the MPI per-local-rank variant) and links to the upstream tracking issue pyscf/gpu4pyscf#435.
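As a rough illustration of the workaround described above, the sketch below sets CUDA_VISIBLE_DEVICES before gpu4pyscf is imported, since the variable only takes effect if it is set before CUDA is initialized. The helper name `select_visible_device` is hypothetical and not part of this PR. Reading the node-local rank from `OMPI_COMM_WORLD_LOCAL_RANK` (Open MPI) or `SLURM_LOCALID` (Slurm) is an assumption about the launcher in use, and the note added by this PR may word the MPI variant differently.

```python
import os

# Assumption: the launcher exports the node-local rank under one of these
# names (Open MPI and Slurm respectively). Other launchers use other names.
LOCAL_RANK_VARS = ("OMPI_COMM_WORLD_LOCAL_RANK", "SLURM_LOCALID")


def select_visible_device(environ=os.environ, default="0"):
    """Restrict this process to one CUDA device and return its index.

    Hypothetical helper: uses the node-local rank when a known launcher
    variable is present, otherwise falls back to `default` (device 0).
    Any existing CUDA_VISIBLE_DEVICES value is overwritten.
    """
    for name in LOCAL_RANK_VARS:
        value = environ.get(name)
        if value is not None and value.isdigit():
            environ["CUDA_VISIBLE_DEVICES"] = value
            return value
    environ["CUDA_VISIBLE_DEVICES"] = default
    return default


# Typical use, at the very top of the script:
#
#   select_visible_device()
#   import gpu4pyscf  # must come after the variable is set
```

Setting the variable from the shell (`CUDA_VISIBLE_DEVICES=0 python script.py`) has the same effect for the single-process case. The in-Python form is mainly useful when each MPI rank needs a different device.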

Importing gpu4pyscf allocates memory on every visible CUDA device, which
conflicts with PyTorch and with other processes sharing those GPUs (e.g.
in MPI-parallel workloads). Document the CUDA_VISIBLE_DEVICES workaround
and link to the upstream tracking issue.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

@tvogels marked this pull request as ready for review May 26, 2026 12:59
@tvogels self-assigned this May 26, 2026
@tvogels requested a review from awvwgk May 26, 2026 12:59
@awvwgk merged commit c07796c into microsoft:main May 26, 2026
24 checks passed