[FEA]: Add a `MemoryResource` which uses CUDA VMM APIs to allocate memory #968

benhg · 2025-09-12T22:24:38Z

Description

This is an implementation for #967.

Summary:

Implement VMM grow-in-place via:
- Fast path: contiguous VA extension with fixedAddr + cuMemMap
- Slow path: remap to new VA, preserve data without memcpy
Use aligned sizes everywhere and cast pointers for arithmetic
Return proper enums for CUmemAccessDesc.flags
Default all VMMConfig fields to common values

References:

NVIDIA VMM blog
CUDA Driver VMM docs

Checklist

New or existing tests cover these changes.
The documentation is up to date with these changes.

…m-allocator

copy-pr-bot · 2025-09-12T22:24:41Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

…orting driver enums

leofang · 2025-09-17T23:20:04Z

@benhg the IPC PR was merged, could you resolve conflicts plz? 🙂

…m-allocator

benhg · 2025-09-18T18:00:56Z

@benhg the IPC PR was merged, could you resolve conflicts plz? 🙂

Done, thanks for the reminder.

leofang · 2025-09-18T18:51:19Z

pre-commit.ci autofix

copy-pr-bot · 2025-09-18T18:53:23Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

cuda_core/cuda/core/experimental/__init__.py

cuda_core/cuda/core/experimental/_memory.pyx

Co-authored-by: Keith Kraus <keith.j.kraus@gmail.com>

kkraus14

I agree with @Andy-Jost's comment that it would be nice to parametrize some existing tests with using the VirtualMemoryResource, but I'm okay doing that in a follow up

leofang

Thanks, Ben! Sorry I was not as responsive as I'd like for reviews. Keith told me this is ready to go. We have a number of PRs that have conflicts with each other, so what I'd do is to merge all P0 tasks for the release, and then merge this the last, to minimize conflicts. By the time we get to merge this PR, I will resolve all conflicts and rerun tests. No action is needed.

Marking this PR red only to avoid unplanned merge, again no action is needed. Much appreciated, Ben!

leofang · 2025-10-07T21:45:47Z

I am working on resolving the merge conflicts logically.

leofang · 2025-10-07T22:29:27Z

/ok to test 6450712

github-actions · 2025-10-07T22:42:14Z

Doc Preview CI
🚀 View preview at https://nvidia.github.io/cuda-python/pr-preview/pr-968/
https://nvidia.github.io/cuda-python/pr-preview/pr-968/cuda-core/
https://nvidia.github.io/cuda-python/pr-preview/pr-968/cuda-bindings/
https://nvidia.github.io/cuda-python/pr-preview/pr-968/cuda-pathfinder/
Preview will be ready when the GitHub Pages deployment is complete.

…n into benjaming/vmm-allocator

benhg · 2025-10-08T00:11:50Z

/ok to test 4af54ac

leofang · 2025-10-08T00:28:37Z

Vetter: a user that has permissions to leave an /ok to test comment on a pull request. This is a user with write access (or greater) for a particular repository or a member that is listed in the additional_vetters configuration list (see note below)

I always forget that not all NV employees can launch the CI... 😞

leofang · 2025-10-08T00:28:47Z

/ok to test 4af54ac

leofang · 2025-10-08T00:57:37Z

Ben, Piotr and I discussed with Vishnu from the memory team, and confirmed that VMM is not supported by the TCC mode. I suggested Ben to skip the tests on Windows for now.. perhaps also raise a warning (or exception?) on VirtualMemoryResource constructor. We can revisit this once we turn on WDDM (#462) and MCDM CI (#984).

leofang · 2025-10-08T01:44:47Z

/ok to test 9db04b1

leofang · 2025-10-08T02:25:26Z

Thanks Ben for your first contribution to cuda.core and Keith/Andy for review 🎉

benhg added 5 commits September 10, 2025 13:22

commit initial draft

35d7dd5

add modification/growing option

1de97e2

add tests

e7fd8d0

Add tests and make them pass

c941700

Merge branch 'main' of github.com:benhg/cuda-python into benjaming/vm…

4ddac45

…m-allocator

benhg added 2 commits September 12, 2025 15:32

Fix format with pre-commit hooks

bb5de7f

Fix format with pre-commit hooks

4517ca8

leofang assigned benhg Sep 15, 2025

leofang self-requested a review September 15, 2025 14:35

leofang added P1 Medium priority - Should do feature New feature or request cuda.core Everything related to the cuda.core module labels Sep 15, 2025

leofang added this to the cuda.core beta 7 milestone Sep 15, 2025

Expose enumertor options through VMMAllocationOptions rather than exp…

aa4f8df

…orting driver enums

This was referenced Sep 15, 2025

Add VMMAllocatedMemoryResource for Virtual Memory Management APIs #972

Closed

Mempool memory resource - IPC #446

Draft

leofang requested a review from Andy-Jost September 17, 2025 23:20

Merge branch 'main' of github.com:benhg/cuda-python into benjaming/vm…

5cfd99a

…m-allocator

benhg added 2 commits September 18, 2025 11:02

fix merge conflict

b1d99e5

fix pre-commit issues

a9f4191

[pre-commit.ci] auto code formatting

d1b3379

leofang reviewed Sep 18, 2025

View reviewed changes

cuda_core/cuda/core/experimental/__init__.py Outdated Show resolved Hide resolved

leofang reviewed Sep 18, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_memory.pyx Outdated Show resolved Hide resolved

leofang reviewed Sep 18, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_memory.pyx Outdated Show resolved Hide resolved

leofang reviewed Sep 18, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_memory.pyx Outdated Show resolved Hide resolved

benhg and others added 2 commits October 2, 2025 09:42

Update cuda_core/cuda/core/experimental/_memory.pyx

e90b9b0

Co-authored-by: Keith Kraus <keith.j.kraus@gmail.com>

Handle missing error check and address review comments

ae8263c

kkraus14 previously approved these changes Oct 2, 2025

View reviewed changes

leofang requested changes Oct 3, 2025

View reviewed changes

Merge branch 'main' into benjaming/vmm-allocator

4595298

leofang dismissed kkraus14’s stale review via 4595298 October 7, 2025 21:58

leofang added 2 commits October 7, 2025 22:24

nit: hide non-public dataclass members

fea55e0

add basic docs

6450712

leofang previously approved these changes Oct 7, 2025

View reviewed changes

benhg added 2 commits October 7, 2025 17:06

Merge branch 'benjaming/vmm-allocator' of github.com:benhg/cuda-pytho…

cd90278

…n into benjaming/vmm-allocator

add windows support

4af54ac

benhg dismissed leofang’s stale review via 4af54ac October 8, 2025 00:11

kkraus14 previously approved these changes Oct 8, 2025

View reviewed changes

remove windows tests

9db04b1

benhg dismissed kkraus14’s stale review via 9db04b1 October 8, 2025 01:38

leofang approved these changes Oct 8, 2025

View reviewed changes

leofang enabled auto-merge (squash) October 8, 2025 01:45

leofang merged commit 6efc348 into NVIDIA:main Oct 8, 2025
71 checks passed

leofang linked an issue Oct 8, 2025 that may be closed by this pull request

[FEA]: Add a MemoryResource which uses CUDA VMM APIs to allocate memory #967

Closed

1 task

This was referenced Oct 8, 2025

[FEA]: Add a MemoryResource which uses CUDA VMM APIs to allocate memory #967

Closed

Skipping VirtualMemoryResource tests on WSL #1127

Merged

[FEA]: Add a MemoryResource which uses CUDA VMM APIs to allocate memory #968

[FEA]: Add a MemoryResource which uses CUDA VMM APIs to allocate memory #968

Uh oh!

Conversation

benhg commented Sep 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

copy-pr-bot bot commented Sep 12, 2025

Uh oh!

leofang commented Sep 17, 2025

Uh oh!

benhg commented Sep 18, 2025

Uh oh!

leofang commented Sep 18, 2025

Uh oh!

copy-pr-bot bot commented Sep 18, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kkraus14 left a comment

Choose a reason for hiding this comment

Uh oh!

leofang left a comment

Choose a reason for hiding this comment

Uh oh!

leofang commented Oct 7, 2025

Uh oh!

leofang commented Oct 7, 2025

Uh oh!

github-actions bot commented Oct 7, 2025

Preview will be ready when the GitHub Pages deployment is complete.

Uh oh!

benhg commented Oct 8, 2025

Uh oh!

leofang commented Oct 8, 2025

Uh oh!

leofang commented Oct 8, 2025

Uh oh!

leofang commented Oct 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

leofang commented Oct 8, 2025

Uh oh!

Uh oh!

leofang commented Oct 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[FEA]: Add a `MemoryResource` which uses CUDA VMM APIs to allocate memory #968

[FEA]: Add a `MemoryResource` which uses CUDA VMM APIs to allocate memory #968

benhg commented Sep 12, 2025 •

edited

Loading

leofang commented Oct 8, 2025 •

edited

Loading