limit the amount of memory a process can allocate on a single CUDA device #49

Closed · DiTo97 opened this issue Jul 8, 2023 · 1 comment

DiTo97 commented Jul 8, 2023

Hi all,

As the title suggests, is there a way to limit the total amount of memory that a process can allocate on a single CUDA device?

Perhaps even by using pyNVML?
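For context, the closest thing I can find in pyNVML is read-only monitoring: it reports device-wide and per-process usage, but I see no call that enforces a cap. A minimal sketch (assuming device index 0):

```python
# Minimal sketch: pyNVML can *report* per-process memory usage on a device,
# but it exposes no API to limit how much a process may allocate.
import pynvml

pynvml.nvmlInit()
try:
    handle = pynvml.nvmlDeviceGetHandleByIndex(0)  # device 0

    # Device-wide memory totals.
    mem = pynvml.nvmlDeviceGetMemoryInfo(handle)
    print(f"used {mem.used / 1024**2:.0f} MiB of {mem.total / 1024**2:.0f} MiB")

    # Per-process usage for compute (CUDA) processes on this device.
    for proc in pynvml.nvmlDeviceGetComputeRunningProcesses(handle):
        # usedGpuMemory may be None where per-process accounting is unsupported
        if proc.usedGpuMemory is not None:
            print(f"pid {proc.pid}: {proc.usedGpuMemory / 1024**2:.0f} MiB")
finally:
    pynvml.nvmlShutdown()
```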

This issue is related to the following discussions:

- What are the cons of sharing the resources of a single CUDA device among different processes competing for access?

leofang (Member) commented Jun 15, 2024

Sorry for the late response. AFAIK there is no generic software tool that lets you limit the amount of GPU memory allocatable per process. The closest things are MPS and MIG (link), which partition a GPU in certain ways. Using the stream-ordered memory allocator is another possibility, but every software framework and library you use must honor the driver mempool. I assume none of these is what you are asking for.
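To make that caveat concrete, here is a minimal sketch of a library-level cap, using CuPy's memory pool purely as an example (CuPy is an assumption here, not something the thread prescribes):

```python
# Sketch (not a generic solution): a library-level memory cap via CuPy's
# default memory pool. It only bounds allocations routed through this pool;
# anything allocated outside it in the same process is not counted, which is
# the "everyone must honor the same pool" caveat above.
import cupy

pool = cupy.get_default_memory_pool()
pool.set_limit(size=2 * 1024**3)  # cap CuPy's allocations at 2 GiB on the current device

a = cupy.zeros((256, 1024, 1024), dtype=cupy.float32)  # 1 GiB: fits under the cap
try:
    b = cupy.zeros((512, 1024, 1024), dtype=cupy.float32)  # +2 GiB: exceeds the cap
except cupy.cuda.memory.OutOfMemoryError:
    print("allocation rejected by the pool limit")
```

The same limitation applies to the driver mempool: a cap only holds if every framework in the process actually allocates through it.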

The linked material also discusses the consequences of oversubscribing a single device with multiple processes. A performance drop is an obvious possibility; depending on the workload, you might also experience deadlocks.

DiTo97 closed this as completed Jun 15, 2024