Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MemoryError cph.initialize_allocator(cph.PoolAllocation, 1000000000) #98

Closed
ducktrA opened this issue May 30, 2022 · 5 comments
Closed

Comments

@ducktrA
Copy link

ducktrA commented May 30, 2022

All python examples complain about cph.PoolAllocation. Has anyone else seen that? cuda toolkit 11.6
NVIDIA-SMI 510.60.02 Driver Version: 510.60.02 CUDA Version: 11.6
There is plenty of GPU mem available. The 1G fits in easily. I can reduce the size heavily and same issue

traceback (most recent call last):
File "basic/benchmarks3.py", line 5, in
cph.initialize_allocator(cph.PoolAllocation, 1000000000)
MemoryError: std::bad_alloc: RMM failure at:/home/adolf/cupoch/third_party/rmm/include/rmm/mr/device/pool_memory_resource.hpp:179: Maximum pool size exceeded

@neka-nat
Copy link
Owner

Hi,
Does RMM work when used by itself?
https://docs.rapids.ai/api/rmm/stable/basics.html

import rmm

pool = rmm.mr.PoolMemoryResource(
    rmm.mr.ManagedMemoryResource(),
    initial_pool_size=2**30,
    maximum_pool_size=2**32
)
rmm.mr.set_current_device_resource(pool)

@ducktrA
Copy link
Author

ducktrA commented May 31, 2022

RMM itself does work when using rapids-ai RMM module. To my surprise cupoch referenced rmm module did not compile as it complained about outdated make. Somehow this doesnt feel right.

@neka-nat
Copy link
Owner

neka-nat commented Jun 1, 2022

I have increased the version of RMM, so please try it with the latest master. (I don't use c++17, so it is not the latest RMM)

@neka-nat neka-nat closed this as completed Jun 7, 2022
@ducktrA
Copy link
Author

ducktrA commented Jun 20, 2022

i did check out and built it. still the same issue. btw, the upper rmm test hit a rmm module cupoch obviously did not compile against.

i remove my manually installed rmm (pip uninstall rmm)

Traceback (most recent call last):
File "examples/python/basic/benchmarks.py", line 5, in
cph.initialize_allocator(cph.PoolAllocation, 1000000000)
MemoryError: std::bad_alloc: RMM failure at:/home/adolf/cupoch/third_party/rmm/include/rmm/mr/device/pool_memory_resource.hpp:179: Maximum pool size exceeded

@ducktrA
Copy link
Author

ducktrA commented Jun 20, 2022

same on develop branch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants