
[REVIEW] Allow construction of cuda_async_memory_resource from existing pool #889

Merged: 10 commits merged into rapidsai:branch-22.04 on Mar 23, 2022

Conversation

@fkallen (Contributor) commented on Oct 11, 2021:

Adds a new MR type, cuda_async_view_memory_resource, with a constructor cuda_async_view_memory_resource(cudaMemPool_t valid_pool_handle). The memory resource uses this pool for allocation and deallocation instead of managing its own pool.

Refactors cuda_async_memory_resource to hold an instance of the above, constructed with a cudaMemPool_t that the outer resource owns.
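
A minimal usage sketch of the view resource this PR adds (not taken from the PR itself), assuming CUDA 11.2+ and the rmm::mr::cuda_async_view_memory_resource API as merged; the pool properties shown are illustrative:

#include <rmm/mr/device/cuda_async_view_memory_resource.hpp>

#include <cuda_runtime_api.h>

int main()
{
  // Create a CUDA pool that we (the caller) own.
  cudaMemPoolProps props{};
  props.allocType     = cudaMemAllocationTypePinned;
  props.handleTypes   = cudaMemHandleTypeNone;
  props.location.type = cudaMemLocationTypeDevice;
  props.location.id   = 0;  // pool lives on device 0
  cudaMemPool_t pool{};
  cudaMemPoolCreate(&pool, &props);

  {
    // The view allocates and deallocates from our pool but never owns it.
    rmm::mr::cuda_async_view_memory_resource mr{pool};
    void* p = mr.allocate(1024);
    mr.deallocate(p, 1024);
  }  // destroying the view does not destroy the pool

  cudaMemPoolDestroy(pool);  // ownership stays with the caller
  return 0;
}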

@fkallen requested reviews from a team (as code owner) and from @rongou on October 11, 2021.
@GPUtester (Contributor) commented:

Can one of the admins verify this patch?

(1 similar comment)

The github-actions bot added the cpp (Pertains to C++ code) label on Oct 11, 2021.
@jrhemstad (Contributor) commented:

This makes me uncomfortable. I don't like having a type that is "sometimes owning, sometimes not owning".

I'd rather see a new resource for wrapping an existing cudaMemPool_t.

@harrism (Member) left a review comment:

Mostly doc changes.

Two review threads on include/rmm/mr/device/cuda_async_memory_resource.hpp (outdated, resolved).
@harrism added the improvement (Improvement / enhancement to an existing function) and non-breaking (Non-breaking change) labels on Oct 11, 2021.
@harrism added this to PR-WIP in the v21.12 Release project via automation on Oct 11, 2021.
@harrism (Member) left a review comment:

Discussed this with @jrhemstad and we don't think we should have classes with unclear ownership of resources like this. Perhaps this class should always own its pool. So if you pass in an existing pool, this class should take ownership, and destroy the pool in its destructor. But we may still need a version of async_memory_resource that does not own its pool, and if so, then maybe that should be a different class. I'm interested in discussion on this point.

v21.12 Release automation moved this from PR-WIP to PR-Needs review Oct 11, 2021
@fkallen (Contributor, Author) commented on Oct 18, 2021:

I see, that's a good point. The owning resource could be the cuda_async_memory_resource from this PR with the is_owner_of_pool_ flag removed; it would take ownership of the pool passed to its constructor. One has to check that that pool is not the default pool of the current device, since a default pool cannot be destroyed via cudaMemPoolDestroy.

Do you have a name in mind for the non-owning class? How about cuda_async_non_owning_memory_resource? The non-owning class could also be used with the default memory pool.
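
A short sketch (not from the PR) of that default-pool use case, written against the view type eventually adopted later in this thread; assumes CUDA 11.2+ and device 0:

#include <rmm/mr/device/cuda_async_view_memory_resource.hpp>

#include <cuda_runtime_api.h>

void use_default_pool()
{
  // Fetch the device's default pool; it must never be passed to
  // cudaMemPoolDestroy, so only a non-owning resource can wrap it safely.
  cudaMemPool_t default_pool{};
  cudaDeviceGetDefaultMemPool(&default_pool, /*device=*/0);

  rmm::mr::cuda_async_view_memory_resource mr{default_pool};
  void* p = mr.allocate(256);
  mr.deallocate(p, 256);
}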

@jrhemstad (Contributor) commented:

> Do you have a name in mind for the non-owning class? How about cuda_async_non_owning_memory_resource?

Hm, in some sense a resource that wraps an existing cudaMemPool_t is kind of like a resource adaptor.

The difference is the "upstream" isn't a device_memory_resource, but the CUDA pool represented by the cudaMemPool_t.

So we could call it cuda_pool_adaptor, but that could be misleading, as all the other adaptor types adapt a device_memory_resource upstream.

cuda_pool_wrapper could be a good pick, as it's still descriptive and shouldn't be confused with the other adaptor types.

Also, we should refactor the current cuda_async_memory_resource to be implemented in terms of the cuda_pool_wrapper type, e.g.:

class cuda_async_memory_resource final : public device_memory_resource {
  cuda_pool_wrapper pool_;

 public:
  cuda_async_memory_resource(thrust::optional<std::size_t> initial_pool_size = {},
                             thrust::optional<std::size_t> release_threshold = {})
  {
    ...
    cudaMemPool_t cuda_pool_handle{};
    RMM_CUDA_TRY(cudaMemPoolCreate(&cuda_pool_handle, &pool_props));
    pool_ = cuda_pool_wrapper(cuda_pool_handle);
    ...
  }

  void* do_allocate(std::size_t bytes, rmm::cuda_stream_view stream) override
  {
#ifdef RMM_CUDA_MALLOC_ASYNC_SUPPORT
    return pool_.allocate(bytes, stream);
#else
    (void)bytes;
    (void)stream;
    return nullptr;
#endif
  }
  ...
};

@harrism (Member) commented on Oct 19, 2021:

Another approach could be to have an owning cuda_pool type that is simply a RAII wrapper for cudaMemPool_t, and a non-owning cuda_pool_view type that can be constructed either from a cuda_pool or a cudaMemPool_t. Then have cuda_async_memory_resource take a cuda_pool_view and never own the pool.
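
A hypothetical sketch of that alternative (these types were never added to RMM; all names here are illustrative only):

#include <cuda_runtime_api.h>

// Owning RAII wrapper: creates the pool on construction, destroys it on destruction.
class cuda_pool {
 public:
  explicit cuda_pool(cudaMemPoolProps const& props) { cudaMemPoolCreate(&pool_, &props); }
  ~cuda_pool() { cudaMemPoolDestroy(pool_); }
  cuda_pool(cuda_pool const&)            = delete;
  cuda_pool& operator=(cuda_pool const&) = delete;
  cudaMemPool_t handle() const noexcept { return pool_; }

 private:
  cudaMemPool_t pool_{};
};

// Non-owning view: constructible from a cuda_pool or a raw cudaMemPool_t.
class cuda_pool_view {
 public:
  cuda_pool_view(cudaMemPool_t pool) : pool_{pool} {}
  cuda_pool_view(cuda_pool const& pool) : pool_{pool.handle()} {}
  cudaMemPool_t handle() const noexcept { return pool_; }

 private:
  cudaMemPool_t pool_{};
};

Under this design cuda_async_memory_resource would store only a cuda_pool_view, and keeping the underlying cuda_pool alive would be the caller's job, which is the objection raised in the next comment.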

@jrhemstad (Contributor) commented:

> Another approach could be to have an owning cuda_pool type that is simply a RAII wrapper for cudaMemPool_t, and a non-owning cuda_pool_view type that can be constructed either from a cuda_pool or a cudaMemPool_t. Then have cuda_async_memory_resource take a cuda_pool_view and never own the pool.

I don't think that's the direction we want to go because that introduces another object that a caller would have to keep alive outside of the usual device_memory_resource hierarchy.

@fkallen (Contributor, Author) commented on Oct 19, 2021:

I have implemented the cuda_pool_wrapper idea.

Do you think cuda_async_mr should check that the pool it is given ownership of is not a default pool (since a default pool cannot be destroyed), or should that be left to the user? If we do check, the current check is not sufficient: the handle could still be the default pool of a different device that has access enabled for the current device.

Should there be a requirement on the device location of the pool? Does it have to be the same device on which the memory resource is used?
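
A sketch of the kind of check under discussion, with a hypothetical helper name; as noted above, comparing against the current device's default pool alone is not sufficient in the multi-device case:

#include <cuda_runtime_api.h>

// Hypothetical helper: true iff `pool` is the default pool of the current device.
bool is_default_pool_of_current_device(cudaMemPool_t pool)
{
  int device{};
  cudaGetDevice(&device);
  cudaMemPool_t default_pool{};
  cudaDeviceGetDefaultMemPool(&default_pool, device);
  return pool == default_pool;  // misses default pools of *other* devices
}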

* @param valid_pool_handle Handle to a CUDA memory pool which will be used to
* serve allocation requests.
*/
cuda_async_memory_resource(cudaMemPool_t valid_pool_handle)
A reviewer (Contributor) commented on this constructor:

Do we still need this ctor with the cuda_pool_wrapper?

@harrism (Member) commented on Oct 19, 2021:

This still feels clunky to me, and I think it's mostly because of the name cuda_pool_wrapper, which is vague. To me a "pool wrapper" sounds like it's not a memory_resource at all, just some RAII wrapper for something to be owned. It's especially confusing since we already have another resource in RMM called owning_wrapper, and memory resources so far always have memory_resource or resource_adaptor in the name. cuda_pool_wrapper does exactly what cuda_async_memory_resource does, except that it doesn't own the pool. Since the only difference between the two resources is that one is owning and the other is non-owning, the naming should reflect that (e.g. shared vs. unique, or somehow making one a view).

@harrism (Member) commented on Nov 10, 2021:

Due to the open discussion, I'm moving this to the next release. @fkallen, can you merge rapidsai:branch-22.02 into your branch so this targets the right branch?

@harrism removed this from PR-Needs review in the v21.12 Release project on Nov 10, 2021.
@harrism added this to PR-WIP in the v22.02 Release project via automation on Nov 10, 2021.
@harrism changed the base branch from branch-21.12 to branch-22.02 on November 10, 2021.
@fkallen (Contributor, Author) commented on Jan 11, 2022:

@harrism Sorry, I lost focus on this PR; thanks for reminding me. I agree with your points on the naming of the new class and will change cuda_pool_wrapper to cuda_async_view_memory_resource.

One more question: at the moment I have added the constructor cuda_async_memory_resource(cudaMemPool_t), which takes ownership of the pool. Do we still need it if we have a dedicated view type? My original intention was to be able to use a raw cudaMemPool_t with RMM, and the view is now sufficient for that.

@harrism (Member) commented on Jan 11, 2022:

I agree, I think the view is sufficient. No need for the constructor that takes ownership.

@harrism (Member) left a review comment:

Looks great! Just a few unnecessary blank lines.

Review threads (outdated, resolved) on tests/mr/device/cuda_async_mr_tests.cpp and tests/mr/device/cuda_async_view_mr_tests.cpp.
v22.04 Release automation moved this from PR-WIP to PR-Reviewer approved Jan 12, 2022
Co-authored-by: Mark Harris <mharris@nvidia.com>
@github-actions (bot) commented:
This PR has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this PR if it is no longer required. Otherwise, please respond with a comment indicating any updates. This PR will be labeled inactive-90d if there is no activity in the next 60 days.

@harrism changed the base branch from branch-22.02 to branch-22.04 on February 15, 2022.
@harrism (Member) commented on Feb 15, 2022:

ok to test

@harrism (Member) commented on Feb 25, 2022:

@fkallen can you fix the style failures? It helps to run clang-format locally (e.g. by enabling format-on-save).

@fkallen (Contributor, Author) commented on Feb 25, 2022:

I hope this fixes the issues. I could not use the AlignConsecutiveBitFields and AllowShortEnumsOnASingleLine options with clang-format 10 on my machine.

@harrism (Member) commented on Mar 21, 2022:

@fkallen we've merged some major changes from @robertmaynard that now load the symbols for cudaMallocAsync and related functions using dlopen; see #990. Unfortunately this means your PR may need some changes, or at least a conflict resolution. We only have two days before the 22.04 code freeze. If you have time to get this working before then, it may still make it, but we need to move fast; otherwise it slips to the next release.
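
For context, an illustrative sketch of dlopen-based symbol loading (heavily simplified; the actual mechanism from #990 differs in structure, library lookup, and error handling):

#include <dlfcn.h>

#include <cuda_runtime_api.h>

#include <cstddef>

using cudaMallocAsync_fn = cudaError_t (*)(void**, std::size_t, cudaStream_t);

// Resolve cudaMallocAsync at runtime so the binary also runs against
// CUDA runtimes that predate the API (the symbol is simply absent there).
cudaMallocAsync_fn load_cuda_malloc_async()
{
  void* lib = dlopen("libcudart.so", RTLD_LAZY | RTLD_LOCAL);
  if (lib == nullptr) { return nullptr; }
  return reinterpret_cast<cudaMallocAsync_fn>(dlsym(lib, "cudaMallocAsync"));
}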

@fkallen (Contributor, Author) commented on Mar 22, 2022:

I have merged your changes. In tests/mr/device/cuda_async_view_mr_tests.cpp I have disabled a test because cudaDeviceGetDefaultMemPool is not available through the new mechanism. This should not be an issue, since the view is effectively exercised by the cuda_async_memory_resource tests.

@harrism (Member) commented on Mar 22, 2022:

> I have merged your changes. In tests/mr/device/cuda_async_view_mr_tests.cpp I have disabled a test because cudaDeviceGetDefaultMemPool is not available using the new mechanism. This should not be an issue since the view is effectively tested when testing the cuda_async_memory_resource.

Should probably add that API to the new mechanism and enable the test...

@harrism (Member) commented on Mar 23, 2022:

Thanks @fkallen !

@harrism (Member) commented on Mar 23, 2022:

@gpucibot merge

The rapids-bot merged commit 220ba88 into rapidsai:branch-22.04 on Mar 23, 2022.
v22.04 Release automation moved this from PR-Reviewer approved to Done Mar 23, 2022
@fkallen deleted the non-owning-cuda-async-mr branch on August 9, 2022.
Labels: cpp (Pertains to C++ code), improvement (Improvement / enhancement to an existing function), non-breaking (Non-breaking change)

5 participants