[FEA] Change semantics of RMM memory resource equality comparison #1402

Open
Tracked by #16 ...
harrism opened this issue Dec 6, 2023 · 5 comments
Labels: feature request

Comments

@harrism
Member

harrism commented Dec 6, 2023

Part of #1443

We plan to adopt the cuda::mr::memory_resource concepts as the basis for RMM memory resources. This refactoring brings flexibility both in building and composing memory resources (e.g., the machinery of our stream-ordered pool_memory_resource can be reused for non-device memory, as in #1392) and to applications, which gain a way to specify the properties of the memory resources their functions expect to be passed (via the cuda::mr::resource_ref property interface).

One effect of this is that cuda::mr::resource_ref equality comparison has different semantics from RMM's current device_memory_resource equality comparison, which mimics std::pmr::memory_resource. The std::pmr semantics are defined as follows:

Two memory_resources compare equal if and only if memory allocated from one memory_resource can be deallocated from the other and vice versa.

In contrast, two cuda::mr::resource_refs compare equal only if they have the same equality function pointer, and calling that function on the two resource_refs returns true.
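
To make the difference concrete, here is a simplified sketch of how a type-erased wrapper in the style of resource_ref might implement equality. This is illustrative only, not the actual cuda::mr implementation; the point is that two refs can only compare equal if they carry the same equality function, i.e. they wrap the same concrete resource type.

```cpp
// Illustrative sketch only; this is not the real cuda::mr::resource_ref.
struct any_resource_ref_sketch {
  void* object;                    // pointer to the wrapped concrete resource
  bool (*equal_fn)(void*, void*);  // type-erased equality for that concrete type

  friend bool operator==(any_resource_ref_sketch const& lhs, any_resource_ref_sketch const& rhs)
  {
    // Different concrete types imply different equal_fn, so the refs never
    // compare equal, even if one resource could in fact deallocate memory
    // allocated by the other.
    return lhs.equal_fn == rhs.equal_fn && lhs.equal_fn(lhs.object, rhs.object);
  }
};
```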

Arguably, the std::pmr semantics are more useful. For example, one might allocate memory with a logging_memory_resource<cuda_memory_resource> (a logging MR whose upstream is a CUDA MR) but later need to deallocate that memory with just a cuda_memory_resource; the two compare equal because the upstream of the former compares equal to the latter.
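
For comparison, here is a host-side sketch of how a std::pmr-style adaptor achieves this "compatibility" semantics by deferring do_is_equal to its upstream. It uses std::pmr for brevity and is not RMM's actual logging_resource_adaptor; it only illustrates the mechanism.

```cpp
#include <cstddef>
#include <memory_resource>

// Illustrative adaptor (not RMM's logging_resource_adaptor): forwards
// allocation to an upstream resource and defers equality to it as well.
class logging_adaptor_sketch : public std::pmr::memory_resource {
 public:
  explicit logging_adaptor_sketch(std::pmr::memory_resource* upstream) : upstream_{upstream} {}

 private:
  void* do_allocate(std::size_t bytes, std::size_t align) override
  {
    // ... log the allocation ...
    return upstream_->allocate(bytes, align);
  }
  void do_deallocate(void* ptr, std::size_t bytes, std::size_t align) override
  {
    // ... log the deallocation ...
    upstream_->deallocate(ptr, bytes, align);
  }
  bool do_is_equal(std::pmr::memory_resource const& other) const noexcept override
  {
    // Compatibility semantics: the adaptor compares equal to anything its
    // upstream compares equal to, so memory allocated through the adaptor
    // can be freed via the bare upstream resource.
    return this == &other || upstream_->is_equal(other);
  }

  std::pmr::memory_resource* upstream_;
};
```

With this definition the adaptor compares equal to its upstream, which is the cross-deallocation guarantee described above; a concept-only design has no such virtual hook to override.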

However, these semantics are difficult to support within the cuda::mr design. Because it is based on concepts, rather than inheritance, there is no base class (unlike std::pmr and rmm::mr::device_memory_resource). Without a base class, we can't dynamic_cast, which makes it hard to do the kind of compatibility comparison described above. RTTI-based solutions may be possible, but we don't necessarily want to require RTTI for RMM.

The concept-based approach does not preclude an inheritance hierarchy like the one RMM currently has. However, one advantage of refactoring to the concept approach is that libraries can provide pluggable memory interfaces based only on a fairly lightweight (and hopefully someday standard) vocabulary like cuda::mr::resource_ref, without requiring RMM and without defining their own plugin interfaces.

This was discussed during the design sessions for cuda::mr; no strong opinions were held on the issues above, and the designers felt reasonably comfortable diverging from std::pmr here.

I wanted to open this up for discussion. First, I want to find out whether anyone relies on the "compatibility" equality comparison semantics currently built into RMM. My hunch is that few, if any, do. Please comment below on this question, and then we can move on to discussing options.

@harrism added the bug and question labels Dec 6, 2023
@harrism self-assigned this Dec 6, 2023
@bdice
Contributor

bdice commented Dec 6, 2023

I'm not aware of any code in libcudf (or in RAPIDS generally) that depends on particular semantics of memory resource equality. I know of no issues from changing semantics as described above. Absence of evidence != evidence of absence, of course, but I hope this is helpful.

@harrism
Member Author

harrism commented Dec 7, 2023

Thanks @bdice. I also did some searching of the main RAPIDS repos.

  • I see various comparisons of device_memory_resource* to nullptr; these do not use operator==, so they are not an issue.
  • I see RAFT explicitly calling is_equal, which does exercise this code path, but that code is planned to be removed according to @cjnolet.
  • I see RAFT keeping a std::vector of std::shared_ptr<device_memory_resource>, and there are also places in RMM's gtests that keep vectors of resources, but these should not use operator==. I would be more worried if there were uses of std::set or std::map.

@harrism
Member Author

harrism commented Jan 23, 2024

The usage in RAFT has been removed.

@harrism added the feature request label and removed the bug and question labels Jan 30, 2024
@harrism removed their assignment Jan 30, 2024
@harrism changed the title from [DISCUSSION] Change semantics of RMM memory resource equality comparison to [FEA] Change semantics of RMM memory resource equality comparison Jan 30, 2024
@harrism
Member Author

harrism commented Feb 20, 2024

@miscco @jrhemstad I think having a concept for get_upstream_resource would also help the usability of equality comparison, because users could call get_upstream_resource (when it exists) and compare those resources. This way a user could check whether the upstream of a logging_mr is the same as some other MR, and therefore whether that other MR can be used to free memory allocated with the logging_mr. Thoughts?
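
A rough sketch of what that could enable (the has_upstream concept and the get_upstream_resource() member below are assumptions for illustration; neither is an existing cuda::mr or RMM API):

```cpp
#include <concepts>

// Hypothetical: detects whether a resource exposes an upstream accessor.
template <typename R>
concept has_upstream = requires(R const& r) { r.get_upstream_resource(); };

// Hypothetical helper: true if `mr`, or something in its upstream chain,
// compares equal to `other`, i.e. memory allocated from `mr` could be
// deallocated via `other`.
template <typename Resource, typename Other>
bool can_deallocate_with(Resource const& mr, Other const& other)
{
  if constexpr (std::equality_comparable_with<Resource, Other>) {
    if (mr == other) { return true; }
  }
  if constexpr (has_upstream<Resource>) {
    // Unwrap one adaptor layer and try again.
    return can_deallocate_with(mr.get_upstream_resource(), other);
  } else {
    return false;
  }
}
```

A user could then ask can_deallocate_with(logging_mr, cuda_mr) to answer exactly the question posed above, without RMM or cuda::mr changing their equality semantics.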

@harrism
Member Author

harrism commented May 7, 2024

This may be unnecessary, as @miscco and I were discussing. @miscco mentioned that it would be useful for cuda::mr to have a resource_adaptor concept that can be queried and used to implement equality semantics more like those of the std::pmr::memory_resource and rmm::mr::device_memory_resource implementations.
