
question: is there any method to avoid mapping same page? #258

Closed
hongbilu opened this issue May 16, 2023 · 6 comments

Comments

@hongbilu

hi, there
i saw there's test case named "basic_small_buffers_mapping". If cudamalloc many times(far more than twice), is there any method to check if this memory's page has been already mapped? if it has been mapped by others, maybe we should use va from matched handle, the map API should not return failure?

@pakmarkthub
Collaborator

Hi @hongbilu,

We don't provide such an API. The contract of the pin and map APIs covers only the buffer you pin and map. It is unsafe to assume that you can use the same CPU VA range obtained from mapping CUDA buffer A to access CUDA buffer B.

@hongbilu
Author

> Hi @hongbilu,
>
> We don't provide such API. And the agreement of the pin and map API is within the buffer you pin and map. It is unsafe to assume that you can use the same CPU VA range from mapping CUDA buffer A to access CUDA buffer B.

Yes, but cudaMalloc cannot guarantee that the returned addresses land on different pages. In fact, they quite often share a page when allocating small buffers, which is a very common usage. The problem is that applications would then have to manage all the CUDA memory themselves and check whether a CPU VA falls in the same range as another's; that is additional work and a dirty, application-specific solution. What do you think?

@pakmarkthub
Collaborator

Let's say that you have two CUDA buffers A and B from cudaMalloc. You will be able to pin both A and B, but you may not be able to map them. gdr_map requires the start address (which does not have to be at the beginning of the buffer) to be GPU BAR1 page aligned. cudaMalloc does not guarantee that alignment. If you want to use GDRCopy to create a CPU VA for your buffers, you must manually adjust the alignment (see https://github.com/NVIDIA/gdrcopy/blob/master/tests/common.cpp#L46). Generally, this means allocating each buffer with a size larger than a GPU BAR1 page. Thus, the buffers should not be that small anyway.

basic_small_buffers_mapping is a unit test to ensure that we can do gdr_pin on two small contiguous buffers. But even if you can pin them, you cannot map the second one, as it is not GPU BAR1 page aligned. It is probably not what you are looking for.

@hongbilu
Author

Thanks! So it needs manual over-allocation of the buffers, which makes it not easy for clients to use.

@pakmarkthub
Collaborator

You may use CUDA VMM instead of cudaMalloc. VMM always guarantees that CUDA VA is page aligned.
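For reference, allocating through the CUDA VMM driver API looks roughly like the sketch below: physical memory is created with cuMemCreate and mapped into a VA range reserved at the allocation granularity, so the resulting CUdeviceptr is always aligned. This is a sketch, not a drop-in implementation: it requires an NVIDIA GPU and CUDA 10.2+, the gpuDirectRDMACapable flag needs a newer driver, and error handling is omitted for brevity.

```c
#include <cuda.h>
#include <stdio.h>

int main(void)
{
    CUdevice dev;
    CUcontext ctx;
    cuInit(0);
    cuDeviceGet(&dev, 0);
    cuCtxCreate(&ctx, 0, dev);

    CUmemAllocationProp prop = {0};
    prop.type = CU_MEM_ALLOCATION_TYPE_PINNED;
    prop.location.type = CU_MEM_LOCATION_TYPE_DEVICE;
    prop.location.id = dev;
    /* Request GPUDirect-RDMA-capable memory where the driver supports it. */
    prop.allocFlags.gpuDirectRDMACapable = 1;

    size_t gran = 0;
    cuMemGetAllocationGranularity(&gran, &prop,
                                  CU_MEM_ALLOC_GRANULARITY_MINIMUM);

    /* Round the requested size up to the allocation granularity. */
    size_t size = ((4096 + gran - 1) / gran) * gran;

    CUmemGenericAllocationHandle handle;
    cuMemCreate(&handle, size, &prop, 0);

    /* Reserve an aligned VA range and map the physical allocation into it. */
    CUdeviceptr d_ptr = 0;
    cuMemAddressReserve(&d_ptr, size, gran, 0, 0);
    cuMemMap(d_ptr, size, 0, handle, 0);

    CUmemAccessDesc access = {0};
    access.location = prop.location;
    access.flags = CU_MEM_ACCESS_FLAGS_PROT_READWRITE;
    cuMemSetAccess(d_ptr, size, &access, 1);

    printf("aligned device VA: 0x%llx (granularity %zu)\n",
           (unsigned long long)d_ptr, gran);

    /* d_ptr is page aligned, so it can be handed to gdr_pin_buffer/gdr_map
       without the manual rounding needed for cudaMalloc. */
    cuMemUnmap(d_ptr, size);
    cuMemAddressRelease(d_ptr, size);
    cuMemRelease(handle);
    cuCtxDestroy(ctx);
    return 0;
}
```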

@hongbilu
Author

I really appreciate the reminder! Thanks.
