
refactor template order allocMappedBuf #2270

Merged

Conversation

SimeonEhrig
Member

Move the TPlatform template parameter to the last position. There is no need to specify the platform in the template signature if we pass the platform as an instance.

Follow-up of #2162.
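For illustration, here is a minimal sketch of the call-site change, assuming the usual alpaka CPU host setup; the setup helpers (PlatformCpu, getDevByIdx, Vec) and the exact post-change signature follow my reading of this discussion and may differ from the final API.

```cpp
#include <alpaka/alpaka.hpp>

#include <cstddef>

// Sketch only: the platform no longer needs to appear in the template
// signature once it is passed as a function argument.
void allocMappedExample()
{
    using Elem = float;
    using Idx = std::size_t;

    auto const platformHost = alpaka::PlatformCpu{};
    auto const host = alpaka::getDevByIdx(platformHost, 0);
    auto const platformAcc = alpaka::PlatformCpu{}; // stand-in for any accelerator platform
    auto const extent = alpaka::Vec<alpaka::DimInt<1u>, Idx>{Idx{1024}};

    // Before: the platform type had to be spelled out as a template parameter.
    // auto buf = alpaka::allocMappedBuf<decltype(platformAcc), Elem, Idx>(host, extent);

    // After this PR: TPlatform is deduced from the platform instance.
    auto buf = alpaka::allocMappedBuf<Elem, Idx>(host, platformAcc, extent);
}
```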

psychocoderHPC previously approved these changes May 16, 2024
@psychocoderHPC
Member

psychocoderHPC commented May 16, 2024

Please also update

| cudaMallocHost | alpaka::allocMappedBuf<TPlatform, TElement>(host, extents) 1D, 2D, 3D supported! |

It was also forgotten during the change from the platform type to the platform instance.

Maybe we should use allocMappedBufIfSupported() here.

@psychocoderHPC
Member

psychocoderHPC commented May 16, 2024

Please also update

| cudaHostAlloc | alpaka::allocMappedBuf<TPlatform, TElement>(host, extents) 1D, 2D, 3D supported! |

It should become:

alpaka::allocMappedBuf<TElement>(host, platform, extents) 1D, 2D, 3D supported!

@SimeonEhrig
Member Author

> Please also update
>
> | cudaHostAlloc | alpaka::allocMappedBuf<TPlatform, TElement>(host, extents) 1D, 2D, 3D supported! |
>
> It should become:
>
> alpaka::allocMappedBuf<TElement>(host, platform, extents) 1D, 2D, 3D supported!

I think alpaka::allocMappedBuf<TElement, TIdx>(host, platform, extents) 1D, 2D, 3D supported! should be correct.

@@ -363,7 +363,7 @@ The following tables list the functions available in the `CUDA Runtime API <http
 +----------------------------+--------------------------------------------------------------------------------------------+
 | cudaGetSymbolSize | -- |
 +----------------------------+--------------------------------------------------------------------------------------------+
-| cudaHostAlloc | alpaka::allocMappedBuf<TPlatform, TElement>(host, extents) 1D, 2D, 3D supported! |
+| cudaHostAlloc | alpaka::allocMappedBuf<TElement, TIdx>(host, extents) 1D, 2D, 3D supported! |
Member

Sorry, my comment was wrong; this should be allocMappedBufIfSupported. There is no strong need for this memory to be mapped, and allocMappedBuf is not available on every accelerator or device.

@@ -363,7 +363,7 @@ The following tables list the functions available in the `CUDA Runtime API <http
 +----------------------------+--------------------------------------------------------------------------------------------+
 | cudaGetSymbolSize | -- |
 +----------------------------+--------------------------------------------------------------------------------------------+
-| cudaHostAlloc | alpaka::allocMappedBuf<TPlatform, TElement>(host, extents) 1D, 2D, 3D supported! |
+| cudaHostAlloc | alpaka::allocMappedBufIfSupported<TElement, TIdx>(host, extents) 1D, 2D, 3D supported! |
Contributor

This, too, should just be:

Suggested change
-| cudaHostAlloc | alpaka::allocMappedBufIfSupported<TElement, TIdx>(host, extents) 1D, 2D, 3D supported! |
+| cudaHostAlloc | alpaka::allocMappedBuf<TElement, TIdx>(host, extents) 1D, 2D, 3D supported! |

Member

I do not think that allocMappedBuf is correct here. allocMappedBuf can fail if mapped memory is not supported for the current device, so the code would not run on all devices. allocMappedBufIfSupported is safe for all kinds of devices.

The CUDA API function cudaHostAlloc pins memory but does not guarantee that the memory is visible on the GPU device, so the best-fitting alpaka equivalent is allocMappedBufIfSupported.
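To make that difference concrete, here is a hedged sketch of the two variants as I read this thread; the exact signatures and failure behavior are assumptions, not verified against the current alpaka headers.

```cpp
#include <alpaka/alpaka.hpp>

// Sketch: allocMappedBuf insists on a device-visible mapping and is expected
// to fail on platforms whose devices cannot map host memory, while
// allocMappedBufIfSupported falls back to a plain (non-mapped) host buffer
// and therefore works on every device.
template<typename TElem, typename TIdx, typename THost, typename TPlatform, typename TExtent>
auto allocPortableHostBuf(THost const& host, TPlatform const& platform, TExtent const& extent)
{
    // Safe choice for generic code: degrades gracefully when mapping is unsupported.
    return alpaka::allocMappedBufIfSupported<TElem, TIdx>(host, platform, extent);

    // Strict alternative, only valid where mapped memory is supported:
    // return alpaka::allocMappedBuf<TElem, TIdx>(host, platform, extent);
}
```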

Contributor

> The CUDA API function cudaHostAlloc pins memory but does not guarantee that the memory is visible on the GPU device, so the best-fitting alpaka equivalent is allocMappedBufIfSupported

It does guarantee that. From the CUDA Runtime API:

__host__ cudaError_t cudaHostAlloc ( void** pHost, size_t size, unsigned int flags )

Description

Allocates size bytes of host memory that is page-locked and accessible to the device.

Contributor

In any case, cudaMallocHost() and cudaHostAlloc() are identical, so they should map to the same functionality in alpaka.
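For reference, a small CUDA sketch of the equivalence being claimed here; with the default flags both calls should behave the same (error handling omitted for brevity).

```cpp
#include <cuda_runtime.h>

#include <cstddef>

// Sketch: both allocations are page-locked host memory that the device can
// access; cudaHostAlloc with cudaHostAllocDefault matches cudaMallocHost.
void pinnedAllocExample()
{
    std::size_t const bytes = 1024;
    void* a = nullptr;
    void* b = nullptr;

    cudaMallocHost(&a, bytes);                      // classic pinned host allocation
    cudaHostAlloc(&b, bytes, cudaHostAllocDefault); // same behavior via the flag-based API

    cudaFreeHost(a);
    cudaFreeHost(b);
}
```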

Member

Ohh, I was not aware that cudaMallocHost is the C++ API equivalent of cudaHostAlloc. I have always used cudaHostAlloc with flags to create mapped memory.

In this case, IMO both should map to the same thing. The problem I had is that for both CUDA functions you can configure the behavior via flags, whereas we only have named functions. Maybe we should add both allocMappedBufIfSupported and allocMappedBuf to the documentation and let the user decide what fits their problem best, because the correct replacement depends on the flags used in the native CUDA function call.
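As an illustration of the flags-versus-named-functions point, a small CUDA sketch; which alpaka named function fits each flag combination is exactly what is being debated here, so the comments in the sketch only restate that open question.

```cpp
#include <cuda_runtime.h>

#include <cstddef>

// Sketch: cudaHostAlloc selects its behavior via flags, while alpaka offers
// distinct named functions (allocMappedBuf, allocMappedBufIfSupported), so
// the right replacement depends on the flags used at the original call site.
void hostAllocFlagsExample()
{
    std::size_t const bytes = 1024;
    void* pinnedOnly = nullptr;
    void* pinnedMapped = nullptr;

    // Default flags: page-locked host memory.
    cudaHostAlloc(&pinnedOnly, bytes, cudaHostAllocDefault);

    // Mapped flag: additionally requests a device-accessible mapping.
    cudaHostAlloc(&pinnedMapped, bytes, cudaHostAllocMapped);

    void* devView = nullptr;
    cudaHostGetDevicePointer(&devView, pinnedMapped, 0); // device-side view of the mapped buffer

    cudaFreeHost(pinnedMapped);
    cudaFreeHost(pinnedOnly);
}
```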

Member Author

I changed it back to alpaka::allocMappedBuf<TPlatform, TElement>(host, extents) and added a footnote for alpaka::allocMappedBufIfSupported<TPlatform, TElement>(host, extents).

@@ -363,7 +363,7 @@ The following tables list the functions available in the `CUDA Runtime API <http
 +----------------------------+--------------------------------------------------------------------------------------------+
 | cudaGetSymbolSize | -- |
 +----------------------------+--------------------------------------------------------------------------------------------+
-| cudaHostAlloc | alpaka::allocMappedBuf<TPlatform, TElement>(host, extents) 1D, 2D, 3D supported! |
+| cudaHostAlloc | alpaka::allocMappedBuf<TElement, TIdx>(host, extents) 1D, 2D, 3D supported! [1] |
Contributor

Suggested change
-| cudaHostAlloc | alpaka::allocMappedBuf<TElement, TIdx>(host, extents) 1D, 2D, 3D supported! [1] |
+| cudaHostAlloc | alpaka::allocMappedBuf<TElement, TIdx>(host, platform, extents) 1D, 2D, 3D supported! [1] |

Contributor

also below

@psychocoderHPC psychocoderHPC merged commit 283e1b9 into alpaka-group:develop May 28, 2024
22 checks passed
3 participants