Improve use of unified memory #86

maleadt · 2023-02-08T12:12:05Z

Our buffers are currently allocated as GPU-only buffers by choosing the Private* storage mode. That's OK given our current CUDA-style programming model where we perform explicit copies to and from the GPU, but it would be nice if we'd also properly support buffers that are shared between CPU and GPU, by selecting Shared storage mode: https://developer.apple.com/documentation/metal/resource_fundamentals/choosing_a_resource_storage_mode_for_apple_gpus. This should probably be a kwarg to the MtlArray constructor.

*Since we choose Private storage mode, I'm not sure how the unified memory examples work...

habemus-papadum · 2023-02-24T16:39:12Z

Hi -- this looks to be done already:

Metal.jl/src/array.jl

Line 36 in 7f45948

    
           function MtlArray{T,N}(::UndefInitializer, dims::Dims{N}; storage=Shared) where {T,N}

(MtlBuffer defaults to private but MtlArray to Shared; kwarg exists to choose another option)

If this seems correct, lmk and I will add documentation for others (and our future-selves.)

maleadt · 2023-02-24T16:45:05Z

Ah right, that's where it's set. That doesn't seem great, as per Apple we should be using private storage.

habemus-papadum · 2023-02-24T16:48:04Z

I agree with that. I will try changing the default and updating the unifiedmemory example (and the gtk example)

jvkersch · 2023-11-16T05:23:48Z

Just wanted to add that I stumbled across this issue when using Metal.jl through KernelAbstractions; I wanted to do some experiments with shared arrays but quickly found that the default allocator allocates arrays in private storage mode. I ended up just adding some defaults to the allocator (jvkersch@08ea259), but as I'm neither a Julia or GPU programmer this is probably not the right approach. Still interested in seeing how this issue evolves, however!

maleadt · 2023-11-16T08:06:51Z

CUDA.jl has recently seen a bunch of unified memory-related improvements, https://info.juliahub.com/cuda-jl-5-1-unified-memory, we should probably backport a bunch of those here (e.g. the ability to conveniently unsafe_wrap a Julia Array to MtlArray, if possible).

tgymnich · 2024-03-12T17:20:26Z

fixed by #305

maleadt · 2024-03-12T17:25:09Z

There's still a couple of important improvements to make, e.g. the ability to cheaply wrap Arrays with an MtlArray and vice-versa. That should make it much easier to use Metal.jl in an existing application.

tgymnich · 2024-03-12T17:27:12Z

tracked also here: #62

maleadt added the enhancement label Feb 8, 2023

maleadt added the arrays Things about the array abstraction. label May 22, 2023

tgymnich closed this as completed Mar 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve use of unified memory #86

Improve use of unified memory #86

maleadt commented Feb 8, 2023

habemus-papadum commented Feb 24, 2023

maleadt commented Feb 24, 2023

habemus-papadum commented Feb 24, 2023

jvkersch commented Nov 16, 2023

maleadt commented Nov 16, 2023

tgymnich commented Mar 12, 2024

maleadt commented Mar 12, 2024

tgymnich commented Mar 12, 2024

Improve use of unified memory #86

Improve use of unified memory #86

Comments

maleadt commented Feb 8, 2023

habemus-papadum commented Feb 24, 2023

maleadt commented Feb 24, 2023

habemus-papadum commented Feb 24, 2023

jvkersch commented Nov 16, 2023

maleadt commented Nov 16, 2023

tgymnich commented Mar 12, 2024

maleadt commented Mar 12, 2024

tgymnich commented Mar 12, 2024