
Port MetalKernels #131

Merged (16 commits) on Mar 28, 2023

Conversation

@maxwindiff (Contributor) commented Mar 14, 2023

Outstanding issues

  • copyto_testsuite used to fail for strange reasons, but now seems to pass consistently?

Unsupported currently

@maleadt (Member) commented Mar 14, 2023

cc @vchuravy @tgymnich

@maleadt (Member) commented Mar 17, 2023

Can we make use of the specialized `thread_position_in_threadgroup_1d` and `thread_position_in_grid_1d` queries instead of having to multiply things ourselves (i.e., #76 and #139)?
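For context, the alternative to the manual multiplication is a single intrinsic. A rough sketch of both styles using Metal.jl's 1-based device queries (assuming a 1D launch; this is illustrative device code, not part of this PR):

```julia
# Manual linearization from threadgroup coordinates:
function kernel_manual(a)
    i = (threadgroup_position_in_grid_1d() - 1) * threads_per_threadgroup_1d() +
        thread_position_in_threadgroup_1d()
    i <= length(a) && (a[i] = Float32(i))
    return
end

# The specialized query already yields the global position directly:
function kernel_direct(a)
    i = thread_position_in_grid_1d()
    i <= length(a) && (a[i] = Float32(i))
    return
end
```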

@maxwindiff (Contributor, Author) commented

@maleadt (Member) commented Mar 18, 2023

Ah, right, not sure how I overlooked those.

@vchuravy (Member) commented

We should expose a query in KA that checks capabilities of the backend. E.g. atomics and Float64 support come to mind.
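A minimal sketch of what such a capability query could look like (the names `supports_atomics` and `supports_float64` are hypothetical here, not an existing KernelAbstractions API):

```julia
# Hypothetical protocol: optimistic defaults on the abstract backend type,
# with overrides for backends that lack a feature.
supports_atomics(::KA.Backend) = true
supports_float64(::KA.Backend) = true
supports_float64(::MetalBackend) = false  # Apple GPUs have no Float64

# The testsuite could then guard hardware-dependent tests, e.g.:
# supports_float64(backend) || return  # skip Float64 tests
```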

@maxwindiff (Contributor, Author) commented

@vchuravy Sounds good, I'll take a stab at adding it.

By the way, I have a question about SpecialFunctions. The test fails with this error:

```julia
julia> a = MtlArray(Float32[1.0])
1-element MtlVector{Float32}:
 1.0

julia> gamma.(a)
ERROR: InvalidIRError: compiling broadcast_kernel(Metal.mtlKernelContext, MtlDeviceVector{Float32, 1}, Val{CartesianIndices((1,))}, Base.Broadcast.Broadcasted{Metal.MtlArrayStyle{1}, Tuple{Base.OneTo{Int64}}, typeof(gamma), Tuple{Base.Broadcast.Extruded{MtlDeviceVector{Float32, 1}, Tuple{Bool}, Tuple{Int64}}}}, Int64) in world 32495 resulted in invalid LLVM IR
Reason: unsupported call to an unknown function (call to gpu_malloc)
...
Reason: unsupported call through a literal pointer (call to tgammaf)
Stacktrace:
 [1] _gamma
   @ ~/.julia/packages/SpecialFunctions/gXPNz/src/gamma.jl:578
 [2] gamma
   @ ~/.julia/packages/SpecialFunctions/gXPNz/src/gamma.jl:567
 [3] _broadcast_getindex_evalf
   @ ./broadcast.jl:670
...
```

The second error complains about a `ccall((:tgammaf, libopenlibm), ...)`. How did that work for the other backends?

@maxwindiff (Contributor, Author) commented

Oh, found it: https://github.com/JuliaGPU/CUDA.jl/blob/master/src/device/intrinsics/special_math.jl
There doesn't seem to be an equivalent function in Metal, however.
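For reference, the CUDA.jl file linked above redirects such libm calls to device intrinsics via method overrides, roughly like this (a sketch of the pattern only; it wouldn't carry over to Metal, which has no corresponding tgamma intrinsic to forward to):

```julia
# Sketch of CUDA.jl's approach: overlay the host method with a device-side
# version that calls the libdevice intrinsic instead of openlibm.
@device_override SpecialFunctions.gamma(x::Float32) =
    ccall("extern __nv_tgammaf", llvmcall, Cfloat, (Cfloat,), x)
```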

@maxwindiff (Contributor, Author) commented

Once JuliaGPU/KernelAbstractions.jl#369 and JuliaGPU/KernelAbstractions.jl#374 are merged, most tests should pass. There's one weird issue remaining, however: the copyto test fails if the adapt unit test was run before it. Still trying to figure out why...

To disable unrelated tests:

```diff
diff --git a/test/kernelabstractions.jl b/test/kernelabstractions.jl
index 6542799..17c79ff 100644
--- a/test/kernelabstractions.jl
+++ b/test/kernelabstractions.jl
@@ -13,4 +13,8 @@ Testsuite.testsuite(()->MetalBackend(), "Metal", Metal, MtlArray, Metal.MtlDevic
     "Convert",           # depends on https://github.com/JuliaGPU/Metal.jl/issues/69
     "SpecialFunctions",  # no equivalent Metal intrinsics for gamma, erf, etc
     "sparse",            # not supported yet
+
+    "partition", "get_backend", "indextest", "Const", "CPU synchronization",
+    "Zero iteration space $(MetalBackend())", "return statement", "fallback test: callable types", "priority",
+    "Localmem", "Private", "Unroll", "Printing", "Compiler", "Reflection", "Examples",
 ]))
```

@vchuravy (Member) commented

OK, landed both PRs. I can tag 0.9.1.

@vchuravy (Member) commented

> There doesn't seem to be an equivalent function in Metal however.

Yeah we just need to skip that test.

@maxwindiff marked this pull request as ready for review Mar 23, 2023
@maxwindiff (Contributor, Author) commented

I didn't change anything except updating dependencies, but now the copyto failure is gone. I guess this is ready for review. @vchuravy @tgymnich, I'd appreciate it if you could take another look, especially at the Adapt and copyto implementations.

maxwindiff and others added 3 commits March 24, 2023 17:40
Co-authored-by: Tim Besard <tim.besard@gmail.com>
Co-authored-by: Tim Besard <tim.besard@gmail.com>
@maleadt merged commit 9d500f6 into JuliaGPU:main on Mar 28, 2023