OpenCL backend plans? #42

LibRaw · 2012-10-05T08:24:09Z

Is there any chance, that Halide will generate OpenCL kernels for use on GPUs? Sometimes in future....

I want to use Halide in my desktop computer graphics (photo processing) app, but many users have AMD cards, not NVidia.

mikeseven · 2012-11-02T01:12:22Z

Definitely needed!

jrk · 2012-11-08T01:06:11Z

This is in process.

mikeseven · 2012-12-12T17:43:38Z

Any progress on OpenCL support? I would be happy to beta test it.

pvila89 · 2013-02-22T12:47:51Z

+1

okigan · 2013-02-22T15:40:09Z

+2

oscarbg · 2013-04-03T10:30:26Z

+3

mikeseven · 2013-04-03T16:17:55Z

As discussed at GTC, it might be difficult to do any portable IR for OpenCL. So an ideal solution maybe to use clang to generate OpenCL kernels source.

oscarbg · 2013-04-04T10:25:27Z

Wait there is OpenCL SPIR which is pretty similar to LLVMIR even clang can generate SPIR via:
clang -x cl -fno-builtin -target spir -c -emit-llvm
and seems coming soon at least in AMD OCL drivers for testing..

mikeseven · 2013-04-04T15:02:50Z

Yes but SPIR is not (yet) supported by any GPU vendor and as far as I know
none have plans for it in the next year.
On Apr 4, 2013 3:25 AM, "oscarbg" notifications@github.com wrote:

Wait there is OpenCL SPIR which is pretty similar to LLVMIR even clang can
generate SPIR via:
clang -x cl -fno-builtin -target spir -c -emit-llvm
and seems coming soon at least in AMD OCL drivers for testing..

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/42#issuecomment-15889813
.

mikeseven · 2013-12-11T09:16:21Z

SPIR has been released recently.
Has Halide made progress on OpenCL backend, using SPIR or not?

jrk · 2013-12-11T15:52:19Z

@dsharlet-intel has been making steady progress on both the SPIR and OpenCL C-based backends. I believe they're starting to pass most/all of the tests, as of (the very recent) 2e3222b commit.

mikeseven · 2013-12-11T18:09:01Z

I tried the basic apps on osx 10.9 and all seg fault with OpenCL.

-- Mike
On Dec 11, 2013 7:52 AM, "Jonathan Ragan-Kelley" notifications@github.com
wrote:

@dsharlet-intel https://github.com/dsharlet-intel has been making
steady progress on both the SPIR and OpenCL C-based backends. I believe
they're starting to pass most/all of the tests, as of (the very recent)
2e3222b 2e3222b0489d commit.

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/42#issuecomment-30331805
.

jrk · 2013-12-11T18:18:13Z

Do you know which OpenCL device you’re targeting? One really annoying gotcha of Apple’s implementations is that their x86 OpenCL backend only supports 1D kernel launches.

Do the tests pass?

mikeseven · 2013-12-11T18:35:36Z

I'm targeting embedded gpu mainly but before porting there I test on
desktops, which is macbook pro retina osx 10.9 with nvidia GPU. There the
Cuda/ptx back end works perfectly.
I'll try the tests and try to pinpoint the issue.

-- Mike
On Dec 11, 2013 10:18 AM, "Jonathan Ragan-Kelley" notifications@github.com
wrote:

Do you know which OpenCL device you’re targeting? One really annoying
gotcha of Apple’s implementations is that their x86 OpenCL backend only
supports 1D kernel launches.

Do the tests pass?

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/42#issuecomment-30346118
.

dsharlet-intel · 2013-12-11T18:52:55Z

I appreciate any information you can share from running the tests, but you should know that I've only just started looking at the apps since the last commit cited by jrk. The apps use a little bit different mechanism to run the generated code, which I didn't realize until recently.

In addition to the issue jrk mentioned regarding Apple's x86 OpenCL implementation, they also have a little bit different expected behavior for creating the OpenCL context. I will need to get an Apple machine to make sure this still works for Apple in addition to Linux/Win.

mikeseven · 2013-12-11T19:51:19Z

doing: make run_tests with HL_TARGET=opencl

clang++ -O3 test/correctness/argmax.cpp -Iinclude -Lbin -lHalide -lpthread
-ldl -o bin/test_argmax
cd tmp ; DYLD_LIBRARY_PATH=../bin LD_LIBRARY_PATH=../bin ../bin/test_argmax
OpenCL device codegen init_module
Error: Failed to build program executable! err = -11
Build Log:

No kernels or only kernel prototypes found.

Error: err == CL_SUCCESS
make: *** [test_argmax] Error 1

Same error with test_internal:
cd tmp ; DYLD_LIBRARY_PATH=../bin LD_LIBRARY_PATH=../bin
../bin/test_internal
IRPrinter test passed
CodeGen_C test passed
Simplify test passed
Bounds test passed
Lowering test passed
OpenCL device codegen init_module
Error: Failed to build program executable! err = -11
Build Log:

No kernels or only kernel prototypes found.

Error: err == CL_SUCCESS
make: *** [test_internal] Error 1

The problem is the kernel being generated:
/OpenCL C/
float nan_f32() { return NAN; }
float neg_inf_f32() { return -INFINITY; }
float inf_f32() { return INFINITY; }
float sqrt_f32(float x) { return sqrt(x); }
float sin_f32(float x) { return sin(x); }
float cos_f32(float x) { return cos(x); }
float exp_f32(float x) { return exp(x); }
float log_f32(float x) { return log(x); }
float abs_f32(float x) { return x < 0.0f ? -x : x; }
float floor_f32(float x) { return floor(x); }
float ceil_f32(float x) { return ceil(x); }
float round_f32(float x) { return round(x); }
float pow_f32(float x, float y) { return pow(x, y); }
float asin_f32(float x) { return asin(x); }
float acos_f32(float x) { return acos(x); }
float tan_f32(float x) { return tan(x); }
float atan_f32(float x) { return atan(x); }
float atan2_f32(float y, float x) { return atan2(y, x); }
float sinh_f32(float x) { return sinh(x); }
float asinh_f32(float x) { return asinh(x); }
float cosh_f32(float x) { return cosh(x); }
float acosh_f32(float x) { return acosh(x); }
float tanh_f32(float x) { return tanh(x); }
float atanh_f32(float x) { return atanh(x); }

there is no __kernel!!!

Misc stuff:
on OSX don't use g++ for CXX default but clang++. Default g++ is way to old.

with HL_TARGET=host or cuda, all success.

-- Mike

On Wed, Dec 11, 2013 at 10:53 AM, dsharlet-intel
notifications@github.comwrote:

I appreciate any information you can share from running the tests, but you
should know that I've only just started looking at the apps since the last
commit cited by jrk. The apps use a little bit different mechanism to run
the generated code, which I didn't realize until recently.

In addition to the issue jrk mentioned regarding Apple's x86 OpenCL
implementation, they also have a little bit different expected behavior for
creating the OpenCL context. I will need to get an Apple machine to make
sure this still works for Apple in addition to Linux/Win.

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/42#issuecomment-30350612
.

jrk closed this as completed Dec 11, 2013

platypusCA mentioned this issue Nov 14, 2017

allow build to proceed when llvm-config --system-libs is empty #2525

Merged

AlexanderGarmash mentioned this issue Jan 28, 2019

HelloAndroidCamera2 is crashing when camera permissions granted #3633

Open

chtsao8 mentioned this issue Jul 12, 2019

Onnx->Halide Conversion Issues #4007

Closed

steven-johnson mentioned this issue Aug 23, 2022

Fix possible overflow in saturating_cast bounds inference #6961

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpenCL backend plans? #42

OpenCL backend plans? #42

LibRaw commented Oct 5, 2012

mikeseven commented Nov 2, 2012

jrk commented Nov 8, 2012

mikeseven commented Dec 12, 2012

pvila89 commented Feb 22, 2013

okigan commented Feb 22, 2013

oscarbg commented Apr 3, 2013

mikeseven commented Apr 3, 2013

oscarbg commented Apr 4, 2013

mikeseven commented Apr 4, 2013

mikeseven commented Dec 11, 2013

jrk commented Dec 11, 2013

mikeseven commented Dec 11, 2013

jrk commented Dec 11, 2013

mikeseven commented Dec 11, 2013

dsharlet-intel commented Dec 11, 2013

mikeseven commented Dec 11, 2013

OpenCL backend plans? #42

OpenCL backend plans? #42

Comments

LibRaw commented Oct 5, 2012

mikeseven commented Nov 2, 2012

jrk commented Nov 8, 2012

mikeseven commented Dec 12, 2012

pvila89 commented Feb 22, 2013

okigan commented Feb 22, 2013

oscarbg commented Apr 3, 2013

mikeseven commented Apr 3, 2013

oscarbg commented Apr 4, 2013

mikeseven commented Apr 4, 2013

mikeseven commented Dec 11, 2013

jrk commented Dec 11, 2013

mikeseven commented Dec 11, 2013

jrk commented Dec 11, 2013

mikeseven commented Dec 11, 2013

dsharlet-intel commented Dec 11, 2013

mikeseven commented Dec 11, 2013

No kernels or only kernel prototypes found.

No kernels or only kernel prototypes found.