Using CLOC to compile OpenCL kernels to ROCm platform with Fiji GPU #21

Open
rjfnobre opened this issue Dec 14, 2016 · 5 comments

@rjfnobre

Is it possible to use CLOC to compile OpenCL kernels that can be used with either a ROCm platform (with an AMD Fiji GPU) or the older AMD Catalyst drivers on Linux?

Before I knew about CLOC, I was planning to compile OpenCL kernels with Clang/LLVM and the libclc (http://libclc.llvm.org/) library.
But I more or less gave up on that idea after multiple attempts, as I could not find a way to turn the GCN assembly produced by the Clang/LLVM AMD GCN code generator into a binary that the OpenCL function clCreateProgramWithBinary could load.

Thanks in advance!

@gstoner
Member

gstoner commented Dec 15, 2016 via email

@rjfnobre
Author

That's very good!

And what about using the GCN assembly generated from OpenCL kernels with ROCm?

I'm applying specialized optimization strategies to LLVM IR representing OpenCL kernels, and I would like to use the output of those transformations with ROCm (so that I can generate the binaries that will execute on the GPU).

Do you have an idea on how to do this?
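
For illustration, the workflow described above (OpenCL C to LLVM IR, custom transformations on the IR, then GCN code generation) might be driven as in the sketch below. It only assembles the command lines. The helper name `build_pipeline`, the file names, and the `-O2` stand-in for the custom passes are assumptions made for the example, and the thread does not confirm that the resulting object can be loaded by the ROCm runtime.

```python
import shlex


def build_pipeline(kernel_src, mcpu="fiji", triple="amdgcn-amd-amdhsa",
                   opt_passes=("-O2",)):
    """Return clang, opt and llc command lines (argument lists) for one kernel.

    Hypothetical helper: it builds the commands but does not run them.
    """
    stem = kernel_src.rsplit(".", 1)[0]
    ir, opt_ir, obj = f"{stem}.ll", f"{stem}.opt.ll", f"{stem}.o"

    # Step 1: OpenCL C -> textual LLVM IR for the AMD GCN target.
    clang = ["clang", "-x", "cl", "-Xclang", "-finclude-default-header",
             "-target", triple, f"-mcpu={mcpu}",
             "-S", "-emit-llvm", kernel_src, "-o", ir]

    # Step 2: apply transformations to the IR.
    # "-O2" is only a placeholder for the custom optimization passes.
    opt = ["opt", *opt_passes, "-S", ir, "-o", opt_ir]

    # Step 3: lower the transformed IR to a GCN object file.
    llc = ["llc", f"-mtriple={triple}", f"-mcpu={mcpu}",
           "-filetype=obj", opt_ir, "-o", obj]

    return [clang, opt, llc]


if __name__ == "__main__":
    for cmd in build_pipeline("kernel.cl"):
        print(shlex.join(cmd))
```

Each returned list can be passed to `subprocess.run` once the LLVM tools are on the `PATH`; the exact flags accepted may vary between LLVM releases.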

@gstoner
Member

gstoner commented Dec 15, 2016 via email

@rjfnobre
Author

Because I'm currently running experiments with LLVM as part of university research, and I would like the paper to cover AMD GPUs as well, rather than only NVIDIA GPUs.

@rjfnobre
Author

rjfnobre commented Dec 18, 2016

Can you take a look at this?
ROCm/ROCm#61

Thanks for your time.
