[amdgpu] Part1 add codegen #6469

Merged 27 commits on Jan 10, 2023

galeselee
Contributor

Issue: #6434

Brief Summary

This part adds the CHI IR -> LLVM IR code generation. As in part0, the skeleton follows the cuda backend. Some important notes:

  1. ocml is the math library, analogous to NVIDIA's libdevice. It provides most of the functions we need; the remaining ones, such as sgn and abs, are implemented by hand.
  2. Kernel parameter passing differs from CUDA: extra_args is the only mechanism the device kernel accepts.
  3. The code still uses NVIDIA jargon (e.g. sm, block, and grid).
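
To illustrate note 2, here is a minimal host-side sketch of packing kernel arguments into a single contiguous buffer, which is the shape the HIP `extra` launch parameter expects. `ArgBuffer` and `push` are hypothetical names for illustration, not code from this PR, and the sketch assumes host alignment rules match what the kernel expects.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper (not from the PR): collects kernel arguments into one
// byte buffer. Each value is placed at an offset that is a multiple of its
// host alignment; the assumption is that the device kernel uses the same
// layout.
struct ArgBuffer {
  std::vector<unsigned char> bytes;

  // Appends `value` and returns the byte offset at which it was stored.
  template <typename T>
  std::size_t push(const T &value) {
    const std::size_t align = alignof(T);
    // Round the current size up to the next multiple of `align`.
    const std::size_t offset = (bytes.size() + align - 1) / align * align;
    bytes.resize(offset + sizeof(T), 0);  // padding bytes are zero-filled
    std::memcpy(bytes.data() + offset, &value, sizeof(T));
    return offset;
  }
};

// With the HIP runtime available, the packed buffer would be handed to the
// launch call through its `extra` parameter, roughly as follows (kernelParams
// is left as nullptr):
//
//   std::size_t size = args.bytes.size();
//   void *config[] = {HIP_LAUNCH_PARAM_BUFFER_POINTER, args.bytes.data(),
//                     HIP_LAUNCH_PARAM_BUFFER_SIZE, &size,
//                     HIP_LAUNCH_PARAM_END};
//   hipModuleLaunchKernel(func, grid_dim, 1, 1, block_dim, 1, 1,
//                         shared_mem_bytes, stream, nullptr, config);
```

For example, pushing a 32-bit integer followed by a pointer leaves padding between the two so that the pointer starts at an aligned offset. The actual packing logic in the backend may differ in detail.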


netlify bot commented Oct 28, 2022

Deploy Preview for docsite-preview ready!

Name Link
🔨 Latest commit baaa3dc
🔍 Latest deploy log https://app.netlify.com/sites/docsite-preview/deploys/63bd04613429be0008cfa288
😎 Deploy Preview https://deploy-preview-6469--docsite-preview.netlify.app

@galeselee galeselee changed the title [amdgpu] Part1 add codegen [amdgpu] Part1 add codegen Oct 31, 2022
Review thread on cmake/TaichiCore.cmake: resolved
@galeselee galeselee changed the title [amdgpu] Part1 add codegen [amdgpu] Don't merge part1 add codegen Jan 4, 2023
@galeselee galeselee changed the title [amdgpu] Don't merge part1 add codegen [amdgpu] Part1 add codegen Jan 10, 2023
@galeselee galeselee merged commit b1c599c into taichi-dev:master Jan 10, 2023
quadpixels pushed a commit to quadpixels/taichi that referenced this pull request May 13, 2023
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Labels
amdgpu AMDGPU backend