[Backend] DirectX (D3D12, DXIL) Backend #5276

python3kgae · 2022-06-28T03:19:07Z

Concisely describe the proposed feature
I would like to add a DirectX 12 backend to the compiler so that I can use new features that require shader model >= 6.0.

Describe the solution you'd like (if any)
The plan has 3 steps.

1. Add LLVM 15 support, which is required to generate DXIL.
   - Code changes will be guarded with something like "#ifdef TI_LLVM_15" where LLVM 15 requires a different implementation.
   - A CMake option, TI_LLVM_15, will be added to control this; it will be OFF by default.
2. Add AOT compilation first, which can be tested by adding a C++ DirectX 12 test without the backend.
   - DXIL support in LLVM 15 is still at an early stage. It will be a long journey to support all features.
   - CodeGenLLVM will be used as the base class for LLVM IR generation during codegen. Some LLVM passes will then be added to prepare for DXIL generation, and finally the DirectX backend in LLVM 15 will be used to generate DXIL.
   - A kernel with more than one task is new for DXIL; there is no plan to support this at first.
3. Add the DirectX 12 backend.
   - I'm not an expert on this part :(
   - Maybe start by supporting simple cases first.
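
A minimal sketch of the guard described in step 1. Only the macro name TI_LLVM_15 comes from the plan above; the helper function is hypothetical, and how the CMake option would define the macro (for example as a compile definition) is an assumption.

```cpp
#include <cassert>
#include <string>

// TI_LLVM_15 is the macro name proposed in the plan. This sketch assumes the
// CMake option of the same name would define it when switched ON.
// llvm_code_path() is a hypothetical helper that only reports which branch
// was compiled in.
inline std::string llvm_code_path() {
#ifdef TI_LLVM_15
  // The LLVM 15 specific implementation would go here.
  return "llvm15";
#else
  // Default (option OFF): the existing implementation stays untouched.
  return "llvm10";
#endif
}
```

With the option OFF, the existing code path is the only one that gets compiled, so the default build should not change.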

Additional comments
I'm working on DXIL generation in LLVM 15. Targeting a language other than HLSL to DXIL will help the design of DXIL generation.
This is personal work, not an official contribution (which would require review and approval).

k-ye · 2022-06-28T08:59:25Z

Hi @python3kgae ,

Thanks for proposing this, it looks really attractive!

> Add llvm 15 support which is required for generate DXIL.

Note that we are (conservatively) planning to upgrade to LLVM-12. It's probably fine for us to go directly to 15 as well. One thing we've recently noticed is that installing clang-13 (and likely later versions) via apt-get is super slow. This could potentially hurt our external contributors...

(I've followed https://apt.llvm.org/ for the clang-13 installation; not sure if you have better suggestions here.)

> Then some llvm passes will be added to prepare for DXIL generation and finally use DirectX backend in llvm 15 to generate DXIL.

I see. I'm not sure how feasible it is to share the generated LLVM IR between CUDA and DXIL. Hopefully this is not a problem :-) (Take SPIR-V as an example: although it claims to support both Vulkan and OpenCL, the two use very different execution models, which makes it nontrivial to share the same SPIR-V between these two APIs.)

> No an expert on this part :( Maybe start support for simple cases first.

If we can get a working codegen, the D3D12 runtime API shouldn't be too hard to sort out :-)

python3kgae · 2022-06-28T15:56:50Z

> Note that we are (conservatively) planning to upgrade to LLVM-12. It's probably fine for us to directly go to 15 as well. One thing we've recently noticed, is the installation speed from apt-get for clang-13 (and likely beyond) is super slow. This could potentially hurt our external contributors...
>
> (Ive followed https://apt.llvm.org/ for clang-13 installation, not sure if you have better suggestions here.)

If clang is only used for COMPILE_LLVM_RUNTIME, we can keep using clang-10 for now. LLVM 15 should be OK to link bitcode generated with LLVM 10.
Once apt.llvm.org fixes the issue, we can update clang to a higher version.

ailzhang · 2022-07-01T08:43:13Z

@python3kgae COMPILE_LLVM_RUNTIME isn't the only place that requires clang version 10. I'm afraid that https://github.com/taichi-dev/taichi/blob/master/cmake/TaichiCore.cmake#L253-L288 also requires headers from llvm-10, so the existing codebase might need some effort to work with llvm-15.
But we'd be more than happy to provide guidance if you're interested in hacking a prototype for this. How about joining our Slack workspace so that we can discuss what an MVP looks like? https://taichicommunity.slack.com/join/shared_invite/zt-14ic8j6no-Fd~wKNpfskXLfqDr58Tddg#/shared-invite/email Thanks!

python3kgae · 2022-07-01T13:38:30Z

@ailzhang I checked https://github.com/taichi-dev/taichi/blob/master/cmake/TaichiCore.cmake#L253-L288; it is where the LLVM headers and libs are added. It only requires LLVM version >= 10, so LLVM 15 doesn't require any change for this part.

I've got the taichi repo compiling with LLVM 15. Most of the changes are related to the opaque pointer feature, which requires passing the type explicitly when creating a LoadInst/GEP. Currently I'm hacking around this by getting the type from the LLVM type; I need help getting it from the Taichi type instead.
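
To illustrate the idea, here is a toy sketch (not actual Taichi or LLVM code). With opaque pointers every pointer is just "ptr" in the IR, so the element type has to come from somewhere else, for example from the frontend type. The enum, the mapping, and the string-based emitter below are all hypothetical stand-ins.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for a frontend (Taichi-side) primitive type.
enum class PrimType { i32, f32, f64 };

// Hypothetical mapping from the frontend type to its LLVM IR type name.
inline std::string ir_type_name(PrimType t) {
  switch (t) {
    case PrimType::i32: return "i32";
    case PrimType::f32: return "float";
    case PrimType::f64: return "double";
  }
  return "";  // unreachable for the enumerators above
}

// Emits a textual load. The pointer operand is always printed as "ptr", so
// the loaded type must be supplied by the caller rather than read back from
// the pointer.
inline std::string emit_load(PrimType elem, const std::string &result,
                             const std::string &ptr) {
  return result + " = load " + ir_type_name(elem) + ", ptr " + ptr;
}
```

For example, emit_load(PrimType::f32, "%v", "%p") yields "%v = load float, ptr %p". The real change would pass an llvm::Type derived from the Taichi type to the builder instead of building strings.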

qiao-bo · 2022-07-04T02:05:07Z

> One thing we've recently noticed, is the installation speed from apt-get for clang-13 (and likely beyond) is super slow. This could potentially hurt our external contributors...
> (Ive followed https://apt.llvm.org/ for clang-13 installation, not sure if you have better suggestions here.)

This could be specific to the nightly package. I tried the release version on Ubuntu 22.04, and the time for apt-get install clang-13 seems OK. But current 20.04 users will indeed have to suffer... In the long term, this should not be a blocker.

…5998) Two passes are added for DXIL generation. TaichiIntrinsicLower translates Taichi intrinsics such as thread_idx into the form the DirectX backend expects. TaichiRuntimeContextLower will translate the TaichiRuntimeContext kernel parameter into Buffers/ConstantBuffers; it is empty for now. It is added after inlining so that optimizations reduce the loads/stores on temporary pointers, which also makes it easier to tell when a store targets the TaichiRuntimeContext. Related issue = #5276 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Only make sure the pipeline generates something. No real DXIL is generated yet. Move the DX12 build to the GPU CI, which will run the AOT test. Issue: #5276 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Add Dx12ProgramImpl. Prepare to launch dx12 kernels the way dx11 does instead of through LlvmRuntimeExecutor, because dx12 uses buffers instead of pointers directly. Issue: #5276 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

python3kgae added the "feature request" label Jun 28, 2022

erizmr assigned ailzhang Jul 1, 2022

python3kgae mentioned this issue Jul 2, 2022

[llvm] Drop code for llvm 15. #5313

Merged

ailzhang removed their assignment Jul 4, 2022

This was referenced Jul 7, 2022

[ci] Llvm15 clang10 ci #5368

Merged

[llvm] Allow using clang 15 for COMPILE_LLVM_RUNTIME #5381

Merged

This was referenced Jul 31, 2022

[ci] Update llvm15 prebuild binary. #5581

Merged

[dx12] Drop code for dx12 aot. #5596

Closed

[ci] Add PR tag for dx12. #5614

Merged

This was referenced Aug 20, 2022

[ci] Switch windows cpu build to llvm 15. #5832

Merged

[llvm] Fix PtrOffset address for shared array in llvm 15. #5867

Merged

This was referenced Sep 2, 2022

[dx12] Drop code for dx12 codegen. #5953

Merged

[dx12] Drop code for dxil generation. #5958

Merged

[dx12] Drop code for llvm passes which prepare for DXIL generation. #5998

Merged

python3kgae mentioned this issue Sep 15, 2022

[ci] [dx12] Enable dx12 build for windows cpu ci. #6069

Merged

ailzhang pushed a commit that referenced this issue Sep 15, 2022

[ci] [dx12] Enable dx12 build for windows cpu ci. (#6069)

4d94b31

Fix build failure and enable the dx12 build for the Windows CPU CI to make sure it compiles. Related issue = #5276

This was referenced Sep 18, 2022

[ci] Update prebuild binary for llvm 15. #6091

Merged

[dx12] Update codegen for range_for and mesh_for #6092

Merged

ailzhang pushed a commit that referenced this issue Sep 19, 2022

[ci] Update prebuild binary for llvm 15. (#6091)

5e5321e

Built from the release/15.x branch with some patches for DX12 cherry-picked. Issue: #5276

ailzhang pushed a commit that referenced this issue Sep 19, 2022

[dx12] Update codegen for range_for and mesh_for (#6092)

a8b5fc2

Copied from cuda codegen. Issue: #5276 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

python3kgae mentioned this issue Sep 19, 2022

[dx12] Add aot for dx12. #6099

Merged

python3kgae mentioned this issue Sep 27, 2022

[dx12] Add ti.dx12. #6174

Merged

python3kgae mentioned this issue Oct 9, 2022

[dx12] Add DirectX-Headers as a submodule #6259

Merged

ailzhang pushed a commit that referenced this issue Oct 10, 2022

[dx12] Add DirectX-Headers as a submodule (#6259)

8fe7818

Add DirectX-Headers as a submodule. Issue: #5276

python3kgae mentioned this issue Oct 17, 2022

[dx12] Only use llvm to compile dx12. #6339

Merged
