
[TIR, TVMScript] Add TIR - Triton integration #17395

Merged: 8 commits merged into apache:main on Sep 23, 2024

Conversation

vinx13 (Member) commented Sep 20, 2024

Added a macro `T.call_triton` to the TIR script parser, which expands to AOT compilation of the kernel plus the host TIR code that launches it.

cc @tqchen @cyx-6
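
For illustration, here is a minimal sketch of how the macro might be used from TVMScript. The vector-add kernel, the buffer shapes, and the launch-argument convention (grid=..., BLOCK=...) are assumptions for this example, not taken from the PR.

import triton
import triton.language as tl
from tvm.script import ir as I
from tvm.script import tir as T


@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK: tl.constexpr):
    # Standard Triton vector add: each program instance handles one block.
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK + tl.arange(0, BLOCK)
    mask = offsets < n_elements
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)


@I.ir_module
class Module:
    @T.prim_func
    def main(
        x: T.Buffer((256,), "float32"),
        y: T.Buffer((256,), "float32"),
        out: T.Buffer((256,), "float32"),
    ):
        # Expands at parse time into (a) AOT compilation of add_kernel and
        # (b) host TIR that launches the compiled device kernel.
        T.call_triton(add_kernel, x, y, out, 256, grid=(4,), BLOCK=64)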

tqchen (Member) commented Sep 20, 2024

This is a great mechanism for integrating kernel generators. Some notes on design that might help generalize it a bit:

Would be great to change the intrinsic to T.call_kernel, which checks the type of the first parameter and dispatches accordingly.

  • Let us think of a base class tir.kernel.BaseKernel which implements the base methods needed:
    • compile_to_device_module(), which calls get_meta_data to obtain the kernel metadata
    • a registry from class name to Kernel constructor (can be in a follow-up PR):
      • the import and construction of the related kernel wrapper then happens only when we see the related class name
      • call_kernel will look up the registry and construct the related kernel if needed
    • TritonKernel subclasses this; call_kernel looks up its constructor, constructs it, and calls the related class methods to generate the necessary downstream modules (see the sketch after this list)
  • Let us also add a tir.kernel.CUDAKernel as an example extension point:
    • CUDAKernel takes in CUDA C source device code and ensures the call_kernel mechanism works the same
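
A minimal sketch of what this hierarchy could look like; the class names follow the suggestion above, but the method signatures and bodies are assumptions, not actual TVM APIs:

from abc import ABC, abstractmethod


class BaseKernel(ABC):
    """Base class for external kernels callable through T.call_kernel."""

    @abstractmethod
    def get_meta_data(self):
        """Return kernel metadata (entry name, launch params, arg types, ...)."""

    @abstractmethod
    def compile_to_device_module(self, target):
        """AOT-compile the kernel source into a device module, using
        get_meta_data() to describe the entry point."""


class TritonKernel(BaseKernel):
    """Wraps a triton.JITFunction."""

    def __init__(self, jit_func):
        self.jit_func = jit_func

    def get_meta_data(self):
        ...  # derived from the JITFunction signature

    def compile_to_device_module(self, target):
        ...  # e.g. triton.compile(...) -> PTX, wrapped as a CUDA device module


class CUDAKernel(BaseKernel):
    """Example extension point: wraps raw CUDA C device source code."""

    def __init__(self, source, kernel_name):
        self.source = source
        self.kernel_name = kernel_name

    def get_meta_data(self):
        ...  # parsed from the kernel signature in `source`

    def compile_to_device_module(self, target):
        ...  # e.g. compile with nvrtc into a CUDA device module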

Overall Mechanism

  • A new DSL/kernel defines a subclass of tir.kernel.Kernel.
    • The subclass describes how to compile the source to a device module.
    • It also registers itself into the class name => Kernel type mapping.
  • The user can either manually construct an instance of tir.kernel.Kernel, or pass in the original data structure (e.g. triton.JITFunction); call_kernel then leverages the registered mapping to do the conversion automatically (see the sketch after this list).
  • call_kernel can carry possible specialization hints (constants, etc.).
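
A sketch of the registry idea, reusing the BaseKernel class from the sketch above; the helper names (KERNEL_REGISTRY, register_kernel, resolve_kernel) are hypothetical:

# Maps fully qualified class names to Kernel wrapper constructors, so a
# wrapper (and its heavyweight imports) is only instantiated when the
# matching class name is actually seen.
KERNEL_REGISTRY = {}


def register_kernel(class_name):
    def decorator(kernel_cls):
        KERNEL_REGISTRY[class_name] = kernel_cls
        return kernel_cls
    return decorator


def resolve_kernel(obj):
    # call_kernel would accept either a ready-made Kernel instance or the
    # original object (e.g. a triton.JITFunction) and convert automatically.
    if isinstance(obj, BaseKernel):
        return obj
    key = f"{type(obj).__module__}.{type(obj).__qualname__}"
    if key not in KERNEL_REGISTRY:
        raise TypeError(f"no kernel wrapper registered for {key}")
    return KERNEL_REGISTRY[key](obj)

TritonKernel would then be registered under the class name of Triton's JIT function wrapper, e.g. register_kernel("triton.runtime.jit.JITFunction")(TritonKernel); the exact qualified name is an assumption.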

try:
    import triton
except ImportError:
    # Skip the entire test module when Triton is not installed.
    pytestmark = pytest.skip("Triton is not available", allow_module_level=True)


@tvm.testing.requires_cuda
Review comment (Member):

add a test case for the rewrite use case

tqchen merged commit 48d3ada into apache:main on Sep 23, 2024 (17 of 18 checks passed)
if tir_mod is not None and len(tir_mod.get_global_vars()) > 0:
    lib = tvm.build(
        tir_mod,
        target=target,
        runtime=_autodetect_system_lib_req(target, system_lib),
    )
return Executable(_ffi_api.VMLink(builder, target, lib, ext_libs, params))  # type: ignore

for ext_mod in ext_libs:
    if ext_mod.type_key == "cuda":
Review comment (Member):

as a follow-up, add a function to check whether a module is a device module (is_device_module); this should include cuda, rocm, webgpu, vulkan, and opencl
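
A minimal sketch of such a helper, assuming the backend is identified by the module's type_key as in the ext_mod.type_key == "cuda" check above; the helper is hypothetical and not part of TVM at the time of this PR:

# Hypothetical helper; the set of backends follows the comment above.
_DEVICE_MODULE_TYPE_KEYS = {"cuda", "rocm", "webgpu", "vulkan", "opencl"}


def is_device_module(mod) -> bool:
    """Return True if `mod` holds device (GPU) code rather than host code."""
    return mod.type_key in _DEVICE_MODULE_TYPE_KEYS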
