Support JIT Offline Cache for Taichi #4401

PGZXB · 2022-02-28T04:04:18Z

bobcao3 · 2022-03-01T00:52:49Z

Requesting for extending this to all back ends, considering the huge range of users on Mac or non-cuda laptops

PGZXB · 2022-03-01T01:14:32Z

Requesting for extending this to all back ends, considering the huge range of users on Mac or non-cuda laptops

Yes, impl the feature for all backends is my goal.

PGZXB · 2022-03-10T01:27:05Z

#4401 (comment)

bobcao3 · 2022-03-14T07:05:04Z

I argue strongly against this solution. We have profiles showing the LLVM codegen takes about 30% of the entire JIT codegen time, it would be much wiser to spend time figuring out AST->CHI-IR caching first.

bobcao3 · 2022-03-14T07:06:18Z

A two staged caching gives a major JIT time boost to all backends, I'd argue this is a lot cleaner to implement as well compared to having one stage caching for each individual backend, which will cause maintenance problems down the line

PGZXB · 2022-03-14T07:34:16Z

A two staged caching gives a major JIT time boost to all backends, I'd argue this is a lot cleaner to implement as well compared to having one stage caching for each individual backend, which will cause maintenance problems down the line
I argue strongly against this solution. We have profiles showing the LLVM codegen takes about 30% of the entire JIT codegen time, it would be much wiser to spend time figuring out AST->CHI-IR caching first.

Step by step. Maybe temporary solution. We don't have serialization of CHI IR now. After CHI IR's serialization is implemented, maybe 2-level cache is better, especially for multi backends...

ps. I think CHI IR's serialization is very important for standardizing CHI IR, which needs a feasible efficient standard (more .adj to show the importance I think) solution, like llvm-ir, IL, Java bytecode, intel-asm, which is not easy...

k-ye · 2022-03-14T09:03:26Z

Is there a middle ground we can find out? E.g. how easy is it for us to migrate the implementation from caching LLVM to caching CHI IR? If most users don't care about the internal implementation of the cache, I expect the following scenario to happen:

At first, they can only benefit from the caching behavior for CUDA/CPU backends
Then after release X, they find out the caching is working for all the backends automatically.

In addition, IMHO the complexity still comes from the cache key part (considering all the involved global states). The cached contents can be adjusted fairly easily, provided that CHI IR serialization is implemented.

PGZXB · 2022-03-14T09:32:22Z

Is there a middle ground we can find out? E.g. how easy is it for us to migrate the implementation from caching LLVM to caching CHI IR? If most users don't care about the internal implementation of the cache, I expect the following scenario to happen:

At first, they can only benefit from the caching behavior for CUDA/CPU backends

Then after release X, they find out the caching is working for all the backends automatically.

The (new) implementation of offline-cache is transparent. All logic is in C++ side. Frontend only see the offline_cache: bool and offline_cache_file_path: str options. If we have serialization and deserialization of CHI IR, implementing caching CHI IR will be simple. Maybe doing it after standardizing CHI IR is better. After release X, users can also use it by simply set options without any migration cost. And maybe multilevel cache is optional(even better) solution, running backend lang directly is fastest.

PGZXB · 2022-03-14T09:38:41Z

In addition, IMHO the complexity still comes from the cache key part (considering all the involved global states). The cached contents can be adjusted fairly easily, provided that CHI IR serialization is implemented.

Can't agree more. Because taichi's kernels depends on global vars/states, generating a key which can uniquely identifies a kernel is difficult and the key of implementing caching a kernel. And, at present, before we have a standardized de/serializable CHI IR, dumping and loading & running backend-language is more simple than CHI IR because they have mature/standard solution.

ps. Overhead of generating key is what we should consider. Python -> Taichi AST -> CHI IR -> Backend lang. From left to right:

overhead of generating key ↑ ,
overhead of loading & running offline-cache-file ↓ ,
difficulty of generating cache key which can uniquely identifies a kernel ↓

Issue: #4401 * Fix a potential bug in metal AOT * Prepare for implementing offline cache on metal Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

#6233) Issue: #4401

PGZXB · 2022-10-08T10:05:54Z

Supported or not

Backend	Supported or not	Overhead (running Cornell Box)
CPU	✔	393.25ms
CUDA	✔	882.426ms
Vulkan	✔	218.030ms
OpenGL	✔
Metal	✔
AMDGPU	✔
Microsoft DirectX 11	✔
Microsoft DirectX 12	❌	N/A

P.S.

The "overhead" is the time spent on loading cached compiled data and converting it to a callable object.
Testing environment:
- OS: Windows 11, CPU: Intel(R) Core(TM) i7-10710U CPU @ 1.10GHz 1.61 GHz, RAM: 16GB for CPU, CUDA, OpenGL and Vulkan
⏩: Working in progress

Issue: #4401 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Issue: #6263, #4401 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Issue: #6263, #4401 Co-authored-by: Yi Xu <xy_xuyi@foxmail.com>

Issue: #4401 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Issue: #4401, #6614 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Issue: taichi-dev#4401, taichi-dev#6614 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

#7005) Issue: #4401

taichi-dev#7005) Issue: taichi-dev#4401

Issue: #7002, #4401 ### Brief Summary This PR: 1. Introduced `KernelCompilationManager` to unify implementation of the Offline Cache; 2. Used `KernelCompilationManager` re-impl JIT, Offline Cache on gfx backends (vulkan, metal, dx11, opengl); 3. Removed the `gfx::CacheManager`. --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

) Issue: taichi-dev#4401 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Issue: taichi-dev#4401, taichi-dev#6614 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

taichi-dev#7005) Issue: taichi-dev#4401

Issue: taichi-dev#7002, taichi-dev#4401 ### Brief Summary This PR: 1. Introduced `KernelCompilationManager` to unify implementation of the Offline Cache; 2. Used `KernelCompilationManager` re-impl JIT, Offline Cache on gfx backends (vulkan, metal, dx11, opengl); 3. Removed the `gfx::CacheManager`. --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

PGZXB added discussion Welcome discussion! llvm LLVM backend labels Feb 28, 2022

PGZXB self-assigned this Feb 28, 2022

qiao-bo added this to To Triage in Lang Features & Python via automation Feb 28, 2022

PGZXB mentioned this issue Mar 1, 2022

[refactor] Refactor llvm-offloaded-task-name mangling #4418

Merged

PGZXB mentioned this issue Mar 10, 2022

[opt] Support offline-cache for kernel with arch=cpu #4500

Merged

PGZXB removed the discussion Welcome discussion! label Mar 11, 2022

PGZXB mentioned this issue Mar 13, 2022

[misc] Make result of irpass::print hold more information #4517

Merged

This was referenced Mar 29, 2022

[refactor] Remove Expression::serialize and add ExpressionHumanFriendlyPrinter #4657

Merged

[misc] Add compile-config to offline-cache key #4681

Merged

[misc] Add SNode to offline-cache key #4716

Merged

PGZXB changed the title ~~Support offline-cache for llvm backend of Taichi~~ Support offline-cache for Taichi Apr 12, 2022

PGZXB changed the title ~~Support offline-cache for Taichi~~ Support JIT Offline Cache for Taichi Apr 12, 2022

This was referenced Apr 12, 2022

[misc] Remove some unnecessary attributes from offline-cache key of compile-config #4770

Merged

[test] Add simple test for offline-cache-key of compile-config #4805

Merged

k-ye mentioned this issue Apr 18, 2022

Passing struct vs deferencing fields in struct performance #4802

Open

This was referenced Oct 3, 2022

[metal] Support offline cache on metal #6227

Merged

[vulkan] [opengl] Enable offline cache by default on Vulkan and OpenGL #6233

Merged

PGZXB added a commit that referenced this issue Oct 8, 2022

[vulkan] [opengl] Enable offline cache by default on Vulkan and OpenGL (

5c6ad43

#6233) Issue: #4401

PGZXB added a commit that referenced this issue Oct 10, 2022

[metal] Support offline cache on metal (#6227)

38c0277

Issue: #4401 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

This was referenced Oct 10, 2022

Release the Offline Cache in v1.1.4 (Tracker) #6263

Closed

[bug] Fix AotModuleBuilder::add_compiled_kernel #6287

Merged

[metal] Enable offline cache by default on Metal #6307

Merged

PGZXB added a commit that referenced this issue Oct 13, 2022

[metal] Enable offline cache by default on Metal (#6307)

7c6f910

Issue: #6263, #4401 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

PGZXB mentioned this issue Oct 13, 2022

[doc] Update the document about offline cache #6313

Merged

PGZXB added a commit that referenced this issue Oct 14, 2022

[doc] Update the document about offline cache (#6313)

4db2b31

Issue: #6263, #4401 Co-authored-by: Yi Xu <xy_xuyi@foxmail.com>

PGZXB mentioned this issue Nov 4, 2022

[test] Test real function on enabling the offline cache #6523

Merged

PGZXB removed llvm LLVM backend vulkan Vulkan backend labels Nov 5, 2022

PGZXB added a commit that referenced this issue Nov 7, 2022

[test] Test real function on enabling the offline cache (#6523)

0cae0e2

Issue: #4401 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

This was referenced Dec 18, 2022

[bug] Add GetElementExpression to offline cache key #6918

Merged

[misc] Show suggestion when locking metadata.lock fails #6919

Merged

PGZXB added a commit that referenced this issue Dec 19, 2022

[bug] Add GetElementExpression to offline cache key (#6918)

5e86680

Issue: #4401, #6614 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

PGZXB added a commit to PGZXB/taichi that referenced this issue Dec 19, 2022

[bug] Add GetElementExpression to offline cache key (taichi-dev#6918)

1e6dd97

Issue: taichi-dev#4401, taichi-dev#6614 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

This was referenced Dec 28, 2022

Refactor kernel compilation #7002

Closed

[Error] Do not show warning when the offline cache path does not exist #7005

Merged

PGZXB added a commit that referenced this issue Dec 30, 2022

[Error] Do not show warning when the offline cache path does not exist (

793c4a2

#7005) Issue: #4401

feisuzhu pushed a commit to feisuzhu/taichi that referenced this issue Jan 5, 2023

[Error] Do not show warning when the offline cache path does not exist (

83b04dc

taichi-dev#7005) Issue: taichi-dev#4401

PGZXB mentioned this issue Feb 21, 2023

[refactor] Introduce KernelCompilationManager #7409

Merged

quadpixels pushed a commit to quadpixels/taichi that referenced this issue May 13, 2023

[test] Test real function on enabling the offline cache (taichi-dev#6523

191da4e

) Issue: taichi-dev#4401 Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

quadpixels pushed a commit to quadpixels/taichi that referenced this issue May 13, 2023

[Error] Do not show warning when the offline cache path does not exist (

78a7973

taichi-dev#7005) Issue: taichi-dev#4401

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support JIT Offline Cache for Taichi #4401

Support JIT Offline Cache for Taichi #4401

PGZXB commented Feb 28, 2022 •

edited

bobcao3 commented Mar 1, 2022

PGZXB commented Mar 1, 2022

PGZXB commented Mar 10, 2022 •

edited

bobcao3 commented Mar 14, 2022

bobcao3 commented Mar 14, 2022

PGZXB commented Mar 14, 2022 •

edited

k-ye commented Mar 14, 2022 •

edited

PGZXB commented Mar 14, 2022 •

edited

PGZXB commented Mar 14, 2022 •

edited

PGZXB commented Oct 8, 2022 •

edited

Support JIT Offline Cache for Taichi #4401

Support JIT Offline Cache for Taichi #4401

Comments

PGZXB commented Feb 28, 2022 • edited

Solution

Workflow (on llvm backends)

Todo & Memo

Usage

Supported backends

Potential Bugs

bobcao3 commented Mar 1, 2022

PGZXB commented Mar 1, 2022

PGZXB commented Mar 10, 2022 • edited

bobcao3 commented Mar 14, 2022

bobcao3 commented Mar 14, 2022

PGZXB commented Mar 14, 2022 • edited

k-ye commented Mar 14, 2022 • edited

PGZXB commented Mar 14, 2022 • edited

PGZXB commented Mar 14, 2022 • edited

PGZXB commented Oct 8, 2022 • edited

Supported or not

PGZXB commented Feb 28, 2022 •

edited

PGZXB commented Mar 10, 2022 •

edited

PGZXB commented Mar 14, 2022 •

edited

k-ye commented Mar 14, 2022 •

edited

PGZXB commented Mar 14, 2022 •

edited

PGZXB commented Mar 14, 2022 •

edited

PGZXB commented Oct 8, 2022 •

edited