
[WIP] on-disk caching of CUDA kernels #8079

Closed
gmarkall wants to merge 12 commits

Conversation

gmarkall (Member)

Testing on CI as I work on the tests.

The following:

```python
from numba import cuda

@cuda.jit(cache=True)
def f():
    pass

f[1, 1]()
```

runs and saves cache files to disk. A subsequent run that loads from the
cache still fails, presently because the codegen can't be pickled due to a
ctypes pointer.
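The underlying limitation is that ctypes objects containing pointers refuse to pickle at all; a minimal illustration of that failure mode, independent of Numba's own code:

```python
import ctypes
import pickle

# Minimal illustration (not Numba code): pickling any ctypes object that
# contains a pointer raises ValueError, which is the same failure mode hit
# when codegen state holding a ctypes pointer is serialised for the cache.
ptr = ctypes.pointer(ctypes.c_int(0))
try:
    pickle.dumps(ptr)
except ValueError as exc:
    print(exc)  # "ctypes objects containing pointers cannot be pickled"
```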
This incorporates fixes to the PTX pickling from @c200chromebook:

https://gist.github.com/c200chromebook/1ee304161a39b247e8a029ff9d2b3cd7
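For context, fixes of this kind usually follow the standard pattern of dropping the unpicklable handle in `__getstate__` and rebuilding it lazily after unpickling. A generic sketch with hypothetical names (not the code from the gist or from this PR):

```python
import pickle

# Hypothetical class illustrating the usual workaround for unpicklable
# ctypes handles: serialise only the plain data and recreate the handle
# on demand after loading.
class PTXModule:
    def __init__(self, ptx_source):
        self.ptx_source = ptx_source   # plain text, picklable
        self._driver_handle = None     # e.g. a ctypes pointer, not picklable

    def __getstate__(self):
        state = self.__dict__.copy()
        state["_driver_handle"] = None  # discard the unpicklable handle
        return state

    def __setstate__(self, state):
        self.__dict__.update(state)     # handle is rebuilt lazily on first use


mod = PTXModule(".version 7.0\n.target sm_50\n// ...")
restored = pickle.loads(pickle.dumps(mod))
assert restored.ptx_source == mod.ptx_source
```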

For the simple test at the top of this description, the following CUDA Driver API calls are seen on the first invocation:

```
== CUDA [427] DEBUG -- call driver api: cuLinkCreate_v2
== CUDA [428] DEBUG -- call driver api: cuLinkAddData_v2
== CUDA [428] DEBUG -- call driver api: cuLinkComplete
== CUDA [428] DEBUG -- call driver api: cuLinkDestroy
```

but are absent from a subsequent run.
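
To reproduce the comparison, the log lines above appear to come from Numba's CUDA driver API logging; a sketch, assuming it is enabled via the NUMBA_CUDA_LOG_LEVEL environment variable set before the driver is initialised:

```python
# A sketch for reproducing the driver log above, assuming Numba's CUDA driver
# API logging is controlled by NUMBA_CUDA_LOG_LEVEL, which must be set before
# the CUDA driver is initialised.
import os
os.environ["NUMBA_CUDA_LOG_LEVEL"] = "DEBUG"

from numba import cuda

@cuda.jit(cache=True)
def f():
    pass

# First invocation: expect cuLinkCreate_v2 / cuLinkAddData_v2 / cuLinkComplete /
# cuLinkDestroy in the debug output. On a run that hits the on-disk cache,
# those cuLink* calls should be absent.
f[1, 1]()
```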
gmarkall added the 2 - In Progress and CUDA (CUDA related issue/PR) labels May 19, 2022
gmarkall added this to the Numba 0.56 RC milestone May 19, 2022
gmarkall (Member, Author)

I started with the wrong branch here and public CI is busy, so I'll close this now.

gmarkall closed this May 19, 2022
gmarkall added the abandoned (PR is abandoned, no reason required) label and removed the 2 - In Progress label May 19, 2022
Labels: abandoned (PR is abandoned, no reason required), CUDA (CUDA related issue/PR)

1 participant