Cache numba.cuda functions on repeated deserialization #4590
Labels
CUDA
CUDA related issue/PR
feature_request
performance - run time
Performance issue occurring at run time.
As of #3026 , Numba kindly returns the same function when an equivalent bytestring is deserialized many times. This is great for systems like Dask, which may send around the same numba function many times.
Currently, it looks like this isn't being done for numba.cuda functions, which ends up being a bottleneck in Dask + Numba GPU workloads.
cc @seibert
The text was updated successfully, but these errors were encountered: