Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a new cache for HostKernel objects. #1510

Merged
merged 1 commit into from
May 16, 2022
Merged

Add a new cache for HostKernel objects. #1510

merged 1 commit into from
May 16, 2022

Conversation

maleadt
Copy link
Member

@maleadt maleadt commented May 16, 2022

Now that GPUCompiler's compilation (and compilation cache) is using the function type,
we need to allocate HostKernel objects in CUDA.jl where we still have access to
the function instance. Doing so is relatively slow, so introduce another cache.

cc @jpsamaroo AMDGPU.jl probably needs something similar.

Now that GPUCompiler's compilation (and compilation cache) is using the function type,
we need to allocate HostKernel objects in CUDA.jl where we still have access to
the function instance. Doing so is relatively slow, so introduce another cache.
@codecov
Copy link

codecov bot commented May 16, 2022

Codecov Report

Merging #1510 (70ec0cd) into master (4c3edd2) will decrease coverage by 0.63%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #1510      +/-   ##
==========================================
- Coverage   73.53%   72.90%   -0.64%     
==========================================
  Files         131      131              
  Lines        9825     9834       +9     
==========================================
- Hits         7225     7169      -56     
- Misses       2600     2665      +65     
Impacted Files Coverage Δ
lib/cudadrv/module/function.jl 100.00% <100.00%> (+14.28%) ⬆️
src/compiler/execution.jl 87.24% <100.00%> (+0.61%) ⬆️
lib/cudnn/CUDNN.jl 37.50% <0.00%> (-35.94%) ⬇️
lib/cublas/CUBLAS.jl 50.00% <0.00%> (-25.44%) ⬇️
src/utilities.jl 68.91% <0.00%> (-4.06%) ⬇️
lib/cudadrv/CUDAdrv.jl 81.66% <0.00%> (-3.34%) ⬇️
lib/cudadrv/module/linker.jl 68.75% <0.00%> (-3.13%) ⬇️
lib/cudadrv/state.jl 77.84% <0.00%> (-2.28%) ⬇️
lib/cudadrv/memory.jl 78.59% <0.00%> (-1.01%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4c3edd2...70ec0cd. Read the comment docs.

@maleadt maleadt merged commit 137fb64 into master May 16, 2022
@maleadt maleadt deleted the tb/launch_overhead branch May 16, 2022 13:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant