
Reclaim in cuDNN conv algorithm search #1711

Merged
merged 1 commit into JuliaGPU:master from cudnn-algo-reclaim on Jan 2, 2023

Conversation

ToucheSir
Contributor

Per discussion on Slack. This dramatically improves the maximum batch size Flux models with conv layers can support without OOMing on GPU, e.g. https://discourse.julialang.org/t/memory-challenges-for-flux-on-resnet/85385.
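
A minimal sketch of the reclaim-and-retry idea behind this change, for illustration only and not the actual diff. `search_with_reclaim` and `find_conv_algorithm` are hypothetical placeholders standing in for the cuDNN convolution algorithm search; `CUDA.reclaim()`, `GC.gc`, and `CUDA.OutOfGPUMemoryError` are existing CUDA.jl / Julia APIs.

```julia
using CUDA

# Sketch (assumption, not the PR's actual code): if the cuDNN convolution
# algorithm search runs out of GPU memory while benchmarking candidate
# algorithms, reclaim cached pool memory and retry once.
function search_with_reclaim(find_conv_algorithm)
    try
        return find_conv_algorithm()
    catch err
        err isa CUDA.OutOfGPUMemoryError || rethrow()
        GC.gc(true)       # collect unreachable arrays so their GPU buffers can be freed
        CUDA.reclaim()    # return cached pool memory to the driver
        return find_conv_algorithm()   # retry the search with memory freed
    end
end
```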

@CarloLucibello
Contributor

Is there something we can do on the NNlibCUDA side while this lands in the next release? (I don't know how long that will take, but the last tag was in July.)
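
One user-side stopgap, offered here purely as an illustrative assumption rather than something prescribed in this thread, is to manually free cached GPU memory before memory-heavy steps until a release includes the fix. `train_with_reclaim`, `train_step!`, and the batch iterator are hypothetical placeholders; the reclaim calls are real CUDA.jl / Julia APIs.

```julia
using CUDA

# Hypothetical workaround sketch: release cached GPU memory before each
# memory-heavy training step to reduce the chance of OOM during the
# cuDNN algorithm search.
function train_with_reclaim(train_step!, model, batches)
    for batch in batches
        GC.gc(true)       # release unreachable GPU buffers
        CUDA.reclaim()    # return cached pool memory to the driver
        train_step!(model, batch)
    end
end
```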

@ToucheSir
Contributor Author

I'm not sure if CUDA.jl does backports, but if so I'd be happy to replicate this PR against any backport branch as well.

@maleadt
Member

maleadt commented Jan 2, 2023

> I'm not sure if CUDA.jl does backports, but if so I'd be happy to replicate this PR against any backport branch as well.

I've done backports in the past, so I'd be fine doing another backports release.

@maleadt maleadt merged commit 9362065 into JuliaGPU:master Jan 2, 2023
maleadt pushed a commit that referenced this pull request Jan 2, 2023
ToucheSir deleted the cudnn-algo-reclaim branch January 2, 2023 18:37
maleadt pushed a commit that referenced this pull request Jan 6, 2023
simonbyrne pushed a commit to simonbyrne/CUDA.jl that referenced this pull request Nov 13, 2023