make use of conv_bias_act #1302

Status: Open. Wants to merge 2 commits into master.
Conversation

@gartangh (Contributor) commented Aug 4, 2020

Make use of conv_bias_act from FluxML/NNlib.jl#228.
Should speed up Convolutional Neural Networks significantly after JuliaGPU/CUDA.jl#321.

@DhairyaLGandhi (Member)

Bump

@DhairyaLGandhi (Member)

bors try

bors bot added a commit that referenced this pull request Nov 2, 2020
@bors (Contributor) commented Nov 2, 2020

try

Build failed:

@DhairyaLGandhi (Member) left a comment

Bump

@@ -144,7 +144,7 @@ function (c::Conv)(x::AbstractArray)
   # ndims(x) == ndims(c.weight)-1 && return squeezebatch(c(reshape(x, size(x)..., 1)))
   σ, b = c.σ, reshape(c.bias, ntuple(_->1, length(c.stride))..., :, 1)
   cdims = DenseConvDims(x, c.weight; stride=c.stride, padding=c.pad, dilation=c.dilation)
-  σ.(conv(x, c.weight, cdims) .+ b)
+  conv_bias_act(x, c.weight, cdims, b, σ)
 end
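
For reference, a minimal CPU-only sketch of the equivalence this change relies on, using the generic conv_bias_act fallback from FluxML/NNlib.jl#228; the shapes and the relu activation are illustrative, not taken from this PR:

using NNlib

x = randn(Float32, 28, 28, 3, 4)   # WHCN input batch
w = randn(Float32, 3, 3, 3, 8)     # 3x3 kernel, 3 -> 8 channels
b = zeros(Float32, 1, 1, 8, 1)     # bias reshaped to broadcast over width, height and batch
cdims = DenseConvDims(x, w)

y_unfused = relu.(conv(x, w, cdims) .+ b)              # old path: conv, add bias, activate
y_fused   = NNlib.conv_bias_act(x, w, cdims, b, relu)  # new path: one fused call
y_unfused ≈ y_fused                                    # true, up to floating-point error

On the CPU the fallback is expected to perform essentially the same three steps, so this only checks correctness; the speedup comes from the CUDA/cuDNN method discussed below.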

@DhairyaLGandhi (Member) commented on the diff, Jan 28, 2021

Suggested change:
-conv_bias_act(x, w, cdims::DenseConvDims, b::Zeros, σ) = σ.(conv(x, w, cdims))
+function conv_bias_act(x::CuArray, w::CuArray{T}, cdims::DenseConvDims, b::Zeros, σ) where T
+    bz = CUDA.zeros(size(b)...)
+    NNlib.conv_bias_act(x, w, cdims, bz, σ)
+end

A Member replied:

We need this materialised here since cuDNN expects to get an actual array, and the performance gain over regular Conv calls is enough to justify allocating some memory and quickly putting it back into the pool. We need those 2x speedups on ResNets!

@DhairyaLGandhi (Member) commented Apr 23, 2021

Use reinterpret(reshape, ...) instead

DhairyaLGandhi mentioned this pull request Jan 29, 2021
DhairyaLGandhi added this to the v0.12 milestone Feb 5, 2021
@CarloLucibello (Member)

Using conv_bias_act is an inner implementation detail that should not block v0.12 (especially given that the author is unresponsive).

@DhairyaLGandhi (Member)

It's not an inner detail: conv being equivalent to the forward pass of Conv is essentially the functional form of the layer. Swapping it out for a different function is a breaking change, since conv and conv_bias_act are not compatible functions. (conv does not involve the bias term, for example; conv_bias_act also goes through a completely different code path on CUDA, with different assumptions about accuracy and about which convolution algorithms can be chosen.)

@CarloLucibello (Member)

> It's not an inner detail: conv being equivalent to the forward pass of Conv is essentially the functional form of the layer.

conv is not equivalent to the forward pass of Conv; Conv also adds a bias and an activation on top of that, which means that Conv is equivalent to conv_bias_act plus keeping an inner state.
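
To make the comparison concrete, a hedged sketch (the layer size, the bias reshape, and the reliance on NNlib's generic conv_bias_act fallback are all illustrative, not part of this PR):

using Flux, NNlib

c = Conv((3, 3), 3 => 8, relu)      # the layer's state: weight, bias and activation σ
x = randn(Float32, 28, 28, 3, 4)

cdims = DenseConvDims(x, c.weight; stride=c.stride, padding=c.pad, dilation=c.dilation)
b = reshape(c.bias, 1, 1, :, 1)     # make the stored bias broadcastable over the conv output

c(x) ≈ NNlib.conv_bias_act(x, c.weight, cdims, b, c.σ)  # same forward pass, up to rounding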

@ToucheSir (Member)

> conv_bias_act also goes through a completely different code path on CUDA, with different assumptions about accuracy and about which convolution algorithms can be chosen.

In that case, we should document these assumptions and the tolerances we're okay with in different scenarios, e.g. is Conv allowed to switch to conv_bias_act when bias is set and we're running in a @fastmath context? Whatever the solution, I'm strongly of the opinion that (a) we should make conv_bias_act easier to use, and (b) defining a separate layer for it is a bad idea.

@DhairyaLGandhi (Member)

Well, composition means that we don't need wrappers around kernels that effectively compute the raison d'être of the layer, without polluting the namespace/API. The API needed to extract all the details of a layer for such a "function" would be pretty ugly. It's also not a big deal for users to broadcast an activation over a conv.

@ToucheSir (Member) commented Jul 8, 2021

AIUI, the whole point of conv_bias_act is that certain runtimes (e.g. cuDNN) provide accelerated (e.g. fused) versions that run faster than the usual act.(conv(x, W) .+ b), which incurs at least 3 dispatches/kernel launches. There may be marginally different results because of how the different implementations work, but that's why I mentioned having something like @fastmath.

If we don't make use of conv_bias_act in Flux itself, I can count on one hand the number of people who would use it. There's very little harm in having it as a fast path of Conv that only triggers under specific circumstances.
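
A hypothetical way to check that claim on a GPU (the sizes, the padding, and the use of BenchmarkTools are illustrative; the CuArray method of conv_bias_act is assumed to be provided by the cuDNN-backed code path discussed above):

using CUDA, NNlib, BenchmarkTools

x = CUDA.randn(Float32, 112, 112, 64, 16)
w = CUDA.randn(Float32, 3, 3, 64, 64)
b = CUDA.zeros(Float32, 1, 1, 64, 1)
cdims = DenseConvDims(x, w; padding=1)

# unfused: conv kernel, broadcast add, broadcast relu (at least three launches)
@btime CUDA.@sync relu.(conv($x, $w, $cdims) .+ $b)

# fused: intended to lower to a single cuDNN conv+bias+activation call
@btime CUDA.@sync NNlib.conv_bias_act($x, $w, $cdims, $b, relu)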

@mcabbott (Member) commented Jul 8, 2021

Seems sensible for the fast way to be the default, no? Flux surely doesn't guarantee that exact floating-point values will be the same between versions, and it's definitely not a library targeting uses which care a lot about the 16th decimal place.

I don't see any discussion of how different this is; if it's more than minor floating-point stuff, can someone summarise and provide links? If it is, e.g., substantially less accurate, then you could argue for some Conv(; precise=true) or some global switch or something.

@DhairyaLGandhi (Member)

> some global switch or something.

CUDA.math_mode, @fastmath, etc.
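
As a hedged sketch of what gating on such a switch could look like (fast_conv_forward and the gating policy are hypothetical, not Flux API; CUDA.math_mode and CUDA.FAST_MATH are assumed from CUDA.jl):

using CUDA, NNlib

# Hypothetical helper: take the fused cuDNN path only when the user has opted
# into fast math globally; otherwise use the conservative unfused path.
function fast_conv_forward(x::CuArray, w::CuArray, b::CuArray, σ, cdims::DenseConvDims)
    if CUDA.math_mode() == CUDA.FAST_MATH
        return NNlib.conv_bias_act(x, w, cdims, b, σ)  # fused; algorithm choice may differ
    else
        return σ.(NNlib.conv(x, w, cdims) .+ b)        # unfused baseline
    end
end

# A user would opt in with: CUDA.math_mode!(CUDA.FAST_MATH)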

darsnack linked an issue Jan 12, 2022 that may be closed by this pull request
darsnack mentioned this pull request Jan 12, 2022
Successfully merging this pull request may close these issues:

Bottleneck in mapreducedim for convolutional layers