
Implementation of Inceptionv4, InceptionResNetv2 and Xception #170

Merged: 12 commits from the inception-plus branch merged into FluxML:master on Jun 19, 2022

Conversation

@theabhirath (Member) commented on Jun 17, 2022:

This PR is my first official contribution as a GSoC student 😁. It does a couple of things:

  1. It adds new models (Inceptionv4 and InceptionResNetv2). Internal docs for the layers of these models are left out because they are repetitive and don't really explain much - but if that's a concern I can always fill them in.
  2. It deprecates Inception3 in favour of Inceptionv3, since the vX notation is already used for models in other libraries (as well as in Metalhead itself for MobileNetv3), so it makes sense to be explicit about the version. A sketch of such a deprecation is shown after this list.
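
For illustration, here is a minimal sketch of how a rename like this could be kept backwards-compatible using Base's @deprecate macro. This is an assumed form, not necessarily the exact shim used in the PR:

# Hedged sketch: forward the old constructor name to the new one and emit a
# deprecation warning whenever the old name is called.
@deprecate Inception3(args...; kwargs...) Inceptionv3(args...; kwargs...)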

Other things I happened to fix along the way:

  1. cat_channels is now type-stable, since it uses Val(3) for dims (thanks to @ToucheSir for pointing that out in some conversations); see the sketch after this list.
  2. Catches some formatting issues missed in Hotfix for ViT on GPU #169.
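
As a rough sketch of the type-stability point in item 1 (assumed definition; the actual code lives in Metalhead): passing the concatenation dimension as Val(3) makes it a compile-time constant, whereas a plain runtime integer does not.

# Concatenate feature maps along the channel dimension (dim 3 of a WHCN array).
# With Val(3) the dimension is known to the compiler, so `cat` infers a concrete
# return type; a plain `dims = 3` would leave it as a runtime value.
cat_channels(xs...) = cat(xs...; dims = Val(3))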

Notes:

  1. TTFG (time to first gradient) for the Inception models is off the charts. In particular, InceptionResNetv2 is absolutely insane, sometimes taking minutes. Subsequent gradients are quite slow as well (in comparison, ViTs, which are heavier models, seem to be faster) and also take a lot of memory. This might be helped by using Chain(::Vector), but I haven't tried that yet because I'm not sure whether it causes stability issues while training; a sketch of the idea follows this list.
  2. Closes (supersedes) Implementation of Inceptionv4 #131. It should also close Implementation of Inception v4 and Inception resnet v2 #129, since that one hasn't seen activity for quite some time.
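
For reference, a minimal sketch of the Chain(::Vector) idea from note 1 (illustrative layers only, not this PR's models): building a Chain from a Vector stores the layers without specializing on each one, which trades some runtime speed for much less compilation work than the usual Tuple-backed Chain.

using Flux

# Illustrative layers only; any sequence of Flux layers would do.
layers = [Conv((3, 3), 3 => 16, relu; pad = 1),
          MaxPool((2, 2)),
          Conv((3, 3), 16 => 32, relu; pad = 1)]

m_tuple  = Chain(layers...)  # layers stored as a Tuple: fully specialized, long compile times
m_vector = Chain(layers)     # layers stored as a Vector: less specialization, faster TTFG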

Edit: added the Xception model as well

Commits added:
- Transition to `Inceptionv3` instead of `Inception3` for standardisation of names
- Formatting
- Loosen type constraints on `inputscale`
- Updated README
@darsnack (Member) left a review comment:


Looks nearly ready right off the bat, nice job! Just a couple initial comments.

@theabhirath (Member, Author):

CI times are through the roof... we might have to disable gradtests again (or maybe enable them only for select models).

@theabhirath (Member, Author):


...how on earth are the CI times for the second run so much lower than the first?

@ToucheSir (Member):

As I understand it, GitHub Actions runners are containerized but not otherwise isolated, so times can vary drastically depending on what else is running on the host.

@theabhirath theabhirath changed the title Implementation of Inceptionv4 and InceptionResNetv2 Implementation of Inceptionv4, InceptionResNetv2 and Xception Jun 18, 2022
@theabhirath (Member, Author) commented on Jun 18, 2022:

Xception seems to have an even worse gradient benchmark:

julia> using Metalhead, Zygote, BenchmarkTools

julia> x = rand(Float32, 299, 299, 3, 2);

julia> model = Xception();

julia> @benchmark Zygote.gradient(p -> sum($model(p)), $x)
BenchmarkTools.Trial: 1 sample with 1 evaluation.
 Single result which took 9.255 s (78.29% GC) to evaluate,
 with a memory estimate of 10.65 GiB, over 1553316 allocations.

Commits added:
- Support `pretrain` for the Inception model APIs
- Group deprecations in a single source file to keep things more organised
- Random formatting nitpicks
- Use a plain broadcast instead of `applyactivation`
@theabhirath (Member, Author):

This should be ready now, although I'm not sure what exactly to make of the gradient times. I suppose they shouldn't block this PR, but is there anything that might be contributing to them that I could change?

@theabhirath (Member, Author):

The memory problems return 😬

@darsnack (Member):

Not sure there's much we can do beyond what's already been tried in Metalhead. We just need to do something about applychain in Flux/Zygote.

@darsnack (Member) left a review comment:


Only some minor doc stuff

push!(layers, x -> relu.(x))
append!(layers,
        depthwise_sep_conv_bn((3, 3), inc, outc; pad = 1, bias = false,
                              use_bn = (false, false)))
Review comment (Member):

Is there a use case for (true, false) or (false, true)? If not, I was thinking it is kinda silly to use conv_bn with the keyword use_bn = false, because that's just a Conv. It makes sense for consistency with depthwise_sep_conv_bn, but it might be cleaner to just introduce a depthwise_sep_conv instead of adding more keywords in multiple places?

Reply (Member Author):

I can't say I see one yet, but I think this can be kept for some time... with all the churn over the next two months it might come in handy. If not, I'll simplify it later.
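
For context on the suggestion above, a depthwise-separable convolution without batchnorm reduces to two plain Conv layers. A minimal sketch of a hypothetical depthwise_sep_conv helper (not Metalhead's actual API) could look like:

using Flux

# Hypothetical helper: a depthwise Conv (groups = number of input channels)
# followed by a 1x1 pointwise Conv, with no batchnorm in between.
depthwise_sep_conv(kernel, inch, outch; pad = 0, stride = 1) =
    Chain(Conv(kernel, inch => inch; groups = inch, pad, stride, bias = false),
          Conv((1, 1), inch => outch; bias = false))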

@darsnack darsnack mentioned this pull request Jun 18, 2022
theabhirath and others added 2 commits June 18, 2022 22:56
Co-authored-by: Kyle Daruwalla <daruwalla.k.public@icloud.com>
Co-authored-by: Kyle Daruwalla <daruwalla.k.public@icloud.com>
@theabhirath (Member, Author):

Ah sorry, I keep missing these minor doc tweaks 🤦🏽 I'll be a bit more thorough next time

@darsnack (Member) left a review comment:


No problem! Thanks and nice job!

@darsnack darsnack merged commit 93ce7e5 into FluxML:master Jun 19, 2022
@theabhirath theabhirath deleted the inception-plus branch June 19, 2022 03:27
@darsnack darsnack mentioned this pull request Jun 19, 2022
@ToucheSir ToucheSir mentioned this pull request Jun 19, 2022
@theabhirath theabhirath added the new-model (Request or implementation of a new model) label on Aug 29, 2022