Adding some Untrained Models #17

Closed · wants to merge 42 commits

Conversation

@avik-pal (Member)

I have added the VGG nets (with and without BN). The current API update is:
VGG11() fetches VGG11 without pretrained weights, and the model is in train mode.
Calling trained(VGG19) gives the pretrained VGG19 model in test mode. All models that lack pretrained weights will throw an error if called with trained.
@MikeInnes could you review these new changes? I will add the remaining models if these changes are OK.
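A minimal sketch of the interface described above, using a toy TinyVGG stand-in rather than the PR's actual VGG code; the struct name, layer sizes, and error path are illustrative assumptions.

```julia
# Toy sketch of the proposed constructor/trained(...) interface; names and layers
# are placeholders, not this PR's implementation.
using Flux

struct TinyVGG
    layers::Chain
end

# TinyVGG() builds the architecture with untrained (randomly initialized) weights.
TinyVGG() = TinyVGG(Chain(Conv((3, 3), 3 => 8, relu; pad=1), BatchNorm(8)))

# trained(TinyVGG) would return a pretrained model in test mode; models without
# published weights throw an error instead, as described above.
function trained(::Type{TinyVGG})
    error("Pretrained weights for TinyVGG are not available")
end

model = TinyVGG()    # untrained model, train mode
# trained(TinyVGG)   # would throw here, since this toy has no pretrained weights
```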

src/model.jl (outdated review thread)
src/vgg.jl (outdated review thread)
src/vgg.jl (outdated review thread)
@staticfloat (Contributor) left a comment

This looks great! I love it. We should verify some of these implementations by building a training example and training e.g. ResNet34 on ImageNet, both to check our implementations and to show users a worked example, with the understanding that "if you set up ImageNet like this, you can train your own models".
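A hedged sketch of the kind of worked training example being suggested, written against today's Flux API; the tiny model, fake batch, and hyperparameters are placeholders, not an ImageNet pipeline or a real ResNet34.

```julia
# One optimizer step on fake data, as a stand-in for a full ImageNet training loop.
using Flux
using Flux: onehotbatch, logitcrossentropy

model = Chain(Conv((7, 7), 3 => 16, relu; stride=2, pad=3),
              GlobalMeanPool(), Flux.flatten,
              Dense(16 => 1000))

x = rand(Float32, 224, 224, 3, 4)           # fake batch of four 224x224 RGB "images"
y = onehotbatch(rand(1:1000, 4), 1:1000)    # fake 1000-class labels

loss(m, x, y) = logitcrossentropy(m(x), y)

opt_state = Flux.setup(Adam(1e-3), model)
grads = Flux.gradient(m -> loss(m, x, y), model)
Flux.update!(opt_state, model, grads[1])    # a real example would loop over the dataset
```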

src/densenet.jl (outdated review thread)
src/resnet.jl (review thread)
src/resnet.jl (review thread)
@staticfloat (Contributor) left a comment

Whoops, accidental approval. ;)

I have a few comments, and the tests should be fixed. :)

@avik-pal (Member, Author) commented Oct 4, 2018

Should I consider removing some of the tests, since Travis is hitting an OutOfMemory error? I will fix the other conv error; that test follows the old API.

@staticfloat (Contributor)

It's weird that it's running out of memory so soon; that's literally just constructing the very first model, and it's smaller than the models we've been training on in the past. That makes me think there's something extremely memory-inefficient in how you're creating the models or something?!

@avik-pal (Member, Author) commented Oct 4, 2018

Yeah, it's quite strange. The test fails for the batchnorm models. I am not quite sure how to load them in a more memory-efficient way, since the code simply constructs an array of layers and makes a Chain out of it. Even weirder, the test passes for ResNet152 and DenseNet264, which consume much more memory.
I am not sure if batchnorm is somehow at fault, because the DenseNets and ResNets employ batchnorm as well.
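One way to sanity-check whether the BN layers themselves account for the extra memory (a hedged diagnostic suggestion, not code from this PR; the layer sizes are placeholders) is to compare construction-time allocation for the two variants directly:

```julia
# @allocated reports bytes allocated by the wrapped call, so the two numbers can be compared.
using Flux

bytes_with_bn    = @allocated Chain(Conv((3, 3), 3 => 64, relu; pad=1), BatchNorm(64))
bytes_without_bn = @allocated Chain(Conv((3, 3), 3 => 64, relu; pad=1))
println("with BN: ", bytes_with_bn, " bytes; without BN: ", bytes_without_bn, " bytes")
```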

@avik-pal (Member, Author) commented Oct 4, 2018

Looking at the tests, it seems that the previous model stays in memory while I'm trying to generate a new one. I think we should deallocate it before building the next model.

@staticfloat (Contributor)

Ah, I thought it was failing on the untrained model tests; it's the trained model tests. Yes, that makes much more sense. Try putting a call to GC.gc() at the end of every model run?

@staticfloat (Contributor)

Interesting. The OSX builder got through, but the Linux builder didn't.

@staticfloat (Contributor)

Okay, I think what's going on here is that when we call GC.gc() at the end of the loop, we technically still have model in scope, so DenseNet264 can't be freed; then we immediately head off to build VGG11, and DenseNet264 + VGG11 together are too much. Try putting a GC.gc() between the two loops, and also put a whos() in to see what else is using memory.
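A hedged sketch of what that could look like in the test loop; the model constructors and the trivial check below are placeholders for the real test suite, not the PR's tests.

```julia
# Drop the reference and collect between model constructions so the previous model
# can actually be freed before the next one is built.
using Flux

trained_model_ctors = [() -> Chain(Dense(10 => 10, relu)), () -> Chain(Dense(10 => 5))]

for build in trained_model_ctors
    model = build()
    @assert size(model(rand(Float32, 10)), 1) > 0   # stand-in for the real @test
    model = nothing                                 # release the reference held in scope
    GC.gc()                                         # collect before building the next model
end

GC.gc()   # and once more between the trained-model loop and the untrained-model loop
```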

@avik-pal (Member, Author) commented Oct 9, 2018

Also, we should postpone the merge until SqueezeNet and GoogLeNet have the same updated API.

avik-pal changed the title from "[WIP] Adding some Untrained Models" to "Adding some Untrained Models" on Oct 16, 2018
@avik-pal (Member, Author)

Now all the models have a common interface.

@staticfloat (Contributor)

Just as an update on this: I am currently attempting to hack together an allocation profiler so that we can see where the memory is going within Julia. This test suite really should not be using as much memory as it is, and combining that with our GPU memory problems, I think we just need better tools to see what's going on. So I hope, in a few days, to have something hooked up with the Profile module that will let us see something similar to the flame graphs ProfileView spits out, but showing where large objects are being allocated and how long they survive.

src/resnet.jl (outdated review thread)
@avik-pal (Member, Author) commented Apr 1, 2019

@staticfloat any update on the status of the allocation profiler?

@staticfloat (Contributor)

Amazingly enough, YES! I don't have time to apply the memory profiler to this branch immediately, but I hope to in the next week or two. It may shed some light on what's going on here. If someone else wants to do this, I suggest telling the memory profiler to pay attention only to big and std allocations, as anything small enough to fit within the pool is likely not useful to our analysis here.
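For reference, a hedged sketch of how a similar "only look at large allocations" analysis can be done with the allocation profiler that later shipped in Julia itself (Profile.Allocs, Julia 1.8+), rather than the experimental profiler discussed above; the model constructor and the 1 MB threshold are arbitrary placeholders.

```julia
# Record every allocation made while constructing a model, then keep only the big ones.
using Flux, Profile

build_model() = Chain(Conv((3, 3), 3 => 64, relu; pad=1), BatchNorm(64))   # placeholder

Profile.Allocs.clear()
Profile.Allocs.@profile sample_rate=1 build_model()     # sample_rate=1 records everything

results = Profile.Allocs.fetch()
big = filter(a -> a.size > 1_000_000, results.allocs)   # allocations larger than ~1 MB
println(length(big), " allocations above 1 MB out of ", length(results.allocs))
```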
