
Model update: Update the parameter initialization of FC layer of SE-ResNeXt and add a README #825

Merged (20 commits) on Apr 28, 2018

Conversation

@BigFishMaster (Contributor) commented on Apr 10, 2018

fix #826

@BigFishMaster changed the title from "Model update" to "Model update: Update the parameter initialization of FC layer of SE-ResNeXt and add a README" on Apr 10, 2018
The current code reproduces the reported result of SE-ResNeXt-50.
```
# prepare directory
mkdir ILSVRC2012/
tar zxf XXX
tar zxf YYY
```
Collaborator:

It would be better to give the download URL.

```
n01440764/n01440764_13602.JPEG 0
n01440764/n01440764_13625.JPEG 0
...
```
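The list format above is one relative image path and one integer label per line. A minimal parsing sketch (the function name is illustrative, not from the PR):

```python
# minimal sketch: parse an ImageNet-style list file whose lines are
# "<relative/path>.JPEG <integer label>", as in the format shown above
def parse_list(lines):
    pairs = []
    for line in lines:
        path, label = line.split()
        pairs.append((path, int(label)))
    return pairs
```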
Collaborator:

Can these file lists be downloaded from somewhere? For example: https://github.com/BVLC/caffe/blob/master/data/ilsvrc12/get_ilsvrc_aux.sh


```
python train.py --num_layers=50 --batch_size=256 --with_mem_opt=True --parallel_exe=True
```
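The flags in the command above could be parsed roughly as follows (a sketch only; the defaults and the `str2bool` helper are assumptions, not taken from train.py):

```python
import argparse

def str2bool(v):
    # argparse's type=bool treats any non-empty string as True,
    # so convert "True"/"False" strings explicitly
    return str(v).lower() in ('true', '1')

# hypothetical parser for the flags shown in the command above
parser = argparse.ArgumentParser()
parser.add_argument('--num_layers', type=int, default=50)
parser.add_argument('--batch_size', type=int, default=256)
parser.add_argument('--with_mem_opt', type=str2bool, default=False)
parser.add_argument('--parallel_exe', type=str2bool, default=False)
args = parser.parse_args(
    ['--num_layers=50', '--batch_size=256',
     '--with_mem_opt=True', '--parallel_exe=True'])
```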
Collaborator:

It would be better to include the convergence curve.

| - | :-: | :-: | -: |
| SE-ResNeXt-50 | 77.6%/- | 77.71%/93.63% | 77.42%/93.50% |

## Finetune a model
Collaborator:

Need to add usage here to show users how to fine-tune a model.

## Inference

The inference process is conducted after each training epoch.
Collaborator:

Need an infer.py to show users how to run inference.

@qingqing01 (Collaborator) left a comment:

Need scripts: infer.py and eval.py.
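As a hypothetical sketch of the top-1/top-5 metric such an eval.py could report (the README table above uses top1/top5; the function name and shapes here are assumptions):

```python
import numpy as np

def topk_accuracy(probs, labels, k=5):
    # probs: (batch, num_classes) class probabilities; labels: list of ints
    # take the indices of the k largest probabilities per row
    topk = np.argsort(probs, axis=1)[:, -k:]
    # fraction of samples whose true label appears among the top-k predictions
    return float(np.mean([label in row for label, row in zip(labels, topk)]))
```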

```
#if global_step % step_each_epoch == 0:
#    print("epoch={0}, global_step={1},decayed_lr={2} \
#        (step_each_epoch={3})".format( \
#        epoch,global_step,decayed_lr,step_each_epoch))
```
Collaborator:

Remove the unused code.

```
@@ -19,17 +50,28 @@ def conv_bn_layer(input, num_filters, filter_size, stride=1, groups=1,
def squeeze_excitation(input, num_channels, reduction_ratio):
    pool = fluid.layers.pool2d(
        input=input, pool_size=0, pool_type='avg', global_pooling=True)
    ### initializer parameter
    #print >> sys.stderr, "pool shape:", pool.shape
```
Collaborator:

Remove the unused code.

```
param_attr=fluid.param_attr.ParamAttr(
    initializer=fluid.initializer.Uniform(-stdv, stdv)))
#print >> sys.stderr, "squeeze shape:", squeeze.shape
```
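For context, the PR title says the FC layer is initialized uniformly in [-stdv, stdv]. A common way to derive such a bound is from the layer's fan-in; this exact formula is an assumption, not confirmed by the diff:

```python
import math

# hypothetical helper: fan-in-based uniform bound often used for FC layers,
# i.e. stdv = 1 / sqrt(fan_in)
def fc_uniform_bound(fan_in):
    return 1.0 / math.sqrt(fan_in)
```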
Collaborator:

Remove the unused code.

```
scale = fluid.layers.elementwise_mul(x=input, y=excitation, axis=0)
return scale
```
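The squeeze-excitation block assembled across these snippets (global average pool, FC reduce with ReLU, FC expand with sigmoid, channel-wise rescale) can be sketched in plain NumPy; the function name and explicit weight arguments are illustrative, not from the PR:

```python
import numpy as np

def squeeze_excitation_np(x, w1, w2):
    # x: (batch, channels, h, w); w1: (c, c // ratio); w2: (c // ratio, c)
    squeeze = x.mean(axis=(2, 3))                       # global average pool
    hidden = np.maximum(squeeze @ w1, 0)                # FC reduce + ReLU
    excitation = 1.0 / (1.0 + np.exp(-(hidden @ w2)))   # FC expand + sigmoid
    return x * excitation[:, :, None, None]             # channel-wise rescale
```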


```
def shortcut(input, ch_out, stride):
def shortcut_old(input, ch_out, stride):
```
Collaborator:

If shortcut_old is not used, please remove it.

```
else:
    drop = pool
out = fluid.layers.fc(input=drop, size=class_dim, act='softmax')
#print >> sys.stderr, "drop shape:", drop.shape
```
Collaborator:

Remove the unused code.

```
import math


def cosine_decay(learning_rate, step_each_epoch, epochs=120):
```
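The schedule this function likely computes can be sketched as follows; the per-step formula `lr * 0.5 * (1 + cos(pi * epoch / epochs))` is the common cosine-decay definition and an assumption here, not copied from the diff:

```python
import math

# sketch of a cosine learning-rate decay matching the signature above,
# with an extra global_step argument for illustration
def cosine_decay_value(learning_rate, step_each_epoch, epochs, global_step):
    cur_epoch = global_step / float(step_each_epoch)
    return learning_rate * 0.5 * (1 + math.cos(cur_epoch * math.pi / epochs))
```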
Collaborator:

Let's move this function into train.py.

```
@@ -314,12 +327,15 @@ def train_parallel_exe(args,
# layers: 50, 152
layers = args.num_layers
method = train_parallel_exe if args.parallel_exe else train_parallel_do
init_model = args.init_model if args.init_model else None
pretrained_model = args.pretrained_model if args.pretrained_model else None
```
Collaborator:

The learning-rate adjustment strategy above does not apply cosine_decay.

@qingqing01 (Collaborator) left a comment:

Approving for now to make the cloud verification easier, but the documentation and code still need further improvement in follow-ups.

@qingqing01 qingqing01 merged commit f60005c into PaddlePaddle:develop Apr 28, 2018