
Difference on epochs in network-slimming #3

Closed

erichhhhho opened this issue Nov 28, 2018 · 7 comments

erichhhhho commented Nov 28, 2018

Hi, I found that the default number of epochs for ImageNet in network-slimming (scratch training of VGG-11) is 90 in the code, which differs from the 60 epochs used in the original paper.

Eric-mingjie (Owner) commented Nov 28, 2018

Hi, thanks for your interest in our code!

Yes, in the original Network Slimming paper, the number of epochs for ImageNet is 60. In this repo, we use the official PyTorch ImageNet training schedule, which is 90 epochs.
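For reference, a minimal sketch of that schedule, mirroring the adjust_learning_rate helper in the official PyTorch ImageNet example (90 epochs, initial learning rate decayed by 10x every 30 epochs); the base_lr default here is an assumption:

```python
# Standard ImageNet schedule (official PyTorch example): 90 epochs,
# learning rate decayed by a factor of 10 every 30 epochs.
def adjust_learning_rate(optimizer, epoch, base_lr=0.1):
    lr = base_lr * (0.1 ** (epoch // 30))
    for param_group in optimizer.param_groups:
        param_group['lr'] = lr

# Usage in the training loop:
# for epoch in range(90):
#     adjust_learning_rate(optimizer, epoch)
#     train_one_epoch(...)
```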

liuzhuang13 (Collaborator) commented Dec 3, 2018

Hi @erichhhhho, I'm an author of both this paper and the Network Slimming paper.

Using 60 epochs in the original Network Slimming paper was due to resource limits at the time, and the original paper's VGG-11 result on ImageNet was affected by a significant bug (involving the activation functions in the fc layers) that was found later. So in this project we fixed the bug and used 90 epochs (standard in many papers).

erichhhhho (Author)

@Eric-mingjie @liuzhuang13 I see. Thank you for your clarification.

erichhhhho (Author) commented Dec 5, 2018

Btw, there is a bug in network-slimming cifar10 main_B.py (line 102):

if args.refine:
AttributeError: 'Namespace' object has no attribute 'refine'

It looks like args.refine has been renamed to args.scratch in your code, so this leftover reference is both broken and redundant.
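For anyone hitting this before the fix landed, a minimal sketch of the change, assuming the flag was simply renamed (the argparse help text and checkpoint keys are illustrative, not the repo's exact code):

```python
import argparse
import torch

parser = argparse.ArgumentParser()
# The repo now exposes --scratch (path to a pruned checkpoint) instead of --refine.
parser.add_argument('--scratch', default='', type=str,
                    help='path to the pruned model to train from scratch')
args = parser.parse_args()

if args.scratch:  # was: if args.refine:  -> AttributeError
    checkpoint = torch.load(args.scratch)
    # Rebuild the pruned architecture from the saved config before training;
    # the constructor call is illustrative -- match the repo's actual builder.
    # model = vgg(dataset=args.dataset, cfg=checkpoint['cfg'])
```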

erichhhhho reopened this Dec 5, 2018
Eric-mingjie (Owner)

Hi @erichhhhho! Thanks for pointing it out! I just pushed a fix.

huangbiubiu

> Hi @erichhhhho, I'm an author of both this paper and the Network Slimming paper.
>
> Using 60 epochs in the original Network Slimming paper was due to resource limits at the time, and the original paper's VGG-11 result on ImageNet was affected by a significant bug (involving the activation functions in the fc layers) that was found later. So in this project we fixed the bug and used 90 epochs (standard in many papers).

@liuzhuang13 I downloaded the PyTorch scratch-E model from the trained-model link (https://github.com/Eric-mingjie/rethinking-network-pruning/tree/master/imagenet/network-slimming#models) and found that the value of 'epoch' in the model dict is 60 instead of 90. Was the scratch-E model trained for 60 epochs? And what are the epoch settings of the other experiments, e.g., Unpruned in Table 4 of the paper Rethinking the Value of Network Pruning? Thanks!
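For anyone reproducing this check, a minimal sketch of inspecting the checkpoint (the filename is hypothetical; the 'epoch' key follows the usual PyTorch ImageNet checkpoint layout):

```python
import torch

# Hypothetical filename -- substitute the downloaded scratch-E checkpoint.
checkpoint = torch.load('scratch-E.pth.tar', map_location='cpu')
print(list(checkpoint.keys()))  # typically: epoch, state_dict, best_prec1, optimizer, ...
print(checkpoint['epoch'])      # the value observed here was 60
```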

Eric-mingjie (Owner) commented Sep 15, 2019

The model was actually trained for 90 epochs; don't be misled by the 'epoch' value stored in the checkpoint. Standard ImageNet models are trained for 90 epochs.
