训练速度过慢的问题？ #14

FLHonker · 2019-11-21T12:54:05Z

MNIST从原本的10epcoh在DAFL框架下要训练200epoch，CIFAR-10要训练2000epoch。而且GAN的训练也耗时，经过实验比标准KD训练时间长了20-30倍。

MingSun-Tse · 2019-11-21T20:11:24Z

(I am not among the authors. Just for discussion.) I think this could be normal. One possible reason is that the data is not real. The information per sample can be limited, so basically the student network needs to see many more samples than the training on real data.

FLHonker · 2019-11-22T01:21:53Z

But I think this speed is unacceptable in actual use. Moreover, only small data sets are used in the experiment. If semantic segmentation, imagenet, and high-resolution images tasks are used, the computational complexity is very large. It is estimated that GAN cannot reasonably infer the distribution equivalent to real data.

FLHonker · 2019-11-22T01:26:12Z

(I am not among the authors. Just for discussion.) I think this could be normal. One possible reason is that the data is not real. The information per sample can be limited, so basically the student network needs to see many more samples than the training on real data.

欢迎star我的仓库一起交流KD：https://github.com/FLHonker/Awesome-Knowledge-Distillation

MingSun-Tse · 2019-11-22T02:02:22Z

But I think this speed is unacceptable in actual use. Moreover, only small data sets are used in the experiment. If semantic segmentation, imagenet, and high-resolution images tasks are used, the computational complexity is very large. It is estimated that GAN cannot reasonably infer the distribution equivalent to real data.

Yeah, you've made a point. ImageNet would be substantially harder. It definitely has a long road before practical use. But Rome is not built in one day. I think this paper can be a good start.

FLHonker · 2019-11-22T03:38:28Z

But I think this speed is unacceptable in actual use. Moreover, only small data sets are used in the experiment. If semantic segmentation, imagenet, and high-resolution images tasks are used, the computational complexity is very large. It is estimated that GAN cannot reasonably infer the distribution equivalent to real data.

Yeah, you've made a point. ImageNet would be substantially harder. It definitely has a long road before practical use. But Rome is not built in one day. I think this paper can be a good start.

我也一直试图改进这个问题，除非抛弃GAN，GAN的训练是个痛点。data-free是个很有意思的topic。

HantingChen · 2019-11-25T02:05:27Z

Thanks for MingSun-Tse's answer. That's right. We will develop a more efficient data-free learning method in the future work.

FLHonker closed this as completed Sep 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

训练速度过慢的问题？ #14

训练速度过慢的问题？ #14

FLHonker commented Nov 21, 2019

MingSun-Tse commented Nov 21, 2019

FLHonker commented Nov 22, 2019

FLHonker commented Nov 22, 2019

MingSun-Tse commented Nov 22, 2019

FLHonker commented Nov 22, 2019

HantingChen commented Nov 25, 2019

训练速度过慢的问题？ #14

训练速度过慢的问题？ #14

Comments

FLHonker commented Nov 21, 2019

MingSun-Tse commented Nov 21, 2019

FLHonker commented Nov 22, 2019

FLHonker commented Nov 22, 2019

MingSun-Tse commented Nov 22, 2019

FLHonker commented Nov 22, 2019

HantingChen commented Nov 25, 2019