Hi, thanks for your great work. When I read the code, I found only a discriminator loss and no generator loss. In other words, there is no adversarial training in MEAL V2, which is different from my intuition. I would like to know what the advantage is of using only the discriminator.
Hi @PeterouZh, the generator loss in our framework is the similarity loss that pushes the student to produce the same distribution as the teachers', i.e., the KL divergence loss. Conventional adversarial training alternates updates between the generator and the discriminator, but since the inputs and outputs of our discriminator and generator (the student) are differentiable soft distributions rather than images, we can train the whole pipeline jointly.
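To illustrate the joint-training idea, here is a minimal PyTorch sketch (not the MEAL V2 code; all module shapes and names here are made up for illustration). The "generator" loss is the KL divergence between the student and teacher distributions, the discriminator scores those distributions, and because every term is differentiable with respect to the student, the losses are summed and optimized with a single backward pass instead of GAN-style alternating updates:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
num_classes = 10

# Stand-ins for the real networks (assumed, for illustration only):
student_head = nn.Linear(32, num_classes)   # student classifier head
discriminator = nn.Linear(num_classes, 1)   # scores a probability vector

features = torch.randn(4, 32)                                   # fake backbone features
teacher_probs = F.softmax(torch.randn(4, num_classes), dim=1)   # fake ensembled teacher output

student_logits = student_head(features)
student_log_probs = F.log_softmax(student_logits, dim=1)

# "Generator" loss: match the teacher distribution via KL divergence.
kl_loss = F.kl_div(student_log_probs, teacher_probs, reduction="batchmean")

# Discriminator loss: teacher distributions labeled real (1),
# student distributions labeled fake (0).
real_score = discriminator(teacher_probs)
fake_score = discriminator(student_log_probs.exp())
disc_loss = (
    F.binary_cross_entropy_with_logits(real_score, torch.ones_like(real_score))
    + F.binary_cross_entropy_with_logits(fake_score, torch.zeros_like(fake_score))
)

# Joint update: one combined loss, one backward pass -- no alternation.
total_loss = kl_loss + disc_loss
total_loss.backward()

# Gradients flow into both the student and the discriminator together.
assert student_head.weight.grad is not None
assert discriminator.weight.grad is not None
```

Because the discriminator operates on soft probability vectors rather than images, its loss is differentiable end-to-end through the student, which is what makes the single combined update possible.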