Yes, you are right :) I'll reply in English so the discussion may be helpful to everyone.
"how to understand magic_model?" -- It runs a specified model (e.g., BERT) whose parameters are the sum of multiple sets of parameters (e.g., \theta + \theta'(\phi)) without requiring us to rewrite the original forward() function. Our implementation is a bit of a hack on torch.nn.Module. If you have a better way to do it, please let me know.
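For intuition, the same effect can be sketched with `torch.func.functional_call` (available in recent PyTorch), which runs an unmodified module with a substituted parameter dict. This is only an illustrative sketch, not the repo's actual magic_model code; `delta` here stands in for \theta'(\phi):

```python
import torch
import torch.nn as nn
from torch.func import functional_call

torch.manual_seed(0)

# Stand-in for the wrapped model (the repo uses e.g. BERT).
base = nn.Linear(4, 2)

# Hypothetical extra parameter set theta'(phi), same shapes as the base's;
# in the paper this would itself be produced by another network.
delta = {name: torch.zeros_like(p, requires_grad=True)
         for name, p in base.named_parameters()}

def run_with_sum(module, delta, x):
    # Sum the two parameter sets and run the unmodified forward() on them.
    summed = {name: p + delta[name] for name, p in module.named_parameters()}
    return functional_call(module, summed, (x,))

x = torch.randn(3, 4)
out = run_with_sum(base, delta, x)
out.sum().backward()

# Gradients reach both the base parameters and the extra set.
assert base.weight.grad is not None
assert all(d.grad is not None for d in delta.values())
```

With `delta` initialized to zeros, `out` equals `base(x)`, which makes it easy to check that the wrapping itself does not change the forward computation.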
"why can dev_loss.backward() update the Generator weight?" -- In the code, gradients propagate along the route: dev_loss (classifier.py::Line154) -> deltas (Line 147) -> grads (Line 140) -> aug_probs (Line 119) -> generator parameters (via gumbel_softmax, generator.py::Line104). For the correspondence between our paper and code, please refer to this.
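The key step in that route is that gumbel_softmax keeps the chain from the loss back to the generator differentiable. A minimal sketch of the same mechanism (module names and shapes here are illustrative, not the repo's):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical stand-ins: a generator producing augmentation logits,
# and a downstream classifier consumed by dev_loss.
generator = nn.Linear(8, 5)
classifier = nn.Linear(5, 2)

x = torch.randn(2, 8)
logits = generator(x)

# Soft Gumbel-Softmax samples are differentiable, so the computation
# graph from the loss back to the generator stays connected.
aug_probs = F.gumbel_softmax(logits, tau=1.0, hard=False)
loss = classifier(aug_probs).sum()
loss.backward()

# Because of that connected graph, backward() populates generator grads.
assert generator.weight.grad is not None
assert generator.weight.grad.abs().sum() > 0
```

If one instead sampled discrete tokens with a non-differentiable argmax, the graph would be cut and the generator would receive no gradient; the straight-through variant (`hard=True`) preserves the gradient path while emitting one-hot samples.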
Sorry, I don't quite understand your question "这里是用到了前一次classifer到梯度了吗" ("Is the gradient of the previous classifier used here?"). Could you describe it in more detail?
how to understand magic_model
why dev_loss.backward() at augmentation.classifier line 158 can update the Generator weight