I'm really confused with the v2.0 meta.py #14

Closed

yaox12 opened this issue Dec 19, 2018 · 3 comments

Comments

yaox12 commented Dec 19, 2018

  • First, the comment says the index of losses_q is the task index:

    losses_q = [0 for _ in range(self.update_step + 1)]  # losses_q[i], i is tasks idx

    However, within each task i, the whole list is updated (a toy sketch of this bookkeeping follows the list):

    losses_q[0] += loss_q
    losses_q[1] += loss_q
    losses_q[k + 1] += loss_q

  • Second, I don't see where loss_q is actually summed over all tasks.

    MAML-Pytorch/meta.py

    Lines 134 to 135 in fc20b31

    # sum over all losses on query set across all tasks
    loss_q = losses_q[-1] / task_num

    losses_q[-1] seems to be the last step's loss for the last task only.

  • Third, if update_step == 1, there is only one inner update. However, the loss after the first update is computed under torch.no_grad(), so I think no gradient information from the query set can flow back in that setting.

    MAML-Pytorch/meta.py

    Lines 100 to 109 in fc20b31

    # this is the loss and accuracy after the first update
    with torch.no_grad():
        # [setsz, nway]
        logits_q = self.net(x_qry[i], fast_weights, bn_training=True)
        loss_q = F.cross_entropy(logits_q, y_qry[i])
        losses_q[1] += loss_q
        # [setsz]
        pred_q = F.softmax(logits_q, dim=1).argmax(dim=1)
        correct = torch.eq(pred_q, y_qry[i]).sum().item()
        corrects[1] = corrects[1] + correct
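For reference, here is a minimal, self-contained toy sketch of the bookkeeping pattern the comments above seem to intend: losses_q is indexed by inner-update step, each entry is summed over tasks, and the meta-loss is the last entry averaged over task_num. This is not the repo's code; the linear model w, the shapes, the learning rate, and the random data are made up purely for illustration.

```python
import torch
import torch.nn.functional as F

# Toy sketch of the intended bookkeeping (NOT the repo's code):
# losses_q[k] = query-set loss after k inner updates, summed over all tasks.
update_step, task_num, update_lr = 3, 4, 0.01
w = torch.randn(5, 2, requires_grad=True)           # stand-in for self.net's parameters

losses_q = [0.0 for _ in range(update_step + 1)]    # indexed by update step, NOT by task

for i in range(task_num):
    x_spt, y_spt = torch.randn(10, 5), torch.randint(0, 2, (10,))
    x_qry, y_qry = torch.randn(10, 5), torch.randint(0, 2, (10,))
    fast_w = w
    losses_q[0] += F.cross_entropy(x_qry @ fast_w, y_qry)            # before any update
    for k in range(update_step):
        loss = F.cross_entropy(x_spt @ fast_w, y_spt)                # support-set loss
        (grad,) = torch.autograd.grad(loss, fast_w, create_graph=True)
        fast_w = fast_w - update_lr * grad                           # inner update
        losses_q[k + 1] += F.cross_entropy(x_qry @ fast_w, y_qry)    # accumulate over tasks

# meta-loss: the last step's accumulated query loss, averaged over tasks
loss_q = losses_q[-1] / task_num
loss_q.backward()                                    # gradients flow back to w
```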

dragen1860 (Owner) commented:

  1. The index of losses_q is NOT the task index but the update-step index. Sorry for the wrong comment.

  2. loss_q is accumulated over all tasks at every inner-update step, hence it only needs to be averaged at the end.
     losses_q[-1] is the last step's accumulated loss.

  3. Yes, you are right. For the single-step update setting, the line loss_q = F.cross_entropy(logits_q, y_qry[i]) should be moved out of torch.no_grad() (a sketch of this fix follows below).
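As a concrete illustration of point 3, a possible fix is to compute the query loss outside torch.no_grad() and keep only the accuracy bookkeeping under it. This is just a sketch, not a tested patch; it reuses the names (self.net, fast_weights, x_qry, y_qry, losses_q, corrects) from the snippet quoted in the issue.

```python
# Sketch of the single-step fix: keep loss_q differentiable; only the
# accuracy bookkeeping stays under no_grad (names as in the quoted snippet).
logits_q = self.net(x_qry[i], fast_weights, bn_training=True)  # [setsz, nway]
loss_q = F.cross_entropy(logits_q, y_qry[i])                   # now carries gradients
losses_q[1] += loss_q

with torch.no_grad():
    # [setsz]
    pred_q = F.softmax(logits_q, dim=1).argmax(dim=1)
    correct = torch.eq(pred_q, y_qry[i]).sum().item()
    corrects[1] = corrects[1] + correct
```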

yaox12 commented Dec 19, 2018

At the end of each task i, loss_q is appended to the list losses_q:

MAML-Pytorch/meta.py

Lines 130 to 131 in fc20b31

# 4. record last step's loss for task i
losses_q.append(loss_q)

So the accumulated last step's loss should be losses_q[self.update_step] instead of losses_q[-1], because by the end the length of losses_q is update_step + 1 + task_num (see the toy illustration below).
In fact, I think the two lines above are redundant and useless.
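A tiny numeric illustration of this indexing problem (toy numbers only; update_step = 5 and task_num = 4 are made up, and a constant 1.0 stands in for each task's final query loss):

```python
# Toy illustration: the append at lines 130-131 makes losses_q[-1] point to
# the last TASK's loss rather than the accumulated last-STEP loss.
update_step, task_num = 5, 4
losses_q = [0.0 for _ in range(update_step + 1)]     # length 6, indexed by update step

for i in range(task_num):
    last_step_loss = 1.0                              # stand-in for task i's final query loss
    losses_q[update_step] += last_step_loss           # accumulated across tasks
    losses_q.append(last_step_loss)                   # what lines 130-131 do

print(len(losses_q))             # 10 == update_step + 1 + task_num
print(losses_q[-1])              # 1.0 -> only the last task's last-step loss
print(losses_q[update_step])     # 4.0 -> the accumulated last-step loss across tasks
```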

dragen1860 (Owner) commented:

Yes, it's a bug!
Thanks for your very helpful insight.
Remove lines 130 & 131!
