
Is it necessary to rebuild the model every train iteration? #1

Closed
pangyyyyy opened this issue Apr 11, 2022 · 4 comments

Comments


pangyyyyy commented Apr 11, 2022

Hi, thanks for the great work!

I noticed that you rebuild the meta-model on every iteration (L129), and I was wondering whether that is necessary.

RobustMW-Net/trainer.py, lines 122 to 130 at commit cabea1f:

```python
for iters in range(args.iters):
    adjust_learning_rate(optimizer_model, iters + 1)
    model.train()
    input, target = next(iter(train_loader))
    input_var = to_var(input, requires_grad=False)
    target_var = to_var(target, requires_grad=False)

    meta_model = build_model()                      # L129: meta-model rebuilt every iteration
    meta_model.load_state_dict(model.state_dict())  # L130: weights synced from the base model
```

Would there be any difference or negative impact if I built the meta-model once before the loop and just reloaded the model's state_dict every iteration (L130) instead?
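Concretely, this is the rearrangement I have in mind (just a sketch based on the snippet above, not tested code):

```python
meta_model = build_model()  # built once, before the training loop

for iters in range(args.iters):
    adjust_learning_rate(optimizer_model, iters + 1)
    model.train()
    input, target = next(iter(train_loader))
    input_var = to_var(input, requires_grad=False)
    target_var = to_var(target, requires_grad=False)

    # Reuse the same meta-model object; only re-sync its weights.
    meta_model.load_state_dict(model.state_dict())
```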


arghosh (Owner) commented Apr 11, 2022

Hi. Good catch. I think it should be the same, and faster. But make sure gradients are not accumulated.
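For example (a sketch of what I mean; `load_state_dict` copies the weights but leaves any stale `.grad` buffers from the previous iteration untouched):

```python
meta_model.load_state_dict(model.state_dict())
meta_model.zero_grad()  # clear accumulated gradients before the meta step
# or, to drop the buffers entirely:
for p in meta_model.parameters():
    p.grad = None
```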
This code follows the original Meta-Weight-Net implementation more closely. You can check my other repo, where I simplified the MWNet implementation using the higher package.
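For reference, the higher-based pattern looks roughly like this (a hedged sketch of the general idea, not the actual code from my other repo; `weight_net`, `meta_input`, and `meta_target` are placeholder names):

```python
import higher
import torch
import torch.nn.functional as F

inner_opt = torch.optim.SGD(model.parameters(), lr=args.lr)
with higher.innerloop_ctx(model, inner_opt) as (fmodel, diffopt):
    # fmodel is a differentiable functional copy of model, so no manual
    # build_model() / load_state_dict() is needed each iteration.
    cost = F.cross_entropy(fmodel(input_var), target_var, reduction='none')
    v_lambda = weight_net(cost.detach().unsqueeze(1))  # per-sample weights
    diffopt.step((v_lambda.squeeze(1) * cost).mean())  # differentiable SGD step
    # The meta (validation) loss backpropagates through that step into
    # weight_net's parameters:
    meta_loss = F.cross_entropy(fmodel(meta_input), meta_target)
    meta_loss.backward()
```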


pangyyyyy commented Apr 11, 2022

@arghosh Thanks for the clarification! Your implementation using the higher package seems rather neat; does it support distributed training?


arghosh commented Apr 11, 2022

My code does not support distributed training, and I don't think higher supports data parallelism. But you could pass the meta batch to different GPUs, do the local meta step, and compute the base-model gradients on each node; after that, DDP can handle the synchronization, I guess.
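Something along these lines, perhaps (an untested sketch of that idea, not code from either repo; `local_rank`, `weights`, and `per_sample_loss` are placeholders):

```python
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

dist.init_process_group("nccl")
ddp_model = DDP(model.cuda(local_rank), device_ids=[local_rank])

# On each rank, inside the training loop:
# 1) run the local meta step on this rank's shard of the meta batch
#    (the meta_model / weight_net update, as in trainer.py);
# 2) compute the weighted base-model loss -- backward() on the DDP-wrapped
#    model then triggers the gradient all-reduce across ranks.
loss = (weights.detach() * per_sample_loss).mean()
optimizer_model.zero_grad()
loss.backward()
optimizer_model.step()
```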

pangyyyyy commented

@arghosh Thanks for the clarification!
