which network is used for evaluation? #7

LiangHann · 2020-10-30T23:05:37Z

I am confused by the evaluation code:
    for idx_iteration in range(args.num_generations):
        print(f'starting generation {idx_iteration+1}')
        print('#'*100)
        teacher_net = d_trainer(teacher_net, student_net)
        d_trainer.evaluate(teacher_net)
        teacher_net.teacher_mode()

        student_net = deepcopy(teacher_net)
        saver.save_net(student_net, f'chk_di_{idx_iteration + 1}')

        student_net.reinit_layers(args.reinit_l4, args.reinit_l3)

Do you use the student network or the teacher network for evaluation?


angpo · 2020-10-31T14:01:37Z

We perform the evaluation on the student net: d_trainer(teacher_net, student_net) returns the parameters of the student net. We store the result in a variable called teacher_net because we tried iterating the process (namely, using the student as the teacher for another network). However, since we did not observe any improvement, we simply stopped at one iteration.
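To make the naming concrete, here is a minimal toy sketch of the loop from the question. It is not the repository's code: `ToyNet`, `ToyTrainer`, and `run_generations` are hypothetical stand-ins, and the sketch assumes the trainer hands back the trained student object itself. It only illustrates why the network passed to `evaluate` is the student even though the variable is called `teacher_net`.

```python
from copy import deepcopy


class ToyNet:
    """Hypothetical stand-in for the repository's network class."""

    def __init__(self, name):
        self.name = name
        self.trained_generations = 0


class ToyTrainer:
    """Hypothetical trainer: updates the student using the teacher as a guide."""

    def __init__(self):
        self.evaluated = []  # names of the networks passed to evaluate()

    def __call__(self, teacher_net, student_net):
        # Only the student is updated; the teacher is read, never modified.
        student_net.trained_generations = teacher_net.trained_generations + 1
        return student_net  # assumption: the trained student is returned

    def evaluate(self, net):
        self.evaluated.append(net.name)


def run_generations(num_generations):
    trainer = ToyTrainer()
    teacher_net = ToyNet('teacher_0')
    student_net = ToyNet('student_1')
    for idx in range(num_generations):
        # The returned student is rebound to the name `teacher_net` ...
        teacher_net = trainer(teacher_net, student_net)
        # ... so this call evaluates the freshly trained student.
        trainer.evaluate(teacher_net)
        # A copy of it becomes the starting point for the next student.
        student_net = deepcopy(teacher_net)
        student_net.name = f'student_{idx + 2}'
    return trainer.evaluated


evaluated = run_generations(2)
print(evaluated)  # ['student_1', 'student_2']
```

With a single generation, as used in the final experiments, the only network ever evaluated in this sketch is the first trained student.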

angpo closed this as completed Oct 31, 2020
