which network is used for evaluation? #7

Closed
LiangHann opened this issue Oct 30, 2020 · 1 comment

Comments

@LiangHann

I am confused by the evaluation code:
for idx_iteration in range(args.num_generations):
    print(f'starting generation {idx_iteration+1}')
    print('#'*100)
    teacher_net = d_trainer(teacher_net, student_net)
    d_trainer.evaluate(teacher_net)
    teacher_net.teacher_mode()

    student_net = deepcopy(teacher_net)
    saver.save_net(student_net, f'chk_di_{idx_iteration + 1}')

    student_net.reinit_layers(args.reinit_l4, args.reinit_l3)

Do you use the student network or the teacher network for evaluation?

@angpo
Collaborator

angpo commented Oct 31, 2020

We perform the evaluation on the student net: d_trainer(teacher_net, student_net) returns the trained student net. We store the result in a variable called teacher_net because we tried iterating the process (namely, using the trained student as the teacher for another network). However, as we did not observe any improvement, we simply stopped at one iteration.
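To make the naming concrete, here is a minimal toy sketch of the loop. It is not the repository's code: `ToyNet`, `ToyTrainer`, and `run` are hypothetical stand-ins, and the behaviour of `teacher_mode` and `reinit_layers` is assumed purely for illustration. It only shows that whatever the trainer returns (the trained student) is what ends up bound to `teacher_net` and then evaluated.

```python
from copy import deepcopy


class ToyNet:
    """Hypothetical stand-in for the repository's network class."""

    def __init__(self, name):
        self.name = name
        self.frozen = False

    def teacher_mode(self):
        # Assumption: teacher mode freezes the network so it is not updated.
        self.frozen = True

    def reinit_layers(self):
        # Assumption: re-initialising layers yields a fresh student to train.
        self.name = f"{self.name}-reinit"
        self.frozen = False


class ToyTrainer:
    """Hypothetical trainer that mimics the calling convention of d_trainer."""

    def __call__(self, teacher_net, student_net):
        # Distillation would happen here; the trained *student* is returned.
        student_net.name = f"student-of({teacher_net.name})"
        return student_net

    def evaluate(self, net):
        # Report which network was actually evaluated.
        return net.name


def run(num_generations=1):
    trainer = ToyTrainer()
    teacher_net = ToyNet("teacher-0")
    teacher_net.teacher_mode()
    student_net = deepcopy(teacher_net)
    student_net.reinit_layers()

    evaluated = []
    for _ in range(num_generations):
        # The returned student is rebound to the name teacher_net ...
        teacher_net = trainer(teacher_net, student_net)
        # ... so this call evaluates the freshly trained student.
        evaluated.append(trainer.evaluate(teacher_net))
        teacher_net.teacher_mode()
        # The trained student becomes the teacher of the next generation.
        student_net = deepcopy(teacher_net)
        student_net.reinit_layers()
    return evaluated


if __name__ == "__main__":
    print(run(num_generations=2))
```

With a single generation, as described in the answer above, the only network evaluated is the student trained from the original teacher; further generations would evaluate students of students.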

@angpo angpo closed this as completed Oct 31, 2020