Sobolev Training with PyTorch

A small-scale replication of Sobolev Training for neural networks.

Overview

You can use the code by importing SobolevLoss from sobolev.py; see the example in main.py for a complete script. The general guideline for distillation is:

import torch.nn as nn

from sobolev import SobolevLoss

teacher = Net()  # Net is your own model class
student = Net()
loss = SobolevLoss(loss=nn.MSELoss(), weight=1.0, order=2)

# Compute the gradients of teacher and student here,
# e.g. by calling .backward() on each network's loss.

sobolev = loss(student.parameters(), teacher.parameters())

# At this point, each student parameter's gradient looks like:
#   s.grad = s.original_grad + s.grad.grad
# where s.grad.grad comes from the Sobolev loss.

Remarks:

  • Make sure your teacher is well-trained.
  • It works well towards the end of distillation.
  • Instead of student.parameters() and teacher.parameters(), you can pass any iterables of parameters whose nth-order gradients have already been computed.
  • Higher orders should work in theory, but they have not been tested. A minimal end-to-end sketch of a distillation step follows this list.
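
For context, here is a minimal sketch of one distillation step built around the API above (see main.py for the repository's actual example). Net, loader, and the optimizer settings are illustrative placeholders and are not taken from this repository.

import torch.nn as nn
import torch.optim as optim

from sobolev import SobolevLoss

teacher = Net()  # assumed pre-trained and converged
student = Net()

criterion = nn.MSELoss()
sobolev_loss = SobolevLoss(loss=nn.MSELoss(), weight=1.0, order=2)
optimizer = optim.SGD(student.parameters(), lr=0.01)

for x, y in loader:  # `loader` is a placeholder DataLoader
    # Populate the teacher's parameter gradients on this batch.
    teacher.zero_grad()
    teacher_out = teacher(x)
    criterion(teacher_out, y).backward()

    # Populate the student's parameter gradients with the distillation loss.
    optimizer.zero_grad()
    student_out = student(x)
    criterion(student_out, teacher_out.detach()).backward()

    # Add the Sobolev (gradient-matching) term to the student's gradients.
    sobolev_loss(student.parameters(), teacher.parameters())

    # The student's .grad now carries both terms; step as usual.
    optimizer.step()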

Benchmark results

The results were obtained by distilling a converged LeNet teacher into a LeNet student with the same architecture but a different random initialization. Results are reported as train / test at the 100th epoch of training.

| Metric        | Vanilla     | Sobolev     |
|---------------|-------------|-------------|
| Distill. Loss | 1.2 / 1.19  | 0.56 / 0.64 |
| Student Loss  | 0.94 / 0.9  | 0.8 / 0.82  |
| Teacher Loss  | 0.7 / 0.72  | 0.7 / 0.72  |
| Sobolev Loss  | n/a         | 2e-4 / 4e-4 |
