Knowledge Distillation (KD) is a simple way to compress a model while preserving the original model's performance. This repository provides an implementation of the paper "Distilling the Knowledge in a Neural Network" with some changes. Please check the references for detailed explanations.
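For quick orientation, the distillation objective combines a soft-target term (teacher and student logits softened with a temperature `T`) with the usual hard-label cross entropy. Below is a minimal sketch of that idea; the function name `kd_loss` and the default values of `T` and `alpha` are illustrative assumptions, not necessarily the exact code in `models/Loss.py`.

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Hinton-style distillation loss: alpha * soft-target KL + (1 - alpha) * hard-label CE.

    T and alpha are illustrative defaults, not values taken from this repository.
    """
    # Soften both distributions with temperature T and compare them with KL divergence.
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)  # rescale, as suggested in the paper, so gradients keep the same magnitude
    # Standard cross entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, labels)
    return alpha * soft_loss + (1.0 - alpha) * hard_loss
```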
- Python >= 3.6
- PyTorch >= 1.3
.
|--experiments (scripts to run and the training/test results)
|--train_net.py (main solver to train the model)
|--data/
| |--data_loader.py (Data queue module)
|--models/
| |--Loss.py (Loss functions that are used in this project)
|--engine/
| |--trainer.py
| |--inference.py
| |--solver.py
|--utils/
| |--logger.py (module for visualizing images and training plots)
| |--checkpointer.py
| |--measure.py
Make sure that all directories and files are located as shown above.
- Check that all requirements are installed before running
- Additional features to improve the baseline (Hinton et al., 2015)
- Train teachers
I added two scripts: one for training the teacher model and another for training the student using the teacher.
If you have your own model, specify your teacher model in models/ and build_model, and set the arguments correctly before running.
$ ./experiments/exp1/train.sh
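Under the hood, the student-training script runs a distillation loop along these lines. This is only a hedged sketch: the function name, the hyperparameters `T` and `alpha`, and the loop structure are assumptions for illustration; the actual code lives in `engine/trainer.py` and `models/Loss.py`.

```python
import torch
import torch.nn.functional as F

def train_student_one_epoch(student, teacher, loader, optimizer, device, T=4.0, alpha=0.9):
    """One epoch of distillation: the frozen teacher supplies soft targets for the student.

    Hypothetical sketch, not the repository's exact trainer code.
    """
    teacher.eval()    # the teacher is frozen and only provides soft targets
    student.train()
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        with torch.no_grad():
            teacher_logits = teacher(images)
        student_logits = student(images)
        # Soft-target KL term plus hard-label cross entropy, as in the loss sketch above.
        soft = F.kl_div(
            F.log_softmax(student_logits / T, dim=1),
            F.softmax(teacher_logits / T, dim=1),
            reduction="batchmean",
        ) * (T * T)
        hard = F.cross_entropy(student_logits, labels)
        loss = alpha * soft + (1.0 - alpha) * hard
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```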
A results table will appear.
$ tensorboard --logdir=experiments/ --port=6666
then open 'localhost:6666' in a web browser to see the accuracy and loss graphs.
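The curves shown in TensorBoard come from scalar logging along the lines of the sketch below. The tag names and log directory here are assumptions for illustration; the repository's own logging lives in `utils/logger.py`.

```python
from torch.utils.tensorboard import SummaryWriter

# Hypothetical logging sketch; tag names and the log directory are illustrative.
writer = SummaryWriter(log_dir="experiments/exp1")

def log_step(step, train_loss, val_accuracy):
    """Write one point of the loss and accuracy curves displayed in TensorBoard."""
    writer.add_scalar("train/loss", train_loss, global_step=step)
    writer.add_scalar("val/accuracy", val_accuracy, global_step=step)
```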
- "Distilling the Knowledge in a Neural Network"
- "Python implementaion of hinton's KD"
- I refered CS230 report to understand loss function(Cross entropy + KL_div)