OTO (Online Training Optimazation)

Installation

To use OTO the standard libraries given with the Anaconda Distribution are necessary.
Furthermore you need to install the PyTorch Framework as described here.

Project Motivation

Training a Neural Network with static Hyperparameter Configurations can be very time consuming and might don't even result in good accuracies.
The idea behind OTO is to actually give the Network itself the oppurtinity to decide when to change the Hyperparameters.
We believe that giving the Network a variety of Hyperparameters while training it is possible to reduce the error and enhance the training time.

Description

To get a benchmark we decided to build a very simple CNN Architecture and the Dataset we were using is the already preprocessed CIFAR10 coming together with the PyTorch Framework. Simply because it is less time consuming and the results can still be compared!
We wanted to compare our method with the static Hyperparameter Configurations you get with Grid- and Random Search.
Learning Rate is the only Hyperparameter we wanted to change dynamically since it is possible to expand the idea if the results seem good.

We split the Dataset into Training and Validation Data and wrote a simple loop to train different models with Grid and Random Search. For comparison we saved the results of Training Accuracy, Validation Accuracy and the error into a csv file.

After getting the results for Grid and Random the trainingsloop has been expanded. The Network got a list of different Learning Rates between 0.1 and 0.0001.
With this pool of different Learning Rates the same amount of Models were trained for 2 Epochs. After that the different models are validated. Picking the best one as new starting point it is trained again with all the different Learning Rates, deciding again which is the best model after 2 Epochs and so on... Iterating to the best result.

Results

Comparing the different methods we can say that we got a better accuracy with OTO. But not only better results with the Validation we also were able to reduce overfitting which is probably because the Network is validated every two epochs and the Learning Rate can be corrected in time.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
OTO_dynamic.py		OTO_dynamic.py
README.md		README.md
rand_grid.py		rand_grid.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OTO (Online Training Optimazation)

Table of Contents

Installation

Project Motivation

Description

Results

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OTO (Online Training Optimazation)

Table of Contents

Installation

Project Motivation

Description

Results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages