Skip to content

Asynchronous optimisation algorithm to optimise a population of models and their hyperparameters.

Notifications You must be signed in to change notification settings

saschaschramm/PopulationBasedTraining

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Population Based Training

Introduction

This project is an implementation of Population Based Training. Population Based Training is an asynchronous optimisation algorithm to optimise a population of models and their hyperparameters. The implementation is based on: https://arxiv.org/abs/1711.09846

The project consists of two parts:

  1. Application of Population Based Training on a toy example
  2. Combining the Reinforce algorithm with Population Based Training

Toy Example

The objective is to maximize the function

   1.2 - (pow(param_0,2) + pow(param_1,2))

We assume that we can't observe this function. Instead we maximize a surrogate function

   1.2 - (hyperparam_0 * pow(param_0,2) + hyperparam_1 * pow(param_1,2))

alt text

alt text

alt text

alt text

Reinforce

Performance of the Reinforce algorithm with Population Based Training (5 Workers):

alt text

About

Asynchronous optimisation algorithm to optimise a population of models and their hyperparameters.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages