Decision Genetic Programming

In this project we applied genetic programming to solve OpenAI Gym Environments and compared its performance to RL models.

Paper

The paper with the complete evaluations, results and limitations of this project can be found here.

Quick Start

Installation

git clone git@github.com:AlekseyKorshuk/YaES.git
cd YaES
pip install -r requirements.txt

Dash application

You can easily evaluate any GYM environment with our dash application. Just run the following command and open the link in your browser.

python3 dash_app.py

Demo gym environment

Evaluate PPO, MultiTree and Modi agents on the CartPole-v1 environment.

python3 evaluate.py

Examples

Explanations

Why even try?

In most simple games the mapping from a state to an action can be expressed as closed-form function. It is a natural application of genetic programming and we leverage this technique to find the exact formula.

Single Action Space

Genetic Programming is naturally applicable here. A mathematical formula can be expressed as a tree where root is the result of calculations, internal nodes are operations and terminal nodes are either the input variables (state of the game in our case) or functions without variables such as constants and random number generators.

Picture source: Wikipedia

Decision Making

For binary actions (do or don't do) we make a decision by checking whether the output is greater (do) or less (don't do) than zero. For continuous actions, such as the speed of a car, we return the output as it is.

Fitness Function

We obtain the fitness by taking the reward after running our agents in a Gym.

Mutliple Action Space

Evolution of the usual tree doesn't scale to games with multiple outputs because it returns only single number. For that reason, we implemented modified individuals which return vector of outputs. For discrete games we apply argmax function and return the result as an action. In games with continuous actions we return the result unaltered.

Modi

Source of idea

Files with implementation:

agent/base.py
agent/modi.py

We implemented this idea with a slight modification. The authors of above mentioned paper suggest to add a special node which passes the result of their calculations to the parent (as usual), but also adds this result to the output vector. Each such node has an assigned number which specifies the index to which it will add the result.

Instead, we decided to separate these two functions. We add a special node called 'modi{index}' which passes its input to the parent without changes and adds this input to the output vector. This approach allowed us to simplify the implementation.

Multi-Tree

Source of idea

Files with implementation:

agent/base.py
agent/multi_tree.py

The idea is to create a bag of trees where each one is responsible for specific output index. Thus, for output vector with size N we have N populations. To obtain an action, we take i-th individual from each population, feed them the state of the game and collect outputs.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
dgp		dgp
.gitignore		.gitignore
README.md		README.md
dash_app.py		dash_app.py
evaluate.py		evaluate.py
paper.pdf		paper.pdf
requirements.txt		requirements.txt
visualize_results.py		visualize_results.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Decision Genetic Programming

Paper

Quick Start

Installation

Dash application

Demo gym environment

Examples

Explanations

Single Action Space

Decision Making

Fitness Function

Mutliple Action Space

Modi

Multi-Tree

About

Releases

Packages

Contributors 2

Languages

AlekseyKorshuk/dgp

Folders and files

Latest commit

History

Repository files navigation

Decision Genetic Programming

Paper

Quick Start

Installation

Dash application

Demo gym environment

Examples

Explanations

Single Action Space

Decision Making

Fitness Function

Mutliple Action Space

Modi

Multi-Tree

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages