Learning MDP domain with Q-learning function agents

Overview

Domain

'Mountain car' environment from GYM library.

A car is on a one-dimensional track, positioned between two "mountains". The goal is to drive up the mountain on the right; however, the car’s engine is not strong enough to scale the mountain in a single pass. Therefore, the only way to succeed is to drive back and forth to build up momentum.

Agents

Q-table agent

To determine value of Q-function for every state-action pair agent use a table of a finite size.
Since domain state observation consist of 2 continuous variables observation, received from domain, discretized into natural numeric value in range [0, 40].
In such a way Q-values table has resulting size of 40x40x3 of floating point values.

Multi Layer Perceptron agent

DQN agent:

Input layer - 2x6
Output layer - 6x3
Sigmoid activation function after final layer
Argmax function over output vector to determine optimal policy.

Run

Install dependencies
```
 pip install -r requirements.txt
```
Run training. Use --r_train 0 to run without rendering
```
 python main.py --agent mlp
```

For a complete overview of the supported global flags, use main --help.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
report		report
.gitignore		.gitignore
README.md		README.md
_policies.py		_policies.py
academy.py		academy.py
environment.py		environment.py
main.py		main.py
plot.py		plot.py
requirements.txt		requirements.txt
run_mlp.sh		run_mlp.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

report

report

.gitignore

.gitignore

README.md

README.md

_policies.py

_policies.py

academy.py

academy.py

environment.py

environment.py

main.py

main.py

plot.py

plot.py

requirements.txt

requirements.txt

run_mlp.sh

run_mlp.sh

utils.py

utils.py

Repository files navigation

Learning MDP domain with Q-learning function agents

Overview

Domain

Agents

Run

About

Releases

Packages

Languages

dhorpynchenko/reinforcement-learning

Folders and files

Latest commit

History

Repository files navigation

Learning MDP domain with Q-learning function agents

Overview

Domain

Agents

Run

About

Topics

Resources

Stars

Watchers

Forks

Languages