Modular Multi-Objective Deep Reinforcement Learning with Decision Values

This repository contains source codes for work described in: "Modular Multi-Objective Deep Reinforcement Learning with Decision Values", Tomasz Tajmajer https://arxiv.org/abs/1704.06676

Cleaner environment

Cleaner is a simple game which simulates an autonomous vacuum cleaner. It is based on OpenAI's gym framework. Cleaner comes in several versions: multi-objective and single-objective and it can be used with existing RL methods.

To run cleaner run cleaner_random_agent.py script.

To test cleaner with standard DQN run cleaner_test_with_standard_dqn.py

While cleaner is running you can use 'm' key to display full map and 'q' key to hide it.

Preparation

python3 -m venv env
source env/bin/activate
pip install -r requiremets.txt

Running multi-objective DQNs with decision values

dqn_decision_values.py script will run cleaner with a 3-objective DQN. After training the model will be saved.

Testing

After training the model may be tested with different priorities assigned to each of the objectives:

python dqn_decision_values_load.py MODEL_FILE_NAME PRIORITY1 PRIORITY2 PRIORITY3 NUM_OF_EPISODES

e.g.

python dqn_decision_values_load.py example_model 0.1 0.2 0.7 10

Credits

This work was based mainly on OpenAI baselines.

Help

For more information refer to the paper or contact me.

Tomasz Tajmajer

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
demo		demo
README.md		README.md
build_graph.py		build_graph.py
cleaner.py		cleaner.py
cleaner_random_agent.py		cleaner_random_agent.py
cleaner_test_with_standard_dqn.py		cleaner_test_with_standard_dqn.py
dqn_decision_values.py		dqn_decision_values.py
dqn_decision_values_load.py		dqn_decision_values_load.py
fig1.png		fig1.png
fig2.png		fig2.png
models.py		models.py
multiobjective.py		multiobjective.py
multiobjective_replay_buffer.py		multiobjective_replay_buffer.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Modular Multi-Objective Deep Reinforcement Learning with Decision Values

Cleaner environment

Preparation

Running multi-objective DQNs with decision values

Testing

Credits

Help

About

Releases

Packages

Languages

ttajmajer/morl-dv

Folders and files

Latest commit

History

Repository files navigation

Modular Multi-Objective Deep Reinforcement Learning with Decision Values

Cleaner environment

Preparation

Running multi-objective DQNs with decision values

Testing

Credits

Help

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages