Informed POMDP: Leveraging Additional Information in Model-Based RL

Official implementation of the Informed Dreamer, an adaptation of Dreamer to Informed POMDPs.

If you find this code useful, please reference in your paper:

@article{lambrechts2023informed,
  title={{Informed POMDP: Leveraging Additional Information in Model-Based RL}},
  author={Lambrechts, Gaspard and Bolland, Adrien and Ernst, Damien},
  journal={ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems},
  year={2023},
}

@article{hafner2023dreamerv3,
  title={Mastering Diverse Domains through World Models},
  author={Hafner, Danijar and Pasukonis, Jurgis and Ba, Jimmy and Lillicrap, Timothy},
  journal={arXiv preprint arXiv:2301.04104},
  year={2023}
}

To learn more:

Instructions

For installation, examples and tips, see the original Dreamer repository.

This repository implements the following Informed POMDPs:

Varying Mountain Hike (state informed)
Flickering Atari (annotated-RAM informed)
Velocity DeepMind Control (state informed)
Flickering DeepMind Control (state informed)

By convention, the observation keys starting with info_ are considered as part of the information $i$, while the other observation keys are considered as part of the observation $o$.

The Informed Dreamer and the Uninformed Dreamer agents can be trained as follows:

For the informed POMDP training, use --decoder.outputs 'info_.*' that only uses the information.
For the classical POMDP training, use --decoder.outputs '^(?!info_).*' that ony uses the observation.

For both the information and the observation, use --decoder.outputs '.*' (untested).

Experiments

Varying Mountain Hike

python dreamerv3/train.py --logdir logs/$(date '+%Y-%m-%d_%H.%M.%S') \
    --configs hike --task 'hike_foo' --env.hike.discrete True \
    --configs hike --env.hike.altitude False --env.hike.rotations True \
    --decoder.outputs 'info_.*'

Flickering Atari

python dreamerv3/train.py --logdir logs/$(date '+%Y-%m-%d_%H.%M.%S') \
    --configs atari100k --task 'atari_pong' --env.atari.flickering 0.5 \
    --decoder.outputs 'info_.*'

Velocity Control

python dreamerv3/train.py --logdir logs/$(date '+%Y-%m-%d_%H.%M.%S') \
    --configs dmc_velocity --task 'dmc_hopper_stand' \
    --decoder.outputs 'info_.*'

Flickering Control

python dreamerv3/train.py --logdir logs/$(date '+%Y-%m-%d_%H.%M.%S') \
    --configs dmc_vision --task 'dmc_hopper_stand' --env.dmc.flickering 0.5 \
    --decoder.outputs 'info_.*'

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
dreamerv3		dreamerv3
scores		scores
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
example.py		example.py
informed-pomdp.png		informed-pomdp.png
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dreamerv3

dreamerv3

scores

scores

.gitignore

.gitignore

LICENSE

LICENSE

MANIFEST.in

MANIFEST.in

README.md

README.md

example.py

example.py

informed-pomdp.png

informed-pomdp.png

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

Informed POMDP: Leveraging Additional Information in Model-Based RL

Instructions

Experiments

Varying Mountain Hike

Flickering Atari

Velocity Control

Flickering Control

About

Releases

Packages

Languages

License

glambrechts/informed-dreamer

Folders and files

Latest commit

History

Repository files navigation

Informed POMDP: Leveraging Additional Information in Model-Based RL

Instructions

Experiments

Varying Mountain Hike

Flickering Atari

Velocity Control

Flickering Control

About

Topics

Resources

License

Stars

Watchers

Forks

Languages