Learning General World Models in a Handful of Reward-Free Deployments

Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments.

CASCADE is a novel approach for self-supervised exploration in the reward-free deployment efficient setting. It seeks to learn a world model by collecting data with a population of agents, using an information theoretic objective inspired by Bayesian Active Learning. CASCADE achieves this by specifically maximizing the diversity of trajectories sampled by the population through a novel cascading objective.

Install Dependencies

pip3 install tensorflow==2.6.0 keras=2.6 tensorflow_probability ruamel.yaml 'gym[atari]' dm_control pycparser scikit-learn scipy gym_minigrid

Run

Example: train a population of 10 CASCADE agents on Crafter, collecting 50k steps per deployment.

python main.py --task=crafter_noreward --xpid=test_cascade_walker --num_agents=10 --cascade_alpha=0.1 --train_every=50000 --envs=10 --offline_model_train_steps=5001

Reference

If you find this work useful, please cite:

@article{xu2022cascade,
  title = {Learning General World Models in a Handful of Reward-Free Deployments},
  doi = {10.48550/ARXIV.2210.12719},
  author = {Xu, Yingchen and Parker-Holder, Jack and Pacchiano, Aldo and Ball, Philip J. and Rybkin, Oleh and Roberts, Stephen J. and Rocktäschel, Tim and Grefenstette, Edward},
  publisher = {arXiv},
  url = {https://arxiv.org/abs/2210.12719},
  year = {2022},
}

License

The majority of CASCADE is licensed under CC-BY-NC, however portions of the project are available under separate license terms: https://github.com/danijar/dreamerv2 is licensed under the MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
dreamerv2		dreamerv2
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

assets

assets

dreamerv2

dreamerv2

CODE_OF_CONDUCT.md

CODE_OF_CONDUCT.md

CONTRIBUTING.md

CONTRIBUTING.md

LICENSE

LICENSE

README.md

README.md

main.py

main.py

Repository files navigation

Learning General World Models in a Handful of Reward-Free Deployments

Install Dependencies

Run

Reference

License

About

Releases

Packages

Languages

License

facebookresearch/cascade

Folders and files

Latest commit

History

Repository files navigation

Learning General World Models in a Handful of Reward-Free Deployments

Install Dependencies

Run

Reference

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Languages