GitHub - RBorn02/MetaIPPO: Multi Agent IPPO implementation for Meta RL

Code for Emergence of collective open-ended exploration from Decentralized Meta-Reinforcement learning

We train two decentralized agents together on an open ended tasks space to study the emergence of collective exploration behaviors. Our agents are able to generalize to novel objects and tasks, as well as an essentially open ended setting. For videos of the trained agents acting in the environment see https://sites.google.com/view/collective-open-ended-explore/

Training To launch the training from scratch on CPU with 16 environments on each of the 8 workers with a total batch size of 128000 environment steps, run:

python3 mp_train.py --num_workers 8 --num_envs 16 --rollout_steps 128000

Evaluation To launch the evaluation of pretrained models with 8 environments on 1 worker with a total batch size of 8000 enivornemnt steps and record a video, run:

python3 testing.py --pretrained models --num_workers 1 --num_envs 8 --rollout_steps 8000 --record_video True

Name		Name	Last commit message	Last commit date
Latest commit History 135 Commits
Envs		Envs
Utils		Utils
__pycache__		__pycache__
models		models
.gitattributes		.gitattributes
README.md		README.md
agent.py		agent.py
environment.yml		environment.yml
mp_train.py		mp_train.py
policy.py		policy.py
requirements.txt		requirements.txt
testing.py		testing.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Envs

Envs

Utils

Utils

pycache

pycache

models

models

.gitattributes

.gitattributes

README.md

README.md

agent.py

agent.py

environment.yml

environment.yml

mp_train.py

mp_train.py

policy.py

policy.py

requirements.txt

requirements.txt

testing.py

testing.py

Repository files navigation

About

Releases

Packages

Languages

RBorn02/MetaIPPO

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages