Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning

Code for the paper Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning. If this code was useful to your research, please cite the paper:

@article{sasso2023multisource,
      title={Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning},
      author={Remo Sasso and Matthia Sabatelli and Marco A. Wiering},
      journal={Transactions on Machine Learning Research},
      issn={2835-8856},
      year={2023},
      url={https://openreview.net/forum?id=1nhTDzxxMA}
}

Instructions

Please see the Dreamer repository for general dependency requirements and installation instructions.

Multi-Task Learning

To train a multi-task agent on, say, the Hopper, Ant, and HalfCheetah tasks for 2M environment steps:

python dreamer-multi-task.py --task1 HopperBulletEnv-v0 --task2 AntBulletEnv-v0 --task3 HalfCheetahBulletEnv-v0 --batch_length 50 --envs 3 --steps 2e6 --logdir './logdir/multi-hopper-ant-cheetah'

Modular and Fractional Transfer Learning

For modular and fractional transfer learning, first place the variables of the source (multi-task) agent in the log directory of the agent you are about to train. For example, to transfer to a HalfCheetah agent, create the folder './logdir/frac-cheetah/', place the variables.pkl of the multi-task agent in that folder, and then run:

python dreamer-FTL.py --task1 HalfCheetahBulletEnv-v0 --batch_length 50 --envs 1 --steps 1e6 --transfer True --transfer_factor 0.2 --logdir './logdir/frac-cheetah/'
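
The folder preparation described above can also be scripted. The following is a minimal sketch, not part of the repository: the helper name stage_source_variables is hypothetical, and the source path in the usage comment assumes the multi-task run wrote its variables.pkl directly into its own log directory.

```python
import shutil
from pathlib import Path


def stage_source_variables(source_pkl, target_logdir):
    """Copy a source agent's variables.pkl into the target agent's logdir.

    Hypothetical helper: it only automates the manual copy step and does
    not touch the contents of the file.
    """
    source_pkl = Path(source_pkl)
    target_logdir = Path(target_logdir)
    if not source_pkl.is_file():
        raise FileNotFoundError(f"source variables not found: {source_pkl}")
    # Create the target folder (e.g. ./logdir/frac-cheetah/) if needed.
    target_logdir.mkdir(parents=True, exist_ok=True)
    destination = target_logdir / "variables.pkl"
    shutil.copy2(source_pkl, destination)
    return destination


# Example call (the source path is an assumption about where the
# multi-task agent stored its variables):
# stage_source_variables(
#     "./logdir/multi-hopper-ant-cheetah/variables.pkl",
#     "./logdir/frac-cheetah/",
# )
```

After staging the file, the dreamer-FTL.py command above can be run with the same --logdir.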

Meta-Model Transfer Learning

For meta-model transfer learning, first use agent.load('./logdir/variables.pkl') locally to load the variables of a multi-task agent, and then agent.save_single(agent._encode, "./encoder.pkl") to save the corresponding autoencoder in './logdir/'. You can then train single-task agents with the frozen autoencoder using 'dreamer-MMTL.py'. Save the reward model of each of these agents in the same way as the autoencoder, e.g. with agent.save_single(agent._reward, "./stored_meta/cheetah.pkl"). For example, after training a HalfCheetah and an Ant agent with the frozen UFS autoencoder, place their reward model parameters in './stored_meta/'. Then, to perform MMTL, run e.g.:

python dreamer-MMTL.py --task1 HopperBulletEnv-v0 --n_meta 2 --meta1 HalfCheetahBulletEnv-v0 --meta2 AntBulletEnv-v0 --batch_length 50 --envs 1 --steps 1e6 --logdir './logdir/mmtl-hopper/'
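
The load-then-save steps for the autoencoder and the reward models follow the same pattern, so they can be wrapped in one small function. This is only a sketch: export_module is a hypothetical helper, it assumes `agent` is an already constructed agent exposing the load and save_single methods mentioned above, and constructing that agent is left out because it depends on the repository's configuration.

```python
from pathlib import Path


def export_module(agent, variables_path, module_name, out_path):
    """Load an agent's variables and save one of its sub-modules.

    Hypothetical helper; `agent` is assumed to provide load(path) and
    save_single(module, path) as used in the instructions above.
    """
    agent.load(variables_path)
    module = getattr(agent, module_name)  # e.g. "_encode" or "_reward"
    # Make sure the output folder (e.g. ./stored_meta/) exists.
    Path(out_path).parent.mkdir(parents=True, exist_ok=True)
    agent.save_single(module, out_path)
    return out_path


# Example calls, using the paths from the instructions above:
# export_module(agent, "./logdir/variables.pkl", "_encode", "./encoder.pkl")
# export_module(agent, "./logdir/variables.pkl", "_reward", "./stored_meta/cheetah.pkl")
```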
