Minimal implementation of MACAW (ICML 2021)

This repo contains a simplified implementation of the MACAW algorithm (full code here) for easier inspection and extension.

Activate the environment with source ~/arcle_env/bin/activate, then run the code with python train.py and python test.py.

Overview

This code trains MACAW on the simple Cheetah-Direction problem, which has only two tasks (forwards and backwards). impl.py contains an example of loading the offline data (build_networks_and_buffers) and performing meta-training (loop in run.py). losses.py contains the MACAW loss functions for adapting the value function and the policy. utils.py contains the replay buffer implementation that loads the offline data.
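To give a feel for what the adaptation losses compute, here is a minimal sketch in plain Python. It is not the code in losses.py: the function names, the temperature and max_weight parameters, and the weight clipping are all assumptions made for illustration. It assumes the value function is regressed onto Monte Carlo returns and the policy is trained with an advantage-weighted log-likelihood, as in advantage-weighted regression.

```python
import math


def value_loss(values, mc_returns):
    """Mean squared error between predicted values and Monte Carlo returns."""
    return sum((v - r) ** 2 for v, r in zip(values, mc_returns)) / len(values)


def advantage_weighted_policy_loss(log_probs, mc_returns, values,
                                   temperature=1.0, max_weight=20.0):
    """Negative log-likelihood of the buffer actions, weighted by exp(advantage / T).

    log_probs:  log pi(a|s) for each (state, action) pair in the batch
    mc_returns: Monte Carlo return observed for each pair
    values:     value estimate V(s) for each state
    The clipping at max_weight is an assumption for numerical stability,
    not something taken from this repo.
    """
    total = 0.0
    for log_prob, ret, value in zip(log_probs, mc_returns, values):
        advantage = ret - value
        weight = min(math.exp(advantage / temperature), max_weight)
        total += -log_prob * weight
    return total / len(log_probs)
```

Actions with a higher-than-expected return get a larger weight, so the policy is pushed toward them more strongly than toward actions that did no better than the value estimate. In a real implementation these would operate on tensors and be differentiated through; the scalar version above only shows the arithmetic.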

Citation

If our code or research was useful for your own work, you can cite us with the following attribution:

@InProceedings{mitchell2021offline,
    title = {Offline Meta-Reinforcement Learning with Advantage Weighting},
    author = {Mitchell, Eric and Rafailov, Rafael and Peng, Xue Bin and Levine, Sergey and Finn, Chelsea},
    booktitle = {Proceedings of the 38th International Conference on Machine Learning},
    year = {2021}
}
