Skip to content

tinnerhrhe/MTDiff

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

This is the official code for the paper "Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning". We propose a diffusion-based effective planner and data synthesizer for multi-task RL.

Dataset

You can download our dataset via this Google Drive link.

Installation

conda env create -f environment.yml
conda activate mtdiff

Instructions

Train a MTDiff-p with:

python scripts/mtdiff_p_meta.py --model models.Tasksmeta --diffusion models.GaussianActDiffusion --loss_type statehuber --loader datasets.RTGActDataset --device cuda:0

Train a MTDiff-s with:

python scripts/mtdiff_s.py --model models.TasksAug --diffusion models.AugDiffusion --loss_type statehuber --loader datasets.AugDataset --device cuda:0

You can tune any hyperparameters in the config for experiments.

Conduct generative planning using MTDiff-p on MT50-rand with:

python scripts/test_mtdiff_p.py --diffusion_loadpath model_saved_path --diffusion_epoch selected_epoch --device cuda:0

You can tune any hyperparameters in the config/locomotion.py and diffusion.py to guide sampling.

Acknowledgment

Our code for MTDiff is partly based on the Diffuser from https://github.com/jannerm/diffuser and Decision Diffuser from https://github.com/anuragajay/decision-diffuser.

References

@inproceedings{he2023mtdiff,
  title={Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning},
  author={Haoran He and Chenjia Bai and Kang Xu and Zhuoran Yang and Weinan Zhang and Dong Wang and Bin Zhao and Xuelong Li},
  booktitle = {Advances in Neural Information Processing Systems},
  year = {2023}
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages