Skip to content

wadx2019/qvpo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

[NeurIPS 2024] Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization

Code release for Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization (NeurIPS 2024).

[paper] [project page]

Requirements

Installations of PyTorch and MuJoCo are needed. A suitable conda environment named qvpo can be created and activated with:

conda create qvpo
conda activate qvpo

To get started, install the additionally required python packages into your environment.

pip install setuptools==65.0.0 wheel==0.38.4
pip install -r requirements.txt

Running

Running experiments based on our code could be quite easy, so below we use HalfCheetah-v3 task as an example.

python main.py --env_name HalfCheetah-v3 --weighted --aug

Citation

If you find this repository useful in your research, please consider citing:

@inproceedings{
ding2024diffusionbased,
title={Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization},
author={Shutong Ding and Ke Hu and Zhenhao Zhang and Kan Ren and Weinan Zhang and Jingyi Yu, Jingya Wang and Ye Shi},
booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
year={2024},
url={https://arxiv.org/abs/2405.16173}
}

Acknowledgement

The code of QVPO is based on the implementation of DIPO.

About

official implementation of QVPO

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages