Revisiting Data Augmentation in Deep Reinforcement Learning

This is an original PyTorch implementation of tangent prop regularization and KL regularization in DrQ-v2 from

[Revisiting Data Augmentation in Deep Reinforcement Learning] by

Jianshu Hu, Yunpeng Jiang and Paul Weng.

Method

We implement tangent prop regularization and KL regularization based on DrQv2.

Citation

If you use this repo in your research, please consider citing the paper as follows:

@inproceedings{
hu2024revisiting,
title={Revisiting Data Augmentation in Deep Reinforcement Learning},
author={Jianshu Hu and Yunpeng Jiang and Paul Weng},
booktitle={The Twelfth International Conference on Learning Representations},
year={2024},
url={https://openreview.net/forum?id=EGQBpkIEuu}
}

Instructions

Install MuJoCo if it is not already the case:

Obtain a license on the MuJoCo website.
Download MuJoCo binaries here.
Unzip the downloaded archive into ~/.mujoco/mujoco200 and place your license key file mjkey.txt at ~/.mujoco.
Use the env variables MUJOCO_PY_MJKEY_PATH and MUJOCO_PY_MUJOCO_PATH to specify the MuJoCo license key path and the MuJoCo directory path.
Append the MuJoCo subdirectory bin path into the env variable LD_LIBRARY_PATH.

Install the following libraries:

sudo apt update
sudo apt install libosmesa6-dev libgl1-mesa-glx libglfw3

Install dependencies:

conda env create -f conda_env.yml
conda activate drqv2

Train the agent with original DrQv2:

python train.py task=quadruped_walk

Train the agent with tangent prop and KL regularization:

python train.py task=quadruped_walk add_KL_loss=true tangent_prop=true

Monitor results:

tensorboard --logdir exp_local

License

The majority of this code is licensed under the MIT license, however portions of the project are available under separate license terms: DeepMind is licensed under the Apache 2.0 license.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
cfgs		cfgs
curves		curves
run_experiments		run_experiments
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
conda_env.yml		conda_env.yml
data_augmentation.py		data_augmentation.py
dmc.py		dmc.py
drqv2.py		drqv2.py
logger.py		logger.py
plot.py		plot.py
replay_buffer.py		replay_buffer.py
train.py		train.py
utils.py		utils.py
video.py		video.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Revisiting Data Augmentation in Deep Reinforcement Learning

Method

Citation

Instructions

License

About

Releases

Packages

Languages

License

Jianshu-Hu/revisiting-data-augmentation-in-DRL

Folders and files

Latest commit

History

Repository files navigation

Revisiting Data Augmentation in Deep Reinforcement Learning

Method

Citation

Instructions

License

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages