Attentive Update of Multi-Critic for Deep Reinforcement Learning

PyTorch implementation of the paper "Attentive Update of Multi-Critic (AUMC)", accepted as an oral paper at ICME 2021. The method is evaluated on MuJoCo continuous control tasks in OpenAI Gym. Agents are trained with PyTorch 1.4 and Python 3.6.

Instructions

Recommended: Run with Docker

# python        3.6    (apt)
# pytorch       1.4.0  (pip)
# tensorflow    1.14.0 (pip)
# DMC Control Suite and MuJoCo
cd dockerfiles
docker build . -t aumc_rl  # note: Docker image names must be lowercase

For other dockerfiles, see RL Dockerfiles.
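
Once the image is built, a container can be started along the following lines. This is a minimal usage sketch, not part of the repository; it assumes the NVIDIA Container Toolkit is installed, and the mount path and GPU flags depend on your setup.

# Minimal usage sketch (not from the repo): start an interactive container
# with GPU access and the repository mounted at /workspace.
docker run --gpus all -it --rm \
    -v "$(pwd)":/workspace \
    -w /workspace \
    aumc_rl bash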

Launch experiments

Run with the script batch_aumc_mujoco_4seed_cuda.sh:

# e.g.
bash batch_aumc_mujoco_4seed_cuda.sh Hopper-v2 DDPG_aumc 0 0.4  # env_name: Hopper-v2; algorithm: DDPG coupled with AUMC; CUDA_Num: 0; beta: 0.4
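
The launcher script itself is not reproduced here; conceptually it runs the same configuration under four random seeds, roughly as in the hypothetical sketch below. The actual batch_aumc_mujoco_4seed_cuda.sh, and the training entry point called main.py here, may be named and structured differently.

# Hypothetical sketch of a 4-seed launcher (main.py is a placeholder name).
ENV=$1        # environment name, e.g. Hopper-v2
POLICY=$2     # algorithm, e.g. DDPG_aumc
CUDA_NUM=$3   # GPU index
BETA=$4       # AUMC beta hyperparameter
for SEED in 0 1 2 3; do
    CUDA_VISIBLE_DEVICES=$CUDA_NUM python main.py \
        --policy "$POLICY" --env "$ENV" --seed "$SEED" --beta "$BETA" &
done
wait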

Visualization of the environments

Run with the scripts render_mujoco.py / render_aumc_mujoco.py:

# e.g., visualization of the environments with random actions:
python render_mujoco.py --env Ant-v2  # env_name: Ant-v2

# or visualization of the environments with a trained policy:
CUDA_VISIBLE_DEVICES=0 python render_aumc_mujoco.py \
    --policy "TD3_aumc" \
    --env "Ant-v2" \
    --load_model "default" \
    --seed 2  
# env_name: Ant-v2; load policy: policy trained with TD3_aumc and seed 2

Performance on MuJoCo

Evaluated environments include Ant-v2, HalfCheetah-v2, Hopper-v2, Humanoid-v2, Swimmer-v2, and Walker2d-v2.
