Neural Episodic Control with State Abstraction

NECSA is based on tianshou platform. Please refer the original repo for installation.

0 Introduction

NECSA is implemented in a highly supplementary way. Please refer to tianshou/data/necsa_collector.py and necsa_atari_collector.py for details.

1 requirements

refer to requirements.txt

2 Anaconda and Python

wget https://repo.anaconda.com/archive/Anaconda3-2020.11-Linux-x86_64.sh
bash ./Anaconda3-2020.11-Linux-x86_64.sh
(should be changed)echo 'export PATH="$pathToAnaconda/anaconda3/bin:$PATH"' >> ~/.bashrc
(optional) conda config --set auto_activate_base false
conda create -n necsa python=3.8.5
conda activate necsa
pip3 install -r requirements.txt

3 Install Atari and MuJoCo

Download the ROM files for Atari, unzip and execute:
python -m atari_py.import_roms
wget https://mujoco.org/download/mujoco210-linux-x86_64.tar.gz
tar xvf mujoco210-linux-x86_64.tar.gz && mkdir -p ~/.mujoco && mv mujoco210 ~/.mujoco/mujoco210
wget https://www.roboti.us/file/mjkey.txt -O ~/.mujoco/mjkey.txt
echo "export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:~/.mujoco/mujoco210/bin" >> ~/.bashrc

4 Execution:

Example:

 python necsa_td3.py --task Walker2d-v3 --epoch 1000 --step 3 --grid_num 5 --epsilon 0.2 --mode state_action

Execute the scripts:

 bash scripts/HalfCheetah-v3/train_NECSA_TD3.sh

5 Experiment results:

Data will be automatically saved into ./results

6 Citing and Thanks

Our program is highly depending on tianshou, thanks to the efforts by the developers. Please kindly cite the paper if you referenced our repo.

@article{tianshou,
  title={Tianshou: A Highly Modularized Deep Reinforcement Learning Library},
  author={Weng, Jiayi and Chen, Huayu and Yan, Dong and You, Kaichao and Duburcq, Alexis and Zhang, Minghao and Su, Yi and Su, Hang and Zhu, Jun},
  journal={arXiv preprint arXiv:2107.14171},
  year={2021}
}

Our work NECSA is also inspired by 3 state-of-the-art episodic control algorithms: EMAC, EVA and GEM. Please refer to the corresponding repo for details.

@article{kuznetsov2021solving,
  title={Solving Continuous Control with Episodic Memory},
  author={Kuznetsov, Igor and Filchenkov, Andrey},
  journal={arXiv preprint arXiv:2106.08832},
  year={2021}
}

@article{hansen2018fast,
title={Fast deep reinforcement learning using online adjustments from the past},
author={Hansen, Steven and Pritzel, Alexander and Sprechmann, Pablo and Barreto, Andr{\'e} and Blundell, Charles},
journal={Advances in Neural Information Processing Systems},
volume={31},
year={2018}
}

@article{hu2021generalizable,
  title={Generalizable episodic memory for deep reinforcement learning},
  author={Hu, Hao and Ye, Jianing and Zhu, Guangxiang and Ren, Zhizhou and Zhang, Chongjie},
  journal={arXiv preprint arXiv:2103.06469},
  year={2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 122 Commits
rebuttal		rebuttal
scripts		scripts
tianshou		tianshou
.gitignore		.gitignore
README.md		README.md
atari_network.py		atari_network.py
atari_wrapper.py		atari_wrapper.py
ddpg.py		ddpg.py
dqn.py		dqn.py
mujoco_env.py		mujoco_env.py
necsa_ddpg.py		necsa_ddpg.py
necsa_dqn.py		necsa_dqn.py
necsa_rainbow.py		necsa_rainbow.py
necsa_td3.py		necsa_td3.py
rainbow.py		rainbow.py
requirements.txt		requirements.txt
td3.py		td3.py

lizhuo-1994/NECSA

Folders and files

Latest commit

History

Repository files navigation

Neural Episodic Control with State Abstraction

0 Introduction

1 requirements

2 Anaconda and Python

3 Install Atari and MuJoCo

4 Execution:

5 Experiment results:

6 Citing and Thanks

About

Resources

Stars

Watchers

Forks

Languages