Scalable Lifelong Reinforcement Learning

This code requires the following:

For the 2D navigation domains, data is generated by envs/navigation.py.
For the Hopper/HalfCheetah/Ant MuJoCo domains, the modified MuJoCo environments are in envs/mujoco/*.

For example, to run the code in the 2D navigation domain, run the bash script navi_v1.sh. See also the usage instructions in the Python scripts main_sllrl.py and main_baselines.py.
Once the results are saved as output/*/*.npy files, plot them using plot_results.py. For example, the result for navi_v1.sh is:

[Figures: performance comparison; clustering visualization]
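
Before plotting, it can help to check what the saved result files contain. The following is a minimal sketch, not part of the repository: the helper names `load_results` and `summarize` are hypothetical, and it assumes each .npy file holds a plain numeric array (the actual layout written by the training scripts is not documented here, so treat the shape and mean as a sanity check only).

```python
import glob

import numpy as np


def load_results(pattern="output/*/*.npy"):
    """Load every .npy file matching the glob pattern, keyed by file path.

    Hypothetical helper: assumes each file stores a plain numeric array.
    """
    results = {}
    for path in sorted(glob.glob(pattern)):
        results[path] = np.load(path)
    return results


def summarize(results):
    """Return (shape, mean) for each loaded array as a quick sanity check."""
    summary = {}
    for path, arr in results.items():
        summary[path] = (arr.shape, float(np.mean(arr)))
    return summary


if __name__ == "__main__":
    for path, (shape, mean) in summarize(load_results()).items():
        print(f"{path}: shape={shape}, mean={mean:.3f}")
```

If a file was saved as an object array, `np.load` will refuse to read it with the default settings; in that case, consult plot_results.py for how the repository itself reads the files.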

To ask questions or report issues, please open an issue on the issue tracker, or email zhiwang@nju.edu.cn.
