For Walker environments, MuJoCo 131 is required. Install it the same way as MuJoCo 200. To switch between MuJoCo versions:
export MUJOCO_PY_MJPRO_PATH=~/.mujoco/mjpro${VERSION_NUM}
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:~/.mujoco/mjpro${VERSION_NUM}/bin
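For example, to switch to MuJoCo 131 for the Walker environments (assuming mjpro131 was unpacked to ~/.mujoco/mjpro131):
export MUJOCO_PY_MJPRO_PATH=~/.mujoco/mjpro131
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:~/.mujoco/mjpro131/bin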
Examples of training policies and generating trajectories on multiple tasks are given below. For point-robot and cheetah-vel:
python policy_train.py ./configs/cpearl-sparse-point-robot.json # note: this config actually uses a dense reward; for the sparse-reward version, uncomment line 205 in ./Offline-MetaRL/rlkit/envs/point_robot.py (sketch below)
python policy_train.py ./configs/cheetah-vel.json
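The sparse-reward toggle mentioned above is a manual edit; a minimal sketch of doing it from the shell (GNU sed), assuming line 205 of point_robot.py is a commented-out line starting with '#':
# uncomment line 205 in place, preserving indentation; verify the file afterwards
sed -i '205s/^\([[:space:]]*\)#[[:space:]]*/\1/' ./Offline-MetaRL/rlkit/envs/point_robot.py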
For Meta-World MT1:
python policy_train_mt1.py ./configs/cpearl-mt1.json
For Meta-World ML1 tasks (you can modify the task in ./configs/ml1.json):
python data_collection_ml1.py ./configs/ml1.json
Generated data will be saved in ./data/
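A quick check after collection (the subdirectory layout under ./data/ depends on the task set in ml1.json):
ls ./data/ # the generated trajectories should appear here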
To reproduce a Meta-World ML1 experiment, run:
run_ml1.sh
To run different tasks, modify "env_name" in ./configs/cpearl-ml1.json as well as "datadirs" in run_ml1.sh.
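A sketch of switching tasks from the shell, assuming "env_name" is a top-level key in cpearl-ml1.json and using an illustrative task name (the actual names depend on the Meta-World ML1 task set):
# set the task (requires jq; "reach-v1" is only an example name)
jq '.env_name = "reach-v1"' ./configs/cpearl-ml1.json > tmp.json && mv tmp.json ./configs/cpearl-ml1.json
# then point "datadirs" in run_ml1.sh at the matching folder under ./data/ and run:
bash run_ml1.sh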
Similarly, for point-robot and cheetah-vel:
run_point.sh
run_cheetah.sh
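For example, from the repository root (assuming the corresponding point-robot and cheetah-vel trajectories have already been generated under ./data/):
bash run_point.sh
bash run_cheetah.sh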