Skip to content

polixir/morec

Repository files navigation

MOREC (ICLR'24)

This repository is the official implementation of Model-based Offline reinforcement learning with Reward Consistency (MOREC).

Dependencies

Install via pip

# logger
git clone https://github.com/FanmingL/SmartLogger
cd SmartLogger 
pip install -e .
cd ..
pip install gym==0.24.1 mujoco==2.3.6 mujoco-py==2.1.2.14 numpy==1.22.3
# other dependency
pip install torch tqdm 
# also should install NeoRL via pip install -e 
git clone https://github.com/polixir/NeoRL
cd NeoRL
pip install -e .

docker

docker pull core.116.172.93.164.nip.io:30670/public/luofanming:20240311171020

Pretrained Dynamics Model

The pretrained dynamics rewards and dynamics models are stored in this url. Please unzip d4rl_dataset.zip, dynamics_reward_models.tar.gz, and learned_dynamics.tar.gz to MOREC/pretrained.

We will release the dynamics reward learning code in a near future.

Train

MOREC-MOPO

# generate startup commands
python generate_tmuxp_morec_mopo.py
# execute the commands with tmuxp
tmuxp load run_all.json

MOREC-MOBILE

# generate startup commands
python generate_tmuxp_morec_mobile.py
# execute the commands with tmuxp
tmuxp load run_all.json

Acknowledgement

MOREC is built upon OfflineRL-Kit.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages