Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning

Official Repository of the ICLR2023 paper Value Memory Graph.

Install the Environment

Please run the following commands to install the environment via conda.

conda create -n vmg python=3.9
conda activate vmg
conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch
conda install pytorch-scatter -c pyg

conda install -c conda-forge mesalib
conda install -c conda-forge patchelf

pip install -r requirement.txt

Usage

To train a model, run

python main.py --dataset dataset-you-want --gpu desired-gpu-idx

To evaluate the model, run and set the hyperparameters in the arguments in the following way

python eval_script/eval_policy.py  --dataset dataset-you-want  --action_mode top  --cluster_thresh gamma_{m}  --discount discount  --min_future_step N_{sg}  --ckpt CheckPointYouWant  --gpu 0

The hyperparameters we use can be found in the Appx.D of the paper. Set the argument '--action_mode' to 'top' when the search step N_s is infinit. Otherwise, set it to 'neighbor'. Note that different checkpoints in a single training case may perform differently and you need to search for the checkpoint for the best performance.

Cite

Please cite us as

@article{zhu2022value,
  title={Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning},
  author={Zhu, Deyao and Li, Li Erran and Elhoseiny, Mohamed},
  journal={arXiv preprint arXiv:2206.04384},
  year={2022}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
configs		configs
eval_script		eval_script
fig		fig
maprl		maprl
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirement.txt		requirement.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

configs

configs

eval_script

eval_script

fig

fig

maprl

maprl

LICENSE

LICENSE

README.md

README.md

main.py

main.py

requirement.txt

requirement.txt

Repository files navigation

Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning

Install the Environment

Usage

Cite

About

Releases

Packages

Languages

License

TsuTikgiau/ValueMemoryGraph

Folders and files

Latest commit

History

Repository files navigation

Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning

Install the Environment

Usage

Cite

About

Resources

License

Stars

Watchers

Forks

Languages