Policy Gradient With Serial Markov Chain Reasoning Code

This repository contains minimal implementations of the SSPG algorithm from the NeurIPS 2022 paper Policy Gradient With Serial Markov Chain Reasoning. It provides two separate code bases: sspg_mujoco for the proprioceptive experiments and sspg_dmc for the pixel-based experiments. See the README.md file in each subfolder for installation instructions and for the steps to replicate the results.

Extend/contact

For extensions, queries, or questions, feel free to open a pull request or an issue, or to contact Edoardo Cetin at edoardo.cetin@kcl.ac.uk.

To cite our work, you can use:

@inproceedings{cetin2022serialMCR,
 author = {Cetin, Edoardo and Celiktutan, Oya},
 booktitle = {Advances in Neural Information Processing Systems},
 editor = {S. Koyejo and S. Mohamed and A. Agarwal and D. Belgrave and K. Cho and A. Oh},
 pages = {8824--8839},
 publisher = {Curran Associates, Inc.},
 title = {Policy Gradient With Serial Markov Chain Reasoning},
 url = {https://proceedings.neurips.cc/paper_files/paper/2022/file/39fac857b4467e3ef4f358186bb07d81-Paper-Conference.pdf},
 volume = {35},
 year = {2022}
}
