TRPO with continuous actions

This repo implements a TRPO agent ( http://arxiv.org/abs/1502.05477 ) by modifying https://github.com/wojzaremba/trpo and replacing the softmax distributions with Gaussian distributions, and adding a tiny bit of bells and whistles.

To run the code, simply type python main.py --task $TASK_NAME. Once training is complete, main.py will upload the run using your OpenAI gym account which should be stored in OPENAI_GYM_API_KEY.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitignore		.gitignore
README.md		README.md
main.py		main.py
space_conversion.py		space_conversion.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TRPO with continuous actions

About

Releases

Packages

Languages

zzmjohn/trpo

Folders and files

Latest commit

History

Repository files navigation

TRPO with continuous actions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages