GitHub - tgangwani/GuidanceRewards: Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)

This repository contains code for our paper Learning Guidance Rewards with Trajectory-space Smoothing, published at the Conference on Neural Information Processing Systems (NeurIPS 2020).

The code reuses the PyTorch SAC code from the denisyarats/pytorch_sac repository. It was tested with the following packages:

python 3.6.6
pytorch 0.4.1
gym 0.10.8
hydra 0.11.3
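
To see whether a local environment matches the tested versions, they can be compared in Python. The sketch below is not part of the repository. The helper names `installed_versions` and `find_mismatches` are hypothetical, and it assumes the packages import as `torch`, `gym`, and `hydra` and each expose a `__version__` attribute.

```python
import importlib
import platform

# Versions the README reports as tested.
TESTED = {"python": "3.6.6", "torch": "0.4.1", "gym": "0.10.8", "hydra": "0.11.3"}


def installed_versions(modules=("torch", "gym", "hydra")):
    """Collect the running Python version and each module's __version__."""
    found = {"python": platform.python_version()}
    for name in modules:
        try:
            module = importlib.import_module(name)
            # Assumption: the module exposes __version__; fall back if not.
            found[name] = getattr(module, "__version__", "unknown")
        except ImportError:
            found[name] = "not installed"
    return found


def find_mismatches(found, tested=TESTED):
    """Return {package: (found, tested)} for every version that differs."""
    return {
        name: (found.get(name, "not installed"), wanted)
        for name, wanted in tested.items()
        if found.get(name, "not installed") != wanted
    }


if __name__ == "__main__":
    for name, (have, want) in find_mismatches(installed_versions()).items():
        print(f"{name}: found {have}, tested with {want}")
```

The comparison is an exact string match, so a build suffix on an installed version is reported as a mismatch even when the release is the same.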

Running command

To run the SAC experiments on MuJoCo, use the command below. The hyperparameters are specified in the config folder. See the file run_cmds.sh for further commands.

python main.py env_name="Hopper-v2" seed=$RANDOM
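
To repeat the run over several seeds, the same command line can be assembled in Python. This is a minimal sketch and does not reproduce run_cmds.sh. `build_commands` and `run_all` are hypothetical helpers, and only the `env_name` and `seed` overrides from the command above are used.

```python
import subprocess
import sys


def build_commands(env_name, seeds, script="main.py"):
    """Build one argument list per seed, mirroring the README command."""
    return [
        [sys.executable, script, f"env_name={env_name}", f"seed={seed}"]
        for seed in seeds
    ]


def run_all(commands, dry_run=True):
    """Print each command; also execute it when dry_run is False."""
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops at the first run that exits with an error.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run by default: the commands are printed but not launched.
    run_all(build_commands("Hopper-v2", seeds=[0, 1, 2]))
```

Fixed seeds make the runs repeatable, whereas `seed=$RANDOM` in the shell command draws a new seed on every invocation. Pass `dry_run=False` from the repository root to launch the runs.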

Credits

denisyarats/pytorch_sac
