Blog post: how to do deterministic policy gradient with Gumbel-softmax, and why you should.

yandexdataschool/gumbel_dpg

Discrete deterministic policy gradient with Gumbel.

Read us here!

To reproduce the results, you will need Theano, Lasagne, and Gym. The repo also includes a Dockerfile if you prefer containers.
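For readers who have not met the Gumbel-softmax trick before, here is a minimal sketch of the sampling step in plain Python. It is not taken from the repo's code (which uses Theano and Lasagne); the function name `gumbel_softmax_sample` and its parameters are made up for illustration. The idea is to add Gumbel noise to the action logits and apply a temperature-scaled softmax, which gives a relaxed one-hot action that stays differentiable with respect to the logits.

```python
import math
import random


def gumbel_softmax_sample(logits, temperature=1.0, rng=random):
    """Return one relaxed one-hot sample for the given unnormalized logits.

    Hypothetical helper for illustration only, not the repo's API.
    """
    noisy = []
    for logit in logits:
        # Gumbel(0, 1) noise via inverse transform: g = -log(-log(u)).
        # Clamp u away from 0 so both logarithms stay finite.
        u = max(rng.random(), 1e-12)
        g = -math.log(-math.log(u))
        noisy.append((logit + g) / temperature)

    # Numerically stable softmax over the perturbed, temperature-scaled logits.
    top = max(noisy)
    exps = [math.exp(x - top) for x in noisy]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    rng = random.Random(0)
    # Lower temperatures push the output closer to a one-hot vector.
    print(gumbel_softmax_sample([1.0, 2.0, 0.5], temperature=0.5, rng=rng))
```

Taking the argmax of the output recovers an ordinary sample from the softmax distribution over the logits (the Gumbel-max trick), so the relaxation can be treated as a smoothed version of discrete action sampling.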
