reinforce

Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).

machine-translation nmt bandit-learning weak-feedback neural-mt reinforce

Updated Sep 5, 2018
Python

siddk / rl-kitchen-sink

Star

PyTorch Implementations of Standard Deep RL Algorithms (including REINFORCE, A2C, PPO)

reinforcement-learning pytorch reinforcement-learning-algorithms reinforce pytorch-rl ppo a2c

Updated Sep 11, 2018
Python

imraviagrawal / Reinforcement-Learning-Implementation

Star

Implementation of Reinforcement Algorithms from scratch

reinforcement-learning q-learning cartpole mountain-car sarsa gridworld reinforce td-learning cross-entropy sarsa-lambda blackbox-optimization gridworld-environment actor-critic-algorithm cross-entropy-policy-search cartpole-environment reinforcement-algorithms q-learning-lambda

Updated Dec 6, 2018
Python

tijiang13 / RL-Gomoku

Star

reinforcement-learning q-learning mcts sarsa reinforce actor-critic

Updated Feb 22, 2019
Python

EdoardoPona / Hex-AI-Reinforcement-Learning

Star

Reinforcement Learning agents for the game of Hex

hex reinforcement-learning deep-learning policy-gradient reinforcement-learning-algorithms reinforce

Updated Apr 22, 2019
Python

agakshat / visualdialog-pytorch

Star

Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359

machine-learning natural-language-processing reinforcement-learning computer-vision communication dialog pytorch recurrent-neural-networks multi-agent convolutional-neural-networks reinforce emergent-behavior icml curriculum-learning visual-dialog cvpr2018

Updated May 16, 2019
Python

JayLohokare / reinforce-algorithm-policy-deepRL

Star

OpenAI Gym's Cartpole environment REINFORCE algorithm implementation

deep-reinforcement-learning policy-gradient reinforce gradient-ascent

Updated May 16, 2019
Jupyter Notebook

Improve this page

Add a description, image, and links to the reinforce topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the reinforce topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reinforce

Here are 106 public repositories matching this topic...

chingyaoc / pytorch-REINFORCE

TNieuwdorp / Thesis

Twice22 / Reinforcement-Learning

lantunes / mountain-car-continuous

HaiyinPiao / keras-policy-gradient

vincenzosantopietro / RL-Reinforce-with-TensorFlow-and-OpenAI-Gym

alokwhitewolf / Visual-Attention-Model

ajgupta93 / Reinforcement-Learning

huiwenzhang / rl-benchmark

jbecke / Open-AI-Gym-ABCs

vigneshramk / A2C-Reinforce-Behavior-Cloning

JasonZhu1313 / analytics-zoo-reinforcement-learning

xdek42 / AutoReinforce

juliakreutzer / bandit-neuralmonkey

siddk / rl-kitchen-sink

imraviagrawal / Reinforcement-Learning-Implementation

tijiang13 / RL-Gomoku

EdoardoPona / Hex-AI-Reinforcement-Learning

agakshat / visualdialog-pytorch

JayLohokare / reinforce-algorithm-policy-deepRL

Improve this page

Add this topic to your repo