Prevent this user from interacting with your repositories and sending you notifications.
Learn more about blocking users.
You must be logged in to block users.
Contact GitHub support about this user’s behavior.
Learn more about reporting abuse.
Application of proximal policy optimization algorithm to the card game Big 2 using Tensorflow
Challenging dexterous manipulation environments for RL that extend the hand manipulation environments introduced in OpenAI's Gym
PPO with multi-head/autoregressive action outputs
Solving Complex Dexterous Manipulation Tasks with Trajectory Optimisation and Reinforcement Learning
Learning to play Settlers of Catan with Deep RL - custom training environment and implementation of PPO
Seeing something unexpected? Take a look at the
GitHub profile guide.