Hide content and notifications from this user.
Contact Support about this user's behavior.
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
A toolkit for developing and comparing reinforcement learning algorithms.
Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.
A batch-optimized scaling manager for Kubernetes
Public facing notes page
Minimalist Chess Clock in Swift
Seeing something unexpected? Take a look at the
GitHub profile guide.