REINFORCEjs

REINFORCEjs is a Reinforcement Learning library that implements several common RL algorithms, all with web demos. In particular, the library currently includes:

Dynamic Programming methods
(Tabular) Temporal Difference Learning (SARSA/Q-Learning)
Deep Q-Learning: Q-Learning with neural-network function approximation
Stochastic/Deterministic Policy Gradients and Actor-Critic architectures for continuous action spaces (very alpha: likely buggy, or at the very least finicky and inconsistent)

See the main webpage for many more details, documentation and demos.

Code Sketch

The library exports two global variables: R and RL. R contains utilities for building expression graphs (e.g. LSTMs) and performing automatic backpropagation; it is a fork of my other project, recurrentjs. The RL object contains the current agent implementations:

RL.DPAgent for finite state/action spaces with environment dynamics
RL.TDAgent for finite state/action spaces
RL.DQNAgent for continuous state features but discrete actions
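
The phrase "with environment dynamics" in the first item means the transition and reward functions are known up front, so values can be computed by planning rather than by trial and error. As a rough, library-independent illustration (this is not RL.DPAgent's code), the sketch below runs value iteration on a made-up 4-cell corridor. The names `nextState`, `reward`, `valueIteration` and `greedyPolicy`, and all constants, are assumptions chosen for this example only.

```javascript
// Hypothetical corridor with 4 cells (states 0..3) and 2 actions (0 = left, 1 = right).
// All names and numbers here are assumptions for illustration, not part of REINFORCEjs.
var NUM_STATES = 4;
var NUM_ACTIONS = 2;
var GAMMA = 0.9; // discount factor

// Known dynamics: moving right from the last cell pays 1 and teleports back to cell 0.
function nextState(s, a) {
  if (a === 1) { return s === NUM_STATES - 1 ? 0 : s + 1; }
  return Math.max(0, s - 1); // moving left from cell 0 stays in cell 0
}
function reward(s, a) {
  return (s === NUM_STATES - 1 && a === 1) ? 1 : 0;
}

// One-step lookahead value of taking action a in state s under estimates V.
function actionValue(V, s, a) {
  return reward(s, a) + GAMMA * V[nextState(s, a)];
}

// Value iteration: sweep all states until the largest change drops below tol.
function valueIteration(tol) {
  var V = [];
  for (var i = 0; i < NUM_STATES; i++) { V.push(0); }
  var delta;
  do {
    delta = 0;
    for (var s = 0; s < NUM_STATES; s++) {
      var best = -Infinity;
      for (var a = 0; a < NUM_ACTIONS; a++) {
        best = Math.max(best, actionValue(V, s, a));
      }
      delta = Math.max(delta, Math.abs(best - V[s]));
      V[s] = best;
    }
  } while (delta >= tol);
  return V;
}

// Pick the action with the highest one-step lookahead value in each state.
function greedyPolicy(V) {
  var policy = [];
  for (var s = 0; s < NUM_STATES; s++) {
    var bestA = 0;
    for (var a = 1; a < NUM_ACTIONS; a++) {
      if (actionValue(V, s, a) > actionValue(V, s, bestA)) { bestA = a; }
    }
    policy.push(bestA);
  }
  return policy;
}

var V = valueIteration(1e-9);
var policy = greedyPolicy(V);
console.log(V, policy);
```

A TD or DQN agent would have to discover the same values from sampled transitions, because it never gets to call `nextState` or `reward` directly.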

A typical usage might look something like:

// create an environment object
var env = {};
env.getNumStates = function() { return 8; };
env.getMaxNumActions = function() { return 4; };

// create the DQN agent
var spec = { alpha: 0.01 }; // see full options on DQN page
var agent = new RL.DQNAgent(env, spec);

setInterval(function() { // start the learning loop
  var action = agent.act(s); // s is an array of length 8 (the current state features)
  // ... execute action in environment and get the reward
  agent.learn(reward); // the agent improves its Q, policy, model, etc. reward is a float
}, 0);
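
The snippet above leaves `s` and `reward` to the caller. To show where they could come from, here is a self-contained sketch that runs without the library. It is an assumption-laden stand-in, not the library's implementation: `TabularAgent` is a hypothetical tabular Q-learning agent that only mimics the `act`/`learn` call pattern, `toyEnv.step` is an invented helper, the state is a plain integer index rather than a feature array, and a plain `for` loop with a seeded generator replaces `setInterval` so that runs are repeatable.

```javascript
// Small seeded generator so runs are repeatable (an illustrative choice).
var seed = 42;
function rand() {
  seed = (Math.imul(seed, 1664525) + 1013904223) >>> 0;
  return seed / 4294967296;
}

// Hypothetical toy environment: a 4-cell corridor, actions 0 = left, 1 = right.
// Moving right from the last cell pays 1 and teleports back to cell 0.
var toyEnv = {
  state: 0,
  step: function(action) {
    if (action === 1 && this.state === 3) { this.state = 0; return 1; }
    this.state = action === 1 ? this.state + 1 : Math.max(0, this.state - 1);
    return 0;
  }
};

// Hypothetical stand-in agent with the same act/learn call pattern.
function TabularAgent(numStates, numActions, spec) {
  this.numActions = numActions;
  this.alpha = spec.alpha;     // learning rate
  this.gamma = spec.gamma;     // discount factor
  this.epsilon = spec.epsilon; // exploration probability
  this.Q = [];
  for (var s = 0; s < numStates; s++) {
    this.Q.push([]);
    for (var a = 0; a < numActions; a++) { this.Q[s].push(0); }
  }
  this.prevState = null;
  this.prevAction = null;
  this.pendingReward = null;
}

// Index of the highest-valued action in state s (ties go to the lowest index).
TabularAgent.prototype.greedy = function(s) {
  var best = 0;
  for (var a = 1; a < this.numActions; a++) {
    if (this.Q[s][a] > this.Q[s][best]) { best = a; }
  }
  return best;
};

// The Q-learning update needs the next state, so it is applied on the next act() call.
TabularAgent.prototype.act = function(s) {
  if (this.pendingReward !== null) {
    var target = this.pendingReward + this.gamma * this.Q[s][this.greedy(s)];
    var q = this.Q[this.prevState][this.prevAction];
    this.Q[this.prevState][this.prevAction] = q + this.alpha * (target - q);
    this.pendingReward = null;
  }
  var explore = rand() < this.epsilon;
  var a = explore ? Math.floor(rand() * this.numActions) : this.greedy(s);
  this.prevState = s;
  this.prevAction = a;
  return a;
};

// learn() only records the reward; act() consumes it once the next state is known.
TabularAgent.prototype.learn = function(reward) {
  this.pendingReward = reward;
};

var agent = new TabularAgent(4, 2, { alpha: 0.5, gamma: 0.9, epsilon: 0.5 });
for (var t = 0; t < 20000; t++) {
  var s = toyEnv.state;             // observe the current state
  var action = agent.act(s);        // choose an action
  var reward = toyEnv.step(action); // execute it and collect the reward
  agent.learn(reward);              // hand the reward back to the agent
}

// Greedy action per state after training.
var policy = [0, 1, 2, 3].map(function(st) { return agent.greedy(st); });
console.log(policy);
```

Deferring the update until the next `act` call is one way to make a `learn(reward)` signature workable, since the reward alone does not tell the agent which state it ended up in.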

The full documentation and demos are on the main webpage.

License

MIT.
