php-rl

A reinforcement learning library in PHP

Disclaimer

This library is my reimplementation of well-known RL algorithms, written in order to understand them better.

Algorithms

Value-based algorithms

SARSA

A standard State-Action-Reward-State-Action (SARSA) implementation based on a Q-table.
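For readers new to the algorithm, the tabular SARSA update can be sketched as below. This is a language-neutral illustration written in Python, not php-rl's API: the function name `sarsa_update`, the `(state, action)` dictionary keys, and the default values of `alpha` (learning rate) and `gamma` (discount factor) are all assumptions made for the example.

```python
from collections import defaultdict


def sarsa_update(q, state, action, reward, next_state, next_action,
                 alpha=0.1, gamma=0.9):
    """Apply one on-policy SARSA update to the Q-table `q` in place."""
    # The target bootstraps from the action the agent will actually take next.
    td_target = reward + gamma * q[(next_state, next_action)]
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical two-state example; unseen (state, action) pairs default to 0.0.
q = defaultdict(float)
q[("s1", "right")] = 1.0
new_value = sarsa_update(q, "s0", "right", reward=0.5,
                         next_state="s1", next_action="right")
```

Because the target uses the next action chosen by the behaviour policy, exploration moves are reflected in the learned values.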

Q-Learning

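Based on a Q-table and implemented as SARSA with a "max" policy: the update bootstraps from the highest-valued action in the next state. The current API provides a basic epsilon-greedy agent. See the Tic-Tac-Toe example (examples/tic-tac-toe) for some details.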
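The "max" update and epsilon-greedy action selection can be sketched as follows. This is an illustrative Python sketch rather than php-rl's API: `epsilon_greedy`, `q_learning_update`, and their parameters are hypothetical names, and treating a state with no next actions as terminal is an assumption of the example.

```python
import random
from collections import defaultdict


def epsilon_greedy(q, state, actions, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])


def q_learning_update(q, state, action, reward, next_state, next_actions,
                      alpha=0.1, gamma=0.9):
    """One Q-learning update: bootstrap from the best next action."""
    # With no next actions (terminal state), the future value is 0.
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
    return q[(state, action)]


q = defaultdict(float)
q[("s1", "left")] = 0.2
q[("s1", "right")] = 1.0
updated = q_learning_update(q, "s0", "right", 0.5, "s1", ["left", "right"])
greedy_action = epsilon_greedy(q, "s1", ["left", "right"], epsilon=0.0)
```

The only difference from SARSA is the target: it takes the maximum over next actions instead of the value of the action actually taken.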

Deep Q-Learning

The current API provides a basic epsilon-greedy agent with a separate target model, as described in Mnih, V., Kavukcuoglu, K., Silver, D. et al. Human-level control through deep reinforcement learning. Nature 518, 529–533 (2015). https://doi.org/10.1038/nature14236.
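The role of the separate target model can be illustrated with a minimal sketch. It is written in Python and is not php-rl's API: `LinearQ` is a hypothetical stand-in for a neural network, and `dqn_target`, `gamma`, and the sync-by-copy step are assumptions chosen to keep the example dependency-free.

```python
import copy


class LinearQ:
    """Tiny stand-in for a neural network: Q(s, a) = weights[a] . s"""

    def __init__(self, weights):
        self.weights = weights  # one weight vector per action

    def predict(self, state):
        return [sum(w * x for w, x in zip(row, state)) for row in self.weights]


def dqn_target(target_model, reward, next_state, done, gamma=0.99):
    """TD target computed from the frozen target model, not the online one."""
    if done:
        return reward
    return reward + gamma * max(target_model.predict(next_state))


online = LinearQ([[1.0, 0.0], [0.0, 1.0]])
target = copy.deepcopy(online)   # frozen snapshot of the online model

online.weights[0][0] = 5.0       # pretend a training step changed the online model
y_before_sync = dqn_target(target, reward=1.0, next_state=[2.0, 1.0], done=False)

target = copy.deepcopy(online)   # periodic sync, e.g. every N training steps
y_after_sync = dqn_target(target, reward=1.0, next_state=[2.0, 1.0], done=False)
```

Until the sync, training targets are unaffected by changes to the online model, which is what stabilises learning in the cited paper.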

Users can choose between a vanilla DQN and a Double DQN (see Hado van Hasselt, Arthur Guez, David Silver. Deep Reinforcement Learning with Double Q-learning. arXiv:1509.06461 [cs.LG]).
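The two variants differ only in how the training target is built. The sketch below, in Python and not taken from php-rl, contrasts them; the function names and the convention of passing next-state Q-values as plain lists are assumptions made for illustration.

```python
def vanilla_dqn_target(reward, q_target_next, done, gamma=0.99):
    """Vanilla DQN: the target model both selects and evaluates the action."""
    if done:
        return reward
    return reward + gamma * max(q_target_next)


def double_dqn_target(reward, q_online_next, q_target_next, done, gamma=0.99):
    """Double DQN: the online model selects, the target model evaluates."""
    if done:
        return reward
    best_action = max(range(len(q_online_next)), key=q_online_next.__getitem__)
    return reward + gamma * q_target_next[best_action]


# Hypothetical Q-values for the next state, one entry per action.
q_online_next = [1.0, 3.0, 2.0]
q_target_next = [4.0, 0.5, 1.0]

vanilla = vanilla_dqn_target(1.0, q_target_next, done=False)
double = double_dqn_target(1.0, q_online_next, q_target_next, done=False)
```

Here the vanilla target follows the target model's own highest estimate (action 0), while Double DQN evaluates the online model's choice (action 1), which is how it reduces overestimation.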

Experience replay is available in two distinct implementations:

- random minibatch
- prioritized experience replay (Tom Schaul, John Quan, Ioannis Antonoglou, David Silver. Prioritized Experience Replay. arXiv:1511.05952 [cs.LG], 2015)
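The difference between the two sampling strategies can be sketched as follows. This Python sketch is not php-rl's API, and it assumes the proportional variant of prioritization from the cited paper; which variant the library implements is not stated here. The names `sample_uniform` and `sample_prioritized`, the `alpha` and `beta` defaults, and normalizing weights by the largest weight in the batch are all assumptions of the example.

```python
import random


def sample_uniform(buffer, batch_size, rng):
    """Random minibatch: every stored transition is equally likely."""
    return rng.sample(buffer, batch_size)


def sample_prioritized(buffer, priorities, batch_size, rng, alpha=0.6, beta=0.4):
    """Proportional prioritization: P(i) = p_i^alpha / sum_k p_k^alpha."""
    scaled = [p ** alpha for p in priorities]
    total = sum(scaled)
    probs = [s / total for s in scaled]
    indices = rng.choices(range(len(buffer)), weights=probs, k=batch_size)
    # Importance-sampling weights correct the bias of non-uniform sampling.
    raw = [(len(buffer) * probs[i]) ** -beta for i in indices]
    max_w = max(raw)
    weights = [w / max_w for w in raw]
    return [buffer[i] for i in indices], indices, weights


rng = random.Random(0)
# Hypothetical transitions: (state, action, reward, next_state).
buffer = [("s0", "a", 0.0, "s1"), ("s1", "b", 1.0, "s2"),
          ("s2", "a", 0.0, "s3"), ("s3", "b", 5.0, "s4")]
priorities = [0.1, 1.0, 0.1, 5.0]  # e.g. absolute TD errors plus a small constant

uniform_batch = sample_uniform(buffer, 2, rng)
batch, indices, weights = sample_prioritized(buffer, priorities, 2, rng)
```

Transitions with a larger priority are replayed more often, and the returned weights scale each sample's loss to compensate.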

Policy-based algorithms

TODO

- ~~Q-learning~~
- ~~SARSA~~
- ~~DQN~~
- ~~Double DQN~~
- ~~[DQN] prioritized experience replay~~
- Vanilla Policy Gradient (REINFORCE)
- Actor-Critic
- real documentation :)
- more examples

