batchRL-SI

This project applies Synaptic Intelligence (https://arxiv.org/abs/1703.04200) to batch reinforcement learning (batch-RL). A neural network approximates the Q-value function. The network is trained with batch-RL using experience replay, and the training is regularized with Synaptic Intelligence.

Regularizing with Synaptic Intelligence reduces the amount of experience required per batch and speeds up the convergence of batch-RL.
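
As a rough illustration of the idea (not the code in this repository), the sketch below shows how a Synaptic Intelligence penalty could be attached to one batch of fitted Q-learning. It assumes a tabular Q-function stored as a flat NumPy vector instead of a neural network, and the names `SIRegularizer` and `fitted_q_batch`, as well as the values of `c`, `xi`, `gamma`, `lr` and `steps`, are hypothetical choices made for this example.

```python
import numpy as np


class SIRegularizer:
    """Tracks Synaptic Intelligence importances for one flat parameter vector."""

    def __init__(self, theta, c=0.1, xi=1e-3):
        self.c = c                          # regularization strength (assumed value)
        self.xi = xi                        # damping term, avoids division by zero
        self.omega = np.zeros_like(theta)   # consolidated importance per parameter
        self.path = np.zeros_like(theta)    # path integral for the current batch
        self.theta_ref = theta.copy()       # parameters at the end of the previous batch

    def accumulate(self, grad, delta_theta):
        # Contribution of each parameter to the drop in the unregularized loss.
        self.path += -grad * delta_theta

    def consolidate(self, theta):
        # Normalize by the squared displacement over the batch.
        # Clipping negatives to zero is a choice made for this sketch.
        displacement = theta - self.theta_ref
        self.omega += np.maximum(self.path, 0.0) / (displacement ** 2 + self.xi)
        self.path[:] = 0.0
        self.theta_ref = theta.copy()

    def penalty(self, theta):
        return self.c * np.sum(self.omega * (theta - self.theta_ref) ** 2)

    def penalty_grad(self, theta):
        return 2.0 * self.c * self.omega * (theta - self.theta_ref)


def fitted_q_batch(theta, batch, reg, n_actions, gamma=0.9, lr=0.1, steps=100):
    """Run gradient steps on one batch of (s, a, r, s_next) transitions."""
    s, a, r, s_next = batch
    q = theta.reshape(-1, n_actions)
    targets = r + gamma * q[s_next].max(axis=1)   # targets are frozen for the batch
    idx = s * n_actions + a                       # flat index of each (s, a) pair
    for _ in range(steps):
        grad = np.zeros_like(theta)
        # Gradient of the unregularized 0.5 * mean squared TD error.
        np.add.at(grad, idx, (theta[idx] - targets) / len(idx))
        step = -lr * (grad + reg.penalty_grad(theta))
        reg.accumulate(grad, step)
        theta = theta + step
    reg.consolidate(theta)
    return theta
```

Calling `fitted_q_batch` once per collected batch, with the same `SIRegularizer` instance, lets the importances from earlier batches pull the parameters toward their previous values while new experience is fitted.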

mdp_data
README.md
__init__.py
agent.py
environment.py
generate_mdp.py
main.py
mdp_batchRL_agent.py
mdp_environment.py
mdp_main.ipynb
mdp_main.py
readme.txt
util.py
windy_gridworld.py
windy_gridworld_agent.py

shriramsb/batchRL-SI
