Algorithm Toolkit

A set of agents and learning algorithms inspired by cognitive neuroscience.

The toolkit implements the canonical Q-Learning, Actor-Critic, Value-Iteration, and Successor Representation algorithms.

All algorithms are "tabular": they work with observations that are integer representations of the agent's state. This corresponds to the index observation type.
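To make "tabular" concrete, here is a minimal sketch (not the toolkit's own code) of a value table indexed directly by an integer state. The table sizes, the `greedy_action` helper, and the 5x5 grid flattening are assumptions chosen only for illustration.

```python
import numpy as np

# Hypothetical sizes, chosen only for illustration.
n_states, n_actions = 25, 4

# One row per integer state index, one column per action.
q_table = np.zeros((n_states, n_actions))

def greedy_action(q_table, state):
    """Return the highest-valued action for an integer state index."""
    return int(np.argmax(q_table[state]))

# Assumption: a 5x5 grid position (row 2, col 3) flattened to one integer.
state = int(np.ravel_multi_index((2, 3), (5, 5)))

# Pretend the agent has learned that action 1 is valuable in this state.
q_table[state, 1] = 1.0
action = greedy_action(q_table, state)
```

Because the observation is already an integer, no feature extraction or function approximation is needed: the observation is simply used as a row index.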

Temporal Difference Algorithms

The implementations of the TD algorithms can be found in td_agents.py.

TD-Q
TD-SR
TD-AC
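As a rough illustration of the updates behind the TD-Q and TD-SR entries, the sketch below shows the textbook one-step rules. The function names, argument order, default hyperparameters, and the state-based form of the successor matrix are assumptions and may differ from what td_agents.py actually does.

```python
import numpy as np

def td_q_update(Q, s, a, r, s_next, done, lr=0.1, gamma=0.99):
    """One-step Q-learning update on a table Q of shape (states, actions)."""
    target = r if done else r + gamma * np.max(Q[s_next])
    td_error = target - Q[s, a]
    Q[s, a] += lr * td_error
    return td_error

def td_sr_update(M, s, s_next, done, lr=0.1, gamma=0.99):
    """One-step TD update of a successor matrix M of shape (states, states)."""
    one_hot = np.eye(M.shape[0])[s]          # current state occupancy
    bootstrap = 0.0 if done else gamma * M[s_next]
    M[s] += lr * (one_hot + bootstrap - M[s])

# Example transition: state 0, action 1, reward 1.0, next state 1.
Q = np.zeros((3, 2))
err = td_q_update(Q, 0, 1, 1.0, 1, done=False)

M = np.zeros((3, 3))
td_sr_update(M, 0, 1, done=False)
```

With a successor matrix of this form, state values can be read out as `M @ R` for a per-state reward vector `R`, which is what makes the successor representation quick to re-evaluate when rewards change.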

Dyna Algorithms

The implementations of the Dyna algorithms can be found in dyna_agents.py.

Dyna-Q
Dyna-SR
Dyna-AC
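The Dyna idea is to interleave learning from real experience with extra updates replayed from a learned model. The following is a minimal Dyna-Q sketch under stated assumptions: a deterministic dictionary model, uniform sampling of remembered transitions, and hypothetical names (`q_update`, `dyna_q_step`, `n_planning`) that are not taken from dyna_agents.py.

```python
import random
import numpy as np

def q_update(Q, s, a, r, s_next, lr=0.1, gamma=0.99):
    """Plain one-step Q-learning update."""
    Q[s, a] += lr * (r + gamma * np.max(Q[s_next]) - Q[s, a])

def dyna_q_step(Q, model, transition, n_planning=5, rng=None):
    """Learn from one real transition, then replay n_planning modeled ones."""
    rng = rng or random.Random(0)
    s, a, r, s_next = transition
    q_update(Q, s, a, r, s_next)       # direct learning from experience
    model[(s, a)] = (r, s_next)        # assumption: deterministic environment
    for _ in range(n_planning):
        # Sample a previously seen (state, action) pair and replay it.
        (ps, pa), (pr, ps_next) = rng.choice(list(model.items()))
        q_update(Q, ps, pa, pr, ps_next)

Q = np.zeros((3, 2))
model = {}
dyna_q_step(Q, model, (0, 1, 1.0, 1), n_planning=2)
```

In this example the single real transition is effectively applied three times (once directly, twice in planning), so the value estimate moves further than a single TD step would allow.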

Model Based Algorithms

The implementations of the model-based algorithms can be found in mb_agents.py.

Value Iteration (MBV)
TDSR / Value Iteration Hybrid (SRMB)
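For reference, value iteration repeatedly applies the Bellman optimality backup to a known model until the values stop changing. The sketch below assumes a transition tensor `T[s, a, s']` and a reward table `R[s, a]`. These shapes, the function name, and the toy two-state problem are illustrative assumptions rather than the interface used in mb_agents.py.

```python
import numpy as np

def value_iteration(T, R, gamma=0.9, tol=1e-8, max_iters=1000):
    """Return optimal state values and a greedy policy.

    T: transition probabilities, shape (states, actions, states).
    R: expected immediate reward, shape (states, actions).
    """
    V = np.zeros(T.shape[0])
    for _ in range(max_iters):
        Q = R + gamma * (T @ V)          # Bellman backup, shape (states, actions)
        V_new = Q.max(axis=1)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    return V, Q.argmax(axis=1)

# Toy problem: state 1 is absorbing; from state 0, action 1 moves to
# state 1 and earns reward 1, while action 0 stays put for no reward.
T = np.zeros((2, 2, 2))
T[0, 0, 0] = 1.0
T[0, 1, 1] = 1.0
T[1, :, 1] = 1.0
R = np.zeros((2, 2))
R[0, 1] = 1.0

V, policy = value_iteration(T, R)
```

Unlike the TD and Dyna sketches, nothing here is sampled: the full model is swept on every iteration, which is practical only because the state space is tabular.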
