Q-Learning in Unity

A simple example implementation of Q-learning in Unity. To solve a reinforcement learning (RL) problem, an agent needs to learn the best action to take in each state it encounters. To do that, the Q-learning algorithm learns an estimate of the long-term reward the agent will receive for each state-action pair (s, a).
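The estimates for all state-action pairs are usually kept in a table with one row per state and one column per action. The following is a minimal sketch of such a table in plain C#, not the project's actual data structure: the class name `QTableSketch`, the jagged `float[][]` layout, and the 16-state size in the usage note are assumptions made for illustration. Only the four actions are taken from the code in this README.

```csharp
using System;
using System.Linq;

// Hypothetical helper, not part of the repository.
public static class QTableSketch
{
    // One row per state, one column per action; every estimate starts at 0.
    public static float[][] Create(int stateCount, int actionCount)
    {
        return Enumerable.Range(0, stateCount)
                         .Select(_ => new float[actionCount])
                         .ToArray();
    }

    // Index of the action with the highest estimated long-term reward in a state.
    // Ties resolve to the lowest action index.
    public static int BestAction(float[][] qTable, int state)
    {
        return Array.IndexOf(qTable[state], qTable[state].Max());
    }
}
```

For example, `QTableSketch.Create(16, 4)` would give a table for an assumed 4x4 grid with four movement actions, and `BestAction(table, s)` returns the greedy action for state `s`.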

Example

The Q-Learning Algorithm

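// Requires `using System.Linq;` for ToList() and Max().
int action;

//Epsilon-greedy action selection
if (Random.Range(0f, 1f) < ExplorationFactor)
{
    //Exploration: pick a random action (the int overload of Random.Range excludes the upper bound, so 0 to 3)
    action = Random.Range(0, 4);
}
else
{
    //Exploitation: pick the action with the highest Q-value in the current state
    action = QTable[currentState].ToList().IndexOf(QTable[currentState].Max());
}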

//Execute Action (Step the Environment)
var envResult = AgentMover.Move(action);

var oldQValue = QTable[currentState][action]; //Old Value in Q-Table
var nextMax = QTable[envResult.state].Max(); //Best Value for next state

//Calculate new Q-Value
var newQValue = oldQValue + LearningRate * (envResult.reward + DiscountFactor * nextMax - oldQValue);
QTable[currentState][action] = newQValue;

currentState = envResult.state;
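To show how this single step might sit inside a full training loop, here is a self-contained sketch that runs outside Unity. Everything environment-related is an assumption for illustration: `CorridorEnv` is a hypothetical stand-in for `AgentMover` (a five-cell corridor with the goal at the right end), the reward values and hyperparameter defaults are made up, and `System.Random` replaces `UnityEngine.Random`. The explicit terminal-state check is also an addition that the step above does not contain.

```csharp
using System;
using System.Linq;

// Hypothetical environment: a corridor of 5 cells, start at cell 0, goal at cell 4.
public sealed class CorridorEnv
{
    public const int StateCount = 5;
    public const int ActionCount = 2; // 0 = left, 1 = right

    public int State { get; private set; }

    public void Reset() => State = 0;

    // Returns the new state, the reward, and whether the goal was reached.
    public (int state, float reward, bool done) Move(int action)
    {
        int next = State + (action == 1 ? 1 : -1);
        State = Math.Max(0, Math.Min(StateCount - 1, next)); // walls at both ends
        bool done = State == StateCount - 1;
        return (State, done ? 1f : -0.01f, done); // small step cost, +1 at the goal
    }
}

public static class QLearningSketch
{
    public static float[][] Train(int episodes, float learningRate = 0.1f,
        float discountFactor = 0.9f, float explorationFactor = 0.2f, int seed = 0)
    {
        var rng = new Random(seed);
        var env = new CorridorEnv();
        var qTable = Enumerable.Range(0, CorridorEnv.StateCount)
                               .Select(_ => new float[CorridorEnv.ActionCount])
                               .ToArray();

        for (int episode = 0; episode < episodes; episode++)
        {
            env.Reset();
            int currentState = env.State;
            bool done = false;

            // Cap the episode length so an unlucky run cannot loop forever.
            for (int step = 0; step < 100 && !done; step++)
            {
                // Epsilon-greedy: explore with probability explorationFactor.
                int action = rng.NextDouble() < explorationFactor
                    ? rng.Next(CorridorEnv.ActionCount)
                    : Array.IndexOf(qTable[currentState], qTable[currentState].Max());

                var result = env.Move(action);
                done = result.done;

                float oldQValue = qTable[currentState][action];
                // No future reward is available after the terminal state.
                float nextMax = done ? 0f : qTable[result.state].Max();

                qTable[currentState][action] = oldQValue
                    + learningRate * (result.reward + discountFactor * nextMax - oldQValue);

                currentState = result.state;
            }
        }
        return qTable;
    }
}
```

Calling `QLearningSketch.Train(500)` returns the learned table. In this toy corridor the greedy action in every non-goal cell should end up being "right", since moving left only adds step cost.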

The core of the algorithm is based on this equation:
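Written in symbols that match the code above, the standard Q-learning update is shown below. This is a reconstruction from the update line in the code rather than a copy of the original figure, and the symbol choices are assumptions: α stands for `LearningRate`, γ for `DiscountFactor`, r for `envResult.reward`, and s' for `envResult.state`.

```latex
Q(s, a) \leftarrow Q(s, a) + \alpha \left( r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right)
```

As a worked example with made-up numbers: if Q(s, a) = 0.5, α = 0.1, r = 1, γ = 0.9, and the best value in the next state is 2, the new value is 0.5 + 0.1 · (1 + 0.9 · 2 − 0.5) = 0.5 + 0.1 · 2.3 = 0.73.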

Sebastian-Schuchmann/Q-Learning-in-Unity
