Reinforcement Learning using SARSA Algorithm

Description

This project was part of the Artificial Intelligence Programming (IT3105) course at NTNU in spring 2022. The aim was to create a reinforcement learning system that uses the SARSA algorithm to learn to play a simple physics game, which we refer to as the Acrobat Game. The game simulates an acrobat holding on to a horizontal bar with their hands.

This YouTube video briefly explains the project and its results.

Architecture

The RL system consists of a Critic implemented using a neural network and the SARSA algorithm itself, which handles the actual learning and communication with the simulation world. The diagram below shows the structure of the SARSA RL system:
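To make the learning rule concrete, here is a minimal sketch of the on-policy SARSA update. It is not the project's code: a plain dictionary stands in for the neural-network critic, epsilon-greedy action selection is an assumed policy, and the names `ACTIONS`, `epsilon_greedy`, `sarsa_update`, `alpha`, `gamma`, and `epsilon` are hypothetical choices made for illustration.

```python
import random
from collections import defaultdict

# Assumed action encoding: leftward force, no force, rightward force.
ACTIONS = (-1, 0, 1)


def epsilon_greedy(q, state, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def sarsa_update(q, s, a, reward, s_next, a_next,
                 alpha=0.1, gamma=0.9, done=False):
    """Apply one SARSA step: Q(s, a) += alpha * (target - Q(s, a))."""
    # On-policy target: it uses the action a_next actually chosen in s_next,
    # not the maximum over actions (which would be Q-learning).
    target = reward if done else reward + gamma * q[(s_next, a_next)]
    td_error = target - q[(s, a)]
    q[(s, a)] += alpha * td_error
    return td_error


# Hypothetical table stand-in for the critic; unseen pairs default to 0.0.
q = defaultdict(float)
q[("s1", 1)] = 2.0
td = sarsa_update(q, "s0", 0, reward=-1.0, s_next="s1", a_next=1)
```

In a full training loop, `a_next` would come from `epsilon_greedy` before the update, and the table lookup would be replaced by a forward pass through the critic network.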

Simulation World

The acrobat world consists of two line segments representing rods. The upper rod is attached to a pivot point at its upper end and can rotate freely around this point. The lower and upper rods are connected tip to tip at a second pivot point, which allows the lower rod to move and rotate.

At any given time, the actor has three possible actions: it can apply a rightward or leftward force at the lower pivot, or it can do nothing.

The actor’s goal is to apply forces to the pivot point in such a way that the tip of the lower rod reaches a certain height above the upper pivot point indicated by a dotted line.
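The goal condition can be sketched with basic trigonometry. The following assumes a parameterization that this README does not state: `theta1` is the upper rod's angle from the downward vertical, `theta2` is the lower rod's angle relative to the upper rod, and `len1`, `len2`, and `goal_height` are hypothetical rod lengths and target height. The function names are illustrative as well.

```python
import math


def tip_height(theta1, theta2, len1=1.0, len2=1.0):
    """Height of the lower rod's tip relative to the upper pivot (positive = above)."""
    # A rod hangs straight down at angle 0, so its vertical extent is -len * cos(angle).
    return -len1 * math.cos(theta1) - len2 * math.cos(theta1 + theta2)


def goal_reached(theta1, theta2, goal_height=1.0, len1=1.0, len2=1.0):
    """True once the tip is at or above the dotted goal line."""
    return tip_height(theta1, theta2, len1, len2) >= goal_height


print(goal_reached(0.0, 0.0))      # both rods hanging straight down
print(goal_reached(math.pi, 0.0))  # both rods pointing straight up
```

With unit-length rods, the hanging configuration puts the tip two units below the pivot, so the check fails; with both rods pointing up, the tip is two units above it and the check succeeds.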

Usage

To run this program, download or clone the repository and run main.py using Python 3.9 or higher.

python3 main.py

Requirements

Python 3.9 or higher
TensorFlow
NumPy
Matplotlib

pip install tensorflow numpy matplotlib

Results

The AI's performance before any training:

After training:
