Skip to content

lincolnmroth/basketball-RL

main
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 

Reinforcement Learning Basketball

Robot Arm learning basketball using NFQ and Q-Learning

basketball

Methods

Q-Learning (Q Table)

Q-learning is slightly rudimentary but it had quite successful results. It uses a table of all possible states and uses the following equuation to explore and discover an optimal policy:

image

NFQ (Neural Fitted Q-Iteration)

NFQ uses a neural network to learn the Q values.

First a bunch of data is created using a random policy. Then a 2-layer neural net using PyTorch and a RPROP optimizer is created. Training was done target seen in the algorithm below. image

Conclusions and Future

There were a lot of issues and there are still are. Firstly and most easily fixed is to change to a dynamic alpha and exploration value (epsilon) in the q-table variant. Much much more work can be done on the nfq side to create a better structured neural net as well tune the other various parameters.

About

Robot Arm learning basketball using NFQ and Q-Learning

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages