Skip to content

JuliaReinforcementLearning/ReinforcementLearning.jl

main
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
April 25, 2023 12:46
May 30, 2023 15:47
May 15, 2021 11:08
January 26, 2021 18:45
December 16, 2022 12:08
January 27, 2021 13:24
January 26, 2021 18:45
January 27, 2021 13:24
May 23, 2023 16:55
March 15, 2023 11:02

ReinforcementLearning.jl, as the name says, is a package for reinforcement learning research in Julia.

Our design principles are:

  • Reusability and extensibility: Provide elaborately designed components and interfaces to help users implement new algorithms.
  • Easy experimentation: Make it easy for new users to run benchmark experiments, compare different algorithms, evaluate and diagnose agents.
  • Reproducibility: Facilitate reproducibility from traditional tabular methods to modern deep reinforcement learning algorithms.

🏹 Get Started

julia> ] add ReinforcementLearning

julia> using ReinforcementLearning

julia> run(
           RandomPolicy(),
           CartPoleEnv(),
           StopAfterStep(1_000),
           TotalRewardPerEpisode()
       )

The above simple example demonstrates four core components in a general reinforcement learning experiment:

Check out the tutorial page to learn how these four components are assembled together to solve many interesting problems. We also write blog occasionally to explain the implementation details of some algorithms. Among them, the most recommended one is An Introduction to ReinforcementLearning.jl, which explains the design idea of this package. Besides, a collection of experiments are also provided to help you understand how to train or evaluate policies, tune parameters, log intermediate data, load or save parameters, plot results and record videos. For example:

🌲 Project Structure

ReinforcementLearning.jl itself is just a wrapper around several other subpackages. The relationship between them is depicted below:

+-----------------------------------------------------------------------------------+
|                                                                                   |
|  ReinforcementLearning.jl                                                         |
|                                                                                   |
|      +------------------------------+                                             |
|      | ReinforcementLearningBase.jl |                                             |
|      +----|-------------------------+                                             |
|           |                                                                       |
|           |     +--------------------------------------+                          |
|           +---->+ ReinforcementLearningEnvironments.jl |                          |
|           |     +--------------------------------------+                          |
|           |                                                                       |
|           |     +------------------------------+                                  |
|           +---->+ ReinforcementLearningCore.jl |                                  |
|                 +----|-------------------------+                                  |
|                      |                                                            |
|                      |     +-----------------------------+                        |
|                      +---->+ ReinforcementLearningZoo.jl |                        |
|                            +----|------------------------+                        |
|                                 |                                                 |
|                                 |     +-------------------------------------+     |
|                                 +---->+ DistributedReinforcementLearning.jl |     |
|                                       +-------------------------------------+     |
|                                                                                   |
+------|----------------------------------------------------------------------------+
       |
       |     +-------------------------------------+
       +---->+ ReinforcementLearningExperiments.jl |
       |     +-------------------------------------+
       |
       |     +----------------------------------------+
       +---->+ ReinforcementLearningAnIntroduction.jl |
             +----------------------------------------+

Getting Help

Are you looking for help with ReinforcementLearning.jl? Here are ways to find help:

  1. Read the online documentation! Most likely the answer is already provided in an example or in the API documents. Search using the search bar in the upper left.
  1. Chat with us in Julia Slack in the #reinforcement-learnin channel.
  2. Post a question in the Julia discourse forum in the category "Machine Learning" and use "reinforcement-learning" as a tag.
  3. For issues with unexpected behavior or defects in ReinforcementLearning.jl, then please open an issue on the ReinforcementLearning GitHub page with a minimal working example and steps to reproduce.

🖖 Supporting

ReinforcementLearning.jl is a MIT licensed open source project with its ongoing development made possible by many contributors in their spare time. However, modern reinforcement learning research requires huge computing resource, which is unaffordable for individual contributors. So if you or your organization could provide the computing resource in some degree and would like to cooperate in some way, please contact us!

This package is written in pure Julia. Please consider supporting the JuliaLang org if you find this package useful.

✍️ Citing

If you use ReinforcementLearning.jl in a scientific publication, we would appreciate references to the CITATION.bib.

Contributors

Thanks goes to these wonderful people (emoji key):


jbrea

💻 📖 🚧

Jun Tian

💻 📖 🚧 🤔

Aman Bhatia

📖

Alexander Terenin

💻

Sid-Bhatia-0

💻

norci

💻 🚧

Sriram

💻

Pavan B Govindaraju

💻

Alex Lewandowski

💻

Raj Ghugare

💻

Roman Bange

💻

Felix Chalumeau

💻

Rishabh Varshney

💻

Zachary Sunberg

💻 📖 🚧 🤔

Jonathan Laurent

🤔

Andriy Drozdyuk

📖

Ritchie Lee

🐛

Xirui Zhao

💻

Nerd

📖

Albin Heimerson

💻 📖 🚧

michelangelo21

🐛

GuoYu Yang

📖 💻 🐛

Prasidh Srikumar

💻

Ilan Coulon

💻

Jinrae Kim

📖 🐛

luigiannelli

🐛

Jacob Boerma

💻

Xavier Valcarce

🐛

Ashwani Rathee

💻

Goran Nakerst

💻

ultradian

📖

Ikko Ashimine

📖

Krishna Bhogaonker

🐛

Philipp A. Kienscherf

🐛

Stefan Krastanov

📖

LaarsOman

📖

Bo Lu

💻

Peter Chen

💻 📖

Shuhua Gao

💻 💬

johannes-fischer

💻

Tom Marty

🐛 💻

Abhinav Bhatia

🐛 💻

Harley Wiltzer

💻 📖 🐛

Dylan Asmar

💻

andreyzhitnikov

🐛

Andrea PIERRÉ

📖

Mo8it

💻

Benoît Legat

📖

Henri Dehaybe

💻 📖

NPLawrence

💻

Bileam Scheuvens

📖

Jarbus

🐛

tyleringebrand

🐛

baedan

💻

ll7

📖

Matthew LeMay

📖

Ludvig Killingberg

💻

This project follows the all-contributors specification. Contributions of any kind welcome!