Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] Why is the SARSA algorithm not available in Stable Baselines 3 #786

Closed
PBerit opened this issue Feb 23, 2022 · 5 comments
Closed
Labels
question Further information is requested

Comments

@PBerit
Copy link

PBerit commented Feb 23, 2022

Actually I am new in the field of Reinforcement Learning and I have often encoutered the SARSA algorithms in books, tutorials, videos etc.. It seems to be a very popular on-policy learning algorithm for Reinforcement learning. However, it noticed that it is not available in Stable Baselines 3. Is there a specific reason for that and is it planned to be included in future updates of Stable Baselines 3?

@PBerit PBerit added the question Further information is requested label Feb 23, 2022
@Miffyli
Copy link
Collaborator

Miffyli commented Feb 23, 2022

stable-baselines3 is mainly for "deep" reinforcement learning algorithms, where algorithms like A2C and DQN and PPO are the prominent "baselines". While SARSA is applicable, maybe as an modification of DQN, it has not been used much in deep learning literature, and has not received enough attention for somebody to add it to SB3. However if you feel like experiment, we could review a PR to add it to our contrib package :).

@araffin
Copy link
Member

araffin commented Feb 23, 2022

Hello,
I would consider SARSA out of scope of SB3 (because we mostly focus on DeepRL and not tabular RL), however you can take a look at Mushroom RL if you are looking for implementation of such algorithm ;)

@PBerit
Copy link
Author

PBerit commented Feb 24, 2022

Thanks for your answers.

@PBerit PBerit closed this as completed Feb 24, 2022
@RylanSchaeffer
Copy link

RylanSchaeffer commented Mar 22, 2022

@araffin can you highlight the differences between SB3 and Mushroom RL?

Edit: Would you agree with their chart here?

image

@araffin
Copy link
Member

araffin commented Mar 22, 2022

can you highlight the differences between SB3 and Mushroom RL?

i recommend you to read our paper/blog post (link is in the readme), we also have an issue here: #20

The table you are showing is about Stable baselines (SB2), not SB3.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

4 participants