[Question] Why is the SARSA algorithm not available in Stable Baselines 3 #786

PBerit · 2022-02-23T17:00:32Z

I am new to the field of reinforcement learning and have often encountered the SARSA algorithm in books, tutorials, videos, etc. It seems to be a very popular on-policy learning algorithm. However, I noticed that it is not available in Stable Baselines 3. Is there a specific reason for that, and is it planned for inclusion in a future update of Stable Baselines 3?

Miffyli · 2022-02-23T17:10:09Z

stable-baselines3 is mainly for "deep" reinforcement learning algorithms, where A2C, DQN and PPO are the prominent "baselines". While SARSA is applicable, perhaps as a modification of DQN, it has not been used much in the deep learning literature and has not received enough attention for somebody to add it to SB3. However, if you feel like experimenting, we could review a PR to add it to our contrib package :).
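
To illustrate what "a modification of DQN" could mean here, the sketch below contrasts the two bootstrap targets. This is not SB3 code: the function names and arguments are hypothetical, and it assumes the next action has already been sampled from the behaviour policy, as SARSA requires.

```python
def q_learning_target(reward, next_q_values, gamma, done):
    """DQN-style (off-policy) target: bootstrap from the greedy next action."""
    bootstrap = 0.0 if done else max(next_q_values)
    return reward + gamma * bootstrap


def sarsa_target(reward, next_q_values, next_action, gamma, done):
    """SARSA (on-policy) target: bootstrap from the action actually taken next."""
    bootstrap = 0.0 if done else next_q_values[next_action]
    return reward + gamma * bootstrap


# Hypothetical Q-value estimates for the next state, one entry per action.
next_q = [0.5, 2.0]
greedy = q_learning_target(1.0, next_q, gamma=0.9, done=False)
on_policy = sarsa_target(1.0, next_q, next_action=0, gamma=0.9, done=False)
```

The two targets coincide whenever the sampled next action happens to be the greedy one; they differ only on exploratory steps.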

araffin · 2022-02-23T17:37:58Z

Hello,
I would consider SARSA out of scope for SB3 (because we mostly focus on deep RL and not tabular RL); however, you can take a look at Mushroom RL if you are looking for an implementation of such an algorithm ;)
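
For readers unfamiliar with the tabular setting mentioned above, here is a minimal sketch of tabular SARSA in plain Python. It is not taken from SB3 or Mushroom RL; the chain environment, the function names and the hyperparameters are all assumptions made up for illustration.

```python
import random


def epsilon_greedy(q_row, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one (random tie-break)."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    best = max(q_row)
    return rng.choice([a for a, v in enumerate(q_row) if v == best])


def step(state, action, n_states):
    """Hypothetical chain: action 1 moves right, action 0 moves left.
    Reaching the last state yields reward 1 and ends the episode."""
    next_state = min(state + 1, n_states - 1) if action == 1 else max(state - 1, 0)
    done = next_state == n_states - 1
    return next_state, (1.0 if done else 0.0), done


def train_sarsa(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
                epsilon=0.1, seed=0, max_steps=200):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # Q-table: one row per state
    for _ in range(episodes):
        state = 0
        action = epsilon_greedy(q[state], epsilon, rng)
        for _ in range(max_steps):
            next_state, reward, done = step(state, action, n_states)
            # On-policy: the next action is chosen by the same policy being learned.
            next_action = epsilon_greedy(q[next_state], epsilon, rng)
            target = reward + (0.0 if done else gamma * q[next_state][next_action])
            q[state][action] += alpha * (target - q[state][action])
            state, action = next_state, next_action
            if done:
                break
    return q


q_table = train_sarsa()
```

Because the update uses the Q-value of the action actually selected in the next state, the learned values reflect the epsilon-greedy behaviour policy rather than the greedy one.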

PBerit · 2022-02-24T16:39:11Z

Thanks for your answers.

RylanSchaeffer · 2022-03-22T20:06:01Z

@araffin can you highlight the differences between SB3 and Mushroom RL?

Edit: Would you agree with their chart here?

araffin · 2022-03-22T20:29:56Z

> can you highlight the differences between SB3 and Mushroom RL?

I recommend reading our paper/blog post (the link is in the README); we also have an issue here: #20

The table you are showing is about Stable Baselines (SB2), not SB3.

PBerit added the question Further information is requested label Feb 23, 2022

PBerit closed this as completed Feb 24, 2022

araffin mentioned this issue Jun 30, 2023

[Question] Would you like a pull request implementing classical tabular RL algorithms ? Stable-Baselines-Team/stable-baselines3-contrib#193

Open

4 tasks
