Slime env compatible with SB3 #5

vamsianumula · 2023-02-22T05:20:37Z

Updated Slime environment that is compatible with Stable Baselines3. This PR has added/modified:

slime_environments/environments/SlimeEnvSingleAgent.py: Modify Slime env
env-test-gym.py: Tests env compatibility with SB3
slime_environments/agents/SA_QLearning.py: Modify the Q-learning algo with updated env
single-test-03-2023-02-22 13:54:51.756431.csv: Test run of env with Q-learning

smarianimore

This is great, thanks!
However, I'd like to ask for an addition as a further "guarantee" of compliance: would you mind adding a test script similar to SA_QLearning.py where any single agent learning algorithm of your choice from stable-baselines3 is tested?
This way we are pretty sure that "everything works".

smarianimore

Maybe function observation_to_int_map in SA_QLearning.py is not necessary anymore with the new observation space?

smarianimore

You removed the additional boolean I put in observations with comment DOC Gym v26 has additional..... Is it because they changed it again or because stable-baselines3 is not ready for Gymnasium?
In the latter case I think we should rely on this PR and not on the pip version of stable-baselines3, so that our code will be ready when the next version of stable-baselines3 hits.
Instructions on how to use it are in the PR itself.

smarianimore

I've seen you removed type hints, is there a specific reason?

vamsianumula · 2023-02-28T01:16:30Z

This is great, thanks! However, I'd like to ask for an addition as a further "guarantee" of compliance: would you mind adding a test script similar to SA_QLearning.py where any single agent learning algorithm of your choice from stable-baselines3 is tested? This way we are pretty sure that "everything works".

Sure, I will do that.

Maybe function observation_to_int_map in SA_QLearning.py is not necessary anymore with the new observation space?

I think we need it, because the mapped integer is used as a key in the custom Q-learning implementation, right? Or do you mean for SB3's Q-learning?
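To illustrate the point about the mapped integer being used as a key, here is a minimal sketch of a tabular Q-learning update. It is not the code from SA_QLearning.py: the name `encode_observation`, the assumption that an observation is a short sequence of booleans, and the action count are all hypothetical.

```python
from collections import defaultdict
from typing import DefaultDict, List, Sequence

N_ACTIONS = 3  # hypothetical number of actions


def encode_observation(obs: Sequence[bool]) -> int:
    """Pack a sequence of booleans into one integer (first element = lowest bit)."""
    key = 0
    for i, flag in enumerate(obs):
        if flag:
            key |= 1 << i
    return key


def make_q_table() -> DefaultDict[int, List[float]]:
    """Q-table keyed by the encoded observation, one value per action."""
    return defaultdict(lambda: [0.0] * N_ACTIONS)


def q_update(q_table, obs, action, reward, next_obs, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update on the integer-keyed table."""
    s = encode_observation(obs)
    s_next = encode_observation(next_obs)
    td_target = reward + gamma * max(q_table[s_next])
    q_table[s][action] += alpha * (td_target - q_table[s][action])
    return q_table[s][action]
```

A table-based learner needs a hashable state key like this one, which is why the mapping may still be useful even if the new observation space no longer requires it.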

You removed the additional boolean I put in observations with comment DOC Gym v26 has additional..... Is it because they changed it again or because stable-baselines3 is not ready for Gymnasium? In the latter case I think we should rely on this PR and not on the pip version of stable-baselines3, so that our code will be ready when the next version of stable-baselines3 hits. Instructions on how to use it are in the PR itself.

Yes, SB3 supports only gym v21 for now, not v26. I didn't know about that PR, I will look into it.
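For context on the v21/v26 difference: in Gym v26 (and Gymnasium), `step` returns `(obs, reward, terminated, truncated, info)` and `reset` returns `(obs, info)`, while the v21 API returns `(obs, reward, done, info)` and a bare `obs`. The sketch below shows one way to collapse the newer return values into the older shape. The helper names are hypothetical and this is not code from the PR.

```python
from typing import Any, Dict, Tuple


def step_v26_to_v21(result: Tuple[Any, float, bool, bool, Dict]) -> Tuple[Any, float, bool, Dict]:
    """Collapse a v26-style step result into the v21-style 4-tuple."""
    obs, reward, terminated, truncated, info = result
    # v21 has a single flag: the episode is over for either reason.
    done = bool(terminated or truncated)
    return obs, reward, done, info


def reset_v26_to_v21(result: Tuple[Any, Dict]) -> Any:
    """Drop the info dict that v26-style reset returns alongside the observation."""
    obs, _info = result
    return obs
```

Going the other way (v21 to v26) loses information, since a single `done` flag cannot say whether the episode terminated or was truncated.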

I've seen you removed type hints, is there a specific reason?

Initially, they were causing some errors, so I tried without them. I will restore them now.

smarianimore · 2023-03-07T09:54:02Z

Maybe function observation_to_int_map in SA_QLearning.py is not necessary anymore with the new observation space?

I think we need it, because the mapped integer is used as a key in the custom Q-learning implementation, right? Or do you mean for SB3's Q-learning?

You're right, let's leave it there for the moment. I will check whether the custom Q-learning implementation can be updated for the new observation space so that we can get rid of it.

Thanks for all your work :)

vamsianumula · 2023-03-10T11:37:10Z

Maybe function observation_to_int_map in SA_QLearning.py is not necessary anymore with the new observation space?

I think we need it, because the mapped integer is used as a key in the custom Q-learning implementation, right? Or do you mean for SB3's Q-learning?

You're right, let's leave it there for the moment. I will check whether the custom Q-learning implementation can be updated for the new observation space so that we can get rid of it.

Thanks for all your work :)

No problem. I have been a little busy this past week. I will push the requested changes soon.

vamsianumula · 2023-03-15T02:15:53Z

This is great, thanks! However, I'd like to ask for an addition as a further "guarantee" of compliance: would you mind adding a test script similar to SA_QLearning.py where any single agent learning algorithm of your choice from stable-baselines3 is tested? This way we are pretty sure that "everything works".

Sure, I will do that.

I have added the test training code in env-test-gym.py and it's working fine.
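As a rough idea of what an interface check can look like without any library installed, here is a small smoke test for a v21-style environment. It is only a sketch: `DummyEnv` is a hypothetical stand-in for the Slime env and `smoke_test` is not the code in env-test-gym.py. For a real check, stable-baselines3 provides `check_env` in `stable_baselines3.common.env_checker`, and a short training run with an SB3 algorithm remains the stronger test.

```python
import random


class DummyEnv:
    """Hypothetical stand-in for the Slime env, following the Gym v21 API."""

    def __init__(self, episode_length=5):
        self.episode_length = episode_length
        self.n_actions = 3  # hypothetical action count
        self._t = 0

    def reset(self):
        self._t = 0
        return 0  # v21-style reset returns only the observation

    def step(self, action):
        self._t += 1
        obs = self._t % 2
        reward = 1.0 if action == 0 else 0.0
        done = self._t >= self.episode_length
        return obs, reward, done, {}


def smoke_test(env, n_steps, seed=0):
    """Run random actions, check the step signature, and count finished episodes."""
    rng = random.Random(seed)
    env.reset()
    episodes = 0
    for _ in range(n_steps):
        result = env.step(rng.randrange(env.n_actions))
        assert len(result) == 4, "expected (obs, reward, done, info)"
        _obs, _reward, done, info = result
        assert isinstance(done, bool)
        assert isinstance(info, dict)
        if done:
            episodes += 1
            env.reset()
    return episodes
```

With an episode length of 5, running 12 steps completes two episodes and leaves a third in progress.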

I've seen you removed type hints, is there a specific reason?

Initially, they were causing some errors, so I tried without them. I will restore them now.

I have restored the hints.

@smarianimore please let me know if you have any more suggestions, and what the next steps for this project are.

vamsianumula and others added 5 commits February 21, 2023 10:30

Init commit

d4b8652

SB3 compatible version

ece66ad

Remove requirements.txt

5f052af

Add requirements.txt

fce05e5

Modify with updated env

7810852

vamsianumula mentioned this pull request Feb 22, 2023

Stable-baselines3 compatibility #2

Closed

vamsianumula changed the title ~~Vamsi dev~~ Slime env compatible with SB3 Feb 22, 2023

smarianimore requested changes Feb 27, 2023

smarianimore reviewed Feb 27, 2023

vamsianumula added 2 commits March 15, 2023 11:09

Test training with SB3

94963ef

Restore parameter types

53a263b

smarianimore merged commit b86baa9 into LucaInis:sm-baselines-api Apr 4, 2023
