Added A2C models for Ant-v3, Humanoid-v3 and Walker2d-v3 environments #17

Closed


pseudo-rnd-thoughts
Contributor

The SB3 benchmarks and the zoo are missing A2C models for the Ant, Humanoid and Walker2d environments.

This PR adds A2C models for those environments.

@araffin
Member

araffin commented May 18, 2022

Hello,
thanks.
I think I will actually wait for the Hugging Face integration, DLR-RM/rl-baselines3-zoo#198 (same as DLR-RM/rl-baselines3-zoo#244).

@pseudo-rnd-thoughts
Contributor Author

No worries

@araffin
Member

araffin commented May 20, 2022

Basic version is ready to review ;)
DLR-RM/rl-baselines3-zoo#250

An example: https://huggingface.co/sb3/ppo-MountainCarContinuous-v0

@pseudo-rnd-thoughts
Contributor Author

Sorry, I didn't totally understand what you want me to do

@araffin
Member

araffin commented May 21, 2022

> Sorry, I didn't totally understand what you want me to do

Sorry, we are going to replace the trained agents repo with the Hugging Face Hub. The PR for that is ready, and you can use it to upload models ;)

@araffin
Member

araffin commented Jun 10, 2022

Closing in favor of DLR-RM/rl-baselines3-zoo#262
Models are now uploaded to the Hugging Face Hub, see https://huggingface.co/sb3/a2c-Walker2d-v3 for instance.
Btw, the pre-trained models do not have good performance and are missing some hyperparameter tuning (as expected, that was the reason I didn't upload them in the first place).
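
For reference, the two hub links in this thread (`sb3/ppo-MountainCarContinuous-v0` and `sb3/a2c-Walker2d-v3`) suggest a `sb3/<algo>-<env id>` naming pattern. The sketch below only builds the page URLs that pattern would give for the three environments in this PR. The helper names are hypothetical, and whether a model page exists for Ant-v3 or Humanoid-v3 is an assumption, not something this thread confirms.

```python
# Minimal sketch: build Hugging Face Hub page URLs from an algorithm name
# and an environment id, following the pattern seen in the two links above.
# hub_repo_id and hub_url are hypothetical helpers, not part of any library.

HUB_BASE = "https://huggingface.co"
ORG = "sb3"  # organisation used in both links quoted in this thread


def hub_repo_id(algo: str, env_id: str, org: str = ORG) -> str:
    """Return '<org>/<algo>-<env_id>', with the algorithm name lowercased."""
    return f"{org}/{algo.lower()}-{env_id}"


def hub_url(algo: str, env_id: str, org: str = ORG) -> str:
    """Return the model page URL for the given algorithm and environment."""
    return f"{HUB_BASE}/{hub_repo_id(algo, env_id, org)}"


# Environments named in the PR title. Only the Walker2d-v3 link is quoted
# in this thread; the other two are assumed to follow the same pattern.
ENVS = ["Ant-v3", "Humanoid-v3", "Walker2d-v3"]

if __name__ == "__main__":
    for env_id in ENVS:
        print(hub_url("a2c", env_id))
```

The repo id returned by `hub_repo_id` is the kind of value a hub download helper would take. The download step itself is left out here because it needs network access and extra packages.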

@araffin araffin closed this Jun 10, 2022