New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fails to run SAC from terminal. #184
Comments
The spinning up version of SAC only works with continuous action environments by default. LunarLander has discrete actions however LunarLanderContinuous-v2 uses continuous actions, just replacing LunarLander-v2 with LunarLanderContinuous-v2 should work! |
@ac-93 I actually read it in the documentation but I assumed it is continuous already - is there a simple way to predict whether a gym environment is discrete or continuous? |
@AsherYartsevTech Here is a table of OpenAI environments https://github.com/openai/gym/wiki/Table-of-environments, in the 'Action Space' column 'Box' effectively means continuous actions. There may be some other compatibility issues dependent on the environment, e.g. CarRacing-v0 uses image observations which are not compatible with the default Spinning up algorithms. |
@ac-93 You brought up an important issue for me: I actually planned to use your SAC implementation for a racing car model we built (with Unreal Engine 4) - can you think of a way of feeding the algorithm with features extracted from racing car's cameras? |
@AsherYartsevTech The most general way would be to pass the images directly into a neural net (probably a convolutional NN) and use that to map directly to actions. You could also create an observation space compiled of features that you extract from the images and then feed these as input to the NN, hopefully the features will contain enough information to make good action predictions. If you're using continuous actions then there's a slightly modified version of the spinning up SAC algo that works with image observations here https://github.com/ac-93/soft-actor-critic/tree/master/sac_cont_image. I think your original question is answered so should probably close this issue. It sounds like a fun project though so if you've got any more questions feel free to drop me an email! (alexchurch1993@gmail.com). |
Thanks a lot!! I'll probably consult with you in the near future! |
Hi Everyone!
I work in a PyCharm environment over Ubuntu 18.04
when I try to execute a variation on this command:
python -m spinup.run sac --hid "[32,32]" --env LunarLander-v2 --exp_name installtest --gamma 0.999
variation means I've tried different combinations of --env, with/without --gamma with/without --hid and its always the next Error Callstack
Here is the Error Callstack:
The text was updated successfully, but these errors were encountered: