-
Notifications
You must be signed in to change notification settings - Fork 337
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
env.step(env.action_space.sample()) vs env.pos #27
Comments
Hi @MegaCreater, I should double check your numbers and slicing but one sure reason why the
You can use CtrlAviary for unbounded, raw obs .
|
Hi @JacopoPan , |
Because kinematic information is retrieved from PyBullet in:
Which is called at every step:
And those value are used to build a state:
But then that state is clipped to max, min values and divided by ranges in: gym-pybullet-drones/gym_pybullet_drones/envs/single_agent_rl/BaseSingleAgentAviary.py Line 355 in 7688e72
Before being returned as an obs .If you spot any numerical mistake please let me know but I wouldn't expect obs and env.pos to contain the same values.
|
Thanks @JacopoPan , |
Hello,
I am very new to this. A want to use TensorFlow-agents to train models as I don't know much about
stable-baselines3
. I had generated environment usingTakeoffAviary.py
.`
as
obs, reward, done, info = env.step(env.action_space.sample())
. So whyobs[:3] != env.pos[0]
orobs[3:6]!=env.rpy[0]
similarly for velocity and angular velocity also ? (obs-space is Box(12,) -> x,y,z,r,p,y,vx,vy,vz,wx,wy,wz receptively (in world frame)? (Is i am correct?)). What does env.pos or env.rpy or env.ang_v or env.vel returns ?The text was updated successfully, but these errors were encountered: