Thank you for the great work on this project!
I tried training the g1_ground task, and here are the results after 10,000 iterations. I noticed that the leg posture does not look very reasonable, and the knee joints seem to be close to the lower limit (around -0.087267).
In addition, within the regularization reward group, some of the rewards did not converge and instead showed a downward trend. I am not sure if this behavior is expected or if there might be an issue in my setup.
I’ve attached the reward plots below for reference.Could you please take a look and let me know what might be going wrong?
Thanks again for your efforts and contributions!
