Create Home Position Reward #33
Conversation
Pull Request Test Coverage Report for Build 427141405 (Coveralls)
You have the same code in one of the other reviews, so those comments apply here as well. I can approve this PR, and we can resolve any remaining issues in a follow-up PR after this?
Since this is one of the earlier PRs, do you want me to merge this one first? I'm not sure if you left most of your comments in the other PRs or what would make this easiest for you.
I am okay with you merging this in, because I looked at the other PR before this one, so all my relevant comments are there.
These changes are present in another PR, and all of my relevant comments are on that PR. Approving this one on that basis.
Closes #24.
As mentioned before, this gives the maximal reward (~1) for the robot reaching the home position (quadrupedal standing). The key point is that the reward depends on both the height and the orientation, as shown in the line below:
gym_solo/gym_solo/core/rewards.py, line 146 (at commit 0c96a33)
Notice that the two terms are weighted and summed together. This functionality is also built into our rewards_factory, but I thought it was cleaner to implement it this way, since the maximum height for quadrupedal standing will differ from the bipedal one. However, if the relative weights between the two rewards need to be optimized, they will probably have to be split up so that W&B can tune them for us.

Note that #32 should probably be merged first, because it renames the test_rewards.py file, which this PR modifies. This PR is branched off of the previous one, so if #32 gets merged, this one shouldn't have any problems.
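The weighted-sum idea described above can be sketched roughly as follows. This is a minimal illustration only: the weights, target height, and function names here are assumptions for clarity, not the actual implementation in rewards.py.

```python
import math

# Assumed target base height (m) for quadrupedal standing -- illustrative only.
QUAD_STANDING_HEIGHT = 0.3

def height_reward(z: float, target: float = QUAD_STANDING_HEIGHT) -> float:
    """Reward in [0, 1]; equals 1 when the base is exactly at the target height."""
    return max(0.0, 1.0 - abs(z - target) / target)

def orientation_reward(roll: float, pitch: float) -> float:
    """Reward in [0, 1]; equals 1 when the base is flat (roll = pitch = 0)."""
    return max(0.0, 1.0 - (abs(roll) + abs(pitch)) / math.pi)

def home_position_reward(z: float, roll: float, pitch: float,
                         w_height: float = 0.5, w_orient: float = 0.5) -> float:
    """Weighted sum of the height and orientation terms.

    With weights summing to 1, the result approaches ~1 only when the robot
    is both at the standing height and level -- the behavior the PR describes.
    """
    return w_height * height_reward(z) + w_orient * orientation_reward(roll, pitch)
```

Splitting the two terms out like this (rather than hard-coding one combined expression) is what would let W&B sweep over `w_height` and `w_orient` independently if the relative weighting ever needs tuning.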