[BugFix] Fix reward sum within parallel envs #1454
Merged
facebook-github-bot added the CLA Signed label on Aug 9, 2023. This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
3 tasks
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_single | 0.1187s | 0.1181s | 8.4662 Ops/s | 8.4917 Ops/s | |
test_sync | 0.1232s | 65.9804ms | 15.1560 Ops/s | 14.6966 Ops/s | |
test_async | 0.1782s | 61.2057ms | 16.3383 Ops/s | 16.2376 Ops/s | |
test_simple | 0.5840s | 0.5278s | 1.8947 Ops/s | 1.8573 Ops/s | |
test_transformed | 1.3627s | 1.3220s | 0.7564 Ops/s | 0.7384 Ops/s | |
test_serial | 1.6806s | 1.6400s | 0.6097 Ops/s | 0.5890 Ops/s | |
test_parallel | 1.7371s | 1.4892s | 0.6715 Ops/s | 0.6889 Ops/s | |
test_step_mdp_speed[True-True-True-True-True] | 0.3417ms | 41.9488μs | 23.8386 KOps/s | 23.2842 KOps/s | |
test_step_mdp_speed[True-True-True-True-False] | 92.4000μs | 23.9487μs | 41.7559 KOps/s | 40.9901 KOps/s | |
test_step_mdp_speed[True-True-True-False-True] | 54.3000μs | 29.0353μs | 34.4408 KOps/s | 33.7143 KOps/s | |
test_step_mdp_speed[True-True-True-False-False] | 34.2000μs | 16.3468μs | 61.1740 KOps/s | 59.1834 KOps/s | |
test_step_mdp_speed[True-True-False-True-True] | 66.8000μs | 43.3320μs | 23.0776 KOps/s | 22.3314 KOps/s | |
test_step_mdp_speed[True-True-False-True-False] | 48.1000μs | 25.7455μs | 38.8417 KOps/s | 37.7635 KOps/s | |
test_step_mdp_speed[True-True-False-False-True] | 78.5000μs | 31.3653μs | 31.8824 KOps/s | 31.6230 KOps/s | |
test_step_mdp_speed[True-True-False-False-False] | 71.3990μs | 18.3653μs | 54.4505 KOps/s | 53.6252 KOps/s | |
test_step_mdp_speed[True-False-True-True-True] | 85.1000μs | 45.5888μs | 21.9352 KOps/s | 21.0858 KOps/s | |
test_step_mdp_speed[True-False-True-True-False] | 0.1808ms | 27.5003μs | 36.3632 KOps/s | 35.2333 KOps/s | |
test_step_mdp_speed[True-False-True-False-True] | 52.9000μs | 30.8654μs | 32.3987 KOps/s | 31.4311 KOps/s | |
test_step_mdp_speed[True-False-True-False-False] | 49.8000μs | 18.2504μs | 54.7933 KOps/s | 53.6000 KOps/s | |
test_step_mdp_speed[True-False-False-True-True] | 0.2154ms | 46.9304μs | 21.3081 KOps/s | 20.3611 KOps/s | |
test_step_mdp_speed[True-False-False-True-False] | 50.4000μs | 28.8974μs | 34.6052 KOps/s | 33.1031 KOps/s | |
test_step_mdp_speed[True-False-False-False-True] | 77.5000μs | 32.3668μs | 30.8959 KOps/s | 29.8479 KOps/s | |
test_step_mdp_speed[True-False-False-False-False] | 1.7450ms | 20.1915μs | 49.5258 KOps/s | 48.9124 KOps/s | |
test_step_mdp_speed[False-True-True-True-True] | 62.7000μs | 45.2768μs | 22.0864 KOps/s | 21.2660 KOps/s | |
test_step_mdp_speed[False-True-True-True-False] | 73.2000μs | 27.5787μs | 36.2599 KOps/s | 35.1213 KOps/s | |
test_step_mdp_speed[False-True-True-False-True] | 60.0000μs | 35.6254μs | 28.0698 KOps/s | 26.8566 KOps/s | |
test_step_mdp_speed[False-True-True-False-False] | 64.7000μs | 20.8645μs | 47.9283 KOps/s | 47.6794 KOps/s | |
test_step_mdp_speed[False-True-False-True-True] | 74.8000μs | 46.7144μs | 21.4067 KOps/s | 20.6727 KOps/s | |
test_step_mdp_speed[False-True-False-True-False] | 57.7000μs | 29.2292μs | 34.2124 KOps/s | 33.1761 KOps/s | |
test_step_mdp_speed[False-True-False-False-True] | 86.4000μs | 37.3229μs | 26.7932 KOps/s | 25.7547 KOps/s | |
test_step_mdp_speed[False-True-False-False-False] | 0.1353ms | 21.9363μs | 45.5865 KOps/s | 43.7678 KOps/s | |
test_step_mdp_speed[False-False-True-True-True] | 77.8000μs | 48.6579μs | 20.5517 KOps/s | 19.7860 KOps/s | |
test_step_mdp_speed[False-False-True-True-False] | 55.9010μs | 31.0734μs | 32.1819 KOps/s | 30.7067 KOps/s | |
test_step_mdp_speed[False-False-True-False-True] | 61.4000μs | 37.4335μs | 26.7140 KOps/s | 25.8353 KOps/s | |
test_step_mdp_speed[False-False-True-False-False] | 72.9000μs | 21.9582μs | 45.5412 KOps/s | 43.7955 KOps/s | |
test_step_mdp_speed[False-False-False-True-True] | 2.0420ms | 50.4116μs | 19.8367 KOps/s | 19.1992 KOps/s | |
test_step_mdp_speed[False-False-False-True-False] | 74.3010μs | 32.5262μs | 30.7445 KOps/s | 29.3083 KOps/s | |
test_step_mdp_speed[False-False-False-False-True] | 78.7000μs | 38.8093μs | 25.7671 KOps/s | 24.9180 KOps/s | |
test_step_mdp_speed[False-False-False-False-False] | 45.9010μs | 23.5089μs | 42.5371 KOps/s | 41.0821 KOps/s | |
test_values[generalized_advantage_estimate-True-True] | 14.7766ms | 13.3994ms | 74.6300 Ops/s | 73.9753 Ops/s | |
test_values[vec_generalized_advantage_estimate-True-True] | 56.2641ms | 50.4043ms | 19.8396 Ops/s | 19.4426 Ops/s | |
test_values[td0_return_estimate-False-False] | 0.5622ms | 0.1992ms | 5.0199 KOps/s | 4.4960 KOps/s | |
test_values[td1_return_estimate-False-False] | 13.4107ms | 13.1775ms | 75.8868 Ops/s | 73.0290 Ops/s | |
test_values[vec_td1_return_estimate-False-False] | 55.9084ms | 50.5512ms | 19.7819 Ops/s | 19.7221 Ops/s | |
test_values[td_lambda_return_estimate-True-False] | 34.2593ms | 31.7868ms | 31.4596 Ops/s | 31.0166 Ops/s | |
test_values[vec_td_lambda_return_estimate-True-False] | 56.5529ms | 51.5904ms | 19.3835 Ops/s | 19.7846 Ops/s | |
test_gae_speed[generalized_advantage_estimate-False-1-512] | 12.9894ms | 12.0165ms | 83.2189 Ops/s | 81.3979 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 3.1648ms | 2.3907ms | 418.2893 Ops/s | 385.3225 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 4.8243ms | 0.4119ms | 2.4277 KOps/s | 2.4140 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 57.6616ms | 52.6833ms | 18.9814 Ops/s | 19.4273 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 4.1776ms | 3.7811ms | 264.4758 Ops/s | 265.3367 Ops/s | |
test_dqn_speed | 6.5966ms | 1.7374ms | 575.5831 Ops/s | 576.9624 Ops/s | |
test_ddpg_speed | 7.4951ms | 2.4671ms | 405.3290 Ops/s | 403.9577 Ops/s | |
test_sac_speed | 12.3025ms | 7.7134ms | 129.6451 Ops/s | 128.6577 Ops/s | |
test_redq_speed | 19.6086ms | 15.5083ms | 64.4817 Ops/s | 65.6992 Ops/s | |
test_redq_deprec_speed | 17.8464ms | 12.6545ms | 79.0233 Ops/s | 80.3229 Ops/s | |
test_td3_speed | 10.7144ms | 9.5769ms | 104.4181 Ops/s | 104.1795 Ops/s | |
test_cql_speed | 39.2337ms | 29.2928ms | 34.1380 Ops/s | 39.7049 Ops/s | |
test_a2c_speed | 10.2868ms | 5.2879ms | 189.1124 Ops/s | 189.7434 Ops/s | |
test_ppo_speed | 14.8883ms | 5.7549ms | 173.7655 Ops/s | 173.7969 Ops/s | |
test_reinforce_speed | 8.9519ms | 4.1314ms | 242.0488 Ops/s | 244.2003 Ops/s | |
test_iql_speed | 25.7726ms | 20.9834ms | 47.6567 Ops/s | 45.4287 Ops/s | |
test_sample_rb[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 3.0218ms | 2.4917ms | 401.3271 Ops/s | 413.2824 Ops/s | |
test_sample_rb[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 4.2661ms | 2.6169ms | 382.1360 Ops/s | 388.1360 Ops/s | |
test_sample_rb[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.1070s | 2.8202ms | 354.5820 Ops/s | 366.0240 Ops/s | |
test_sample_rb[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 3.1679ms | 2.4319ms | 411.2077 Ops/s | 329.6348 Ops/s | |
test_sample_rb[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 3.7614ms | 2.4900ms | 401.5987 Ops/s | 359.1965 Ops/s | |
test_sample_rb[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 4.0341ms | 2.5112ms | 398.2085 Ops/s | 360.6777 Ops/s | |
test_sample_rb[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 2.8632ms | 2.3477ms | 425.9409 Ops/s | 381.0714 Ops/s | |
test_sample_rb[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 3.6314ms | 2.5590ms | 390.7798 Ops/s | 369.6513 Ops/s | |
test_sample_rb[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 2.5290ms | 2.3608ms | 423.5822 Ops/s | 373.7505 Ops/s | |
test_iterate_rb[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 3.0717ms | 2.3423ms | 426.9329 Ops/s | 410.5260 Ops/s | |
test_iterate_rb[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.1002s | 2.7863ms | 358.8958 Ops/s | 370.2692 Ops/s | |
test_iterate_rb[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 3.9479ms | 2.5273ms | 395.6767 Ops/s | 369.2233 Ops/s | |
test_iterate_rb[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 2.9742ms | 2.5770ms | 388.0538 Ops/s | 405.8804 Ops/s | |
test_iterate_rb[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 4.7281ms | 2.6856ms | 372.3622 Ops/s | 385.0157 Ops/s | |
test_iterate_rb[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 4.1243ms | 2.7011ms | 370.2133 Ops/s | 385.2467 Ops/s | |
test_iterate_rb[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 3.3702ms | 2.6084ms | 383.3779 Ops/s | 407.4434 Ops/s | |
test_iterate_rb[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 4.1245ms | 2.6359ms | 379.3736 Ops/s | 388.3527 Ops/s | |
test_iterate_rb[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 4.7200ms | 2.5666ms | 389.6244 Ops/s | 359.6003 Ops/s | |
test_populate_rb[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 0.1809s | 26.3173ms | 37.9978 Ops/s | 36.6085 Ops/s | |
test_populate_rb[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 0.1072s | 22.9400ms | 43.5920 Ops/s | 38.8907 Ops/s | |
test_populate_rb[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 0.1070s | 24.7535ms | 40.3984 Ops/s | 42.2116 Ops/s | |
test_populate_rb[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.1106s | 23.1160ms | 43.2600 Ops/s | 38.5408 Ops/s | |
test_populate_rb[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 0.1083s | 24.8719ms | 40.2061 Ops/s | 41.1872 Ops/s | |
test_populate_rb[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 0.1170s | 23.5993ms | 42.3742 Ops/s | 42.1946 Ops/s | |
test_populate_rb[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 0.1186s | 25.8829ms | 38.6356 Ops/s | 42.2681 Ops/s | |
test_populate_rb[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 0.1182s | 23.7273ms | 42.1456 Ops/s | 39.6871 Ops/s | |
test_populate_rb[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 0.1169s | 25.5321ms | 39.1665 Ops/s | 42.2982 Ops/s |
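The Change column is empty in this capture of the table. As a rough sketch, the relative difference between the "Ops" and "Ops on Repo HEAD" columns could be recomputed as below. The helper names `parse_ops` and `relative_change` are hypothetical and not part of the PR or the benchmark tooling, and defining the change relative to repo HEAD is an assumption. The three rows are copied from the table above.

```python
# Hypothetical helpers for reading the benchmark table; not part of the PR.

# Throughput units that appear in the table, scaled to operations per second.
# "MOps/s" does not appear in the table and is included only as an assumption.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '23.8386 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * UNIT_SCALE[unit]


def relative_change(ops_pr: str, ops_head: str) -> float:
    """Percent change of the PR throughput relative to repo HEAD (assumed definition)."""
    pr, head = parse_ops(ops_pr), parse_ops(ops_head)
    return (pr - head) / head * 100.0


# (Ops, Ops on Repo HEAD) pairs copied from three rows of the table.
rows = {
    "test_cql_speed": ("34.1380 Ops/s", "39.7049 Ops/s"),
    "test_sample_rb[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000]": (
        "411.2077 Ops/s",
        "329.6348 Ops/s",
    ),
    "test_step_mdp_speed[True-True-True-True-True]": (
        "23.8386 KOps/s",
        "23.2842 KOps/s",
    ),
}

for name, (pr, head) in rows.items():
    print(f"{name}: {relative_change(pr, head):+.2f}%")
```

A positive value means the PR branch measured more operations per second than repo HEAD. Single-run differences of this size may simply reflect measurement noise, so they are best read alongside the Max and Mean columns.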
matteobettini approved these changes on Aug 30, 2023
LGTM
osalpekar pushed a commit to osalpekar/rl that referenced this pull request on Aug 30, 2023
vmoens added a commit to hyerra/rl that referenced this pull request on Oct 10, 2023
Labels
bug: Something isn't working
CLA Signed
Description
Fixes #1453