Skip to content

Conversation

vmoens
Copy link
Collaborator

@vmoens vmoens commented Nov 4, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Nov 4, 2024
ghstack-source-id: 675a403
Pull Request resolved: #2533
Copy link

pytorch-bot bot commented Nov 4, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2533

Note: Links to docs will display an error until the docs builds have been completed.

❌ 19 New Failures, 4 Unrelated Failures

As of commit 18d343b with merge base 05aeb89 (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 4, 2024
@vmoens vmoens merged commit 18d343b into gh/vmoens/35/base Nov 4, 2024
32 of 51 checks passed
@vmoens vmoens deleted the gh/vmoens/35/head branch November 4, 2024 13:12
Copy link

github-actions bot commented Nov 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 145. Improved: $\large\color{#35bf28}18$. Worsened: $\large\color{#d91a1a}8$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7841s 0.7615s 1.3132 Ops/s 1.2719 Ops/s $\color{#35bf28}+3.25\%$
test_transformed 1.0296s 1.0042s 0.9959 Ops/s 0.9918 Ops/s $\color{#35bf28}+0.41\%$
test_serial 2.3500s 2.2694s 0.4406 Ops/s 0.4531 Ops/s $\color{#d91a1a}-2.76\%$
test_parallel 2.0890s 1.9747s 0.5064 Ops/s 0.5110 Ops/s $\color{#d91a1a}-0.89\%$
test_step_mdp_speed[True-True-True-True-True] 0.2333ms 36.2769μs 27.5657 KOps/s 27.8163 KOps/s $\color{#d91a1a}-0.90\%$
test_step_mdp_speed[True-True-True-True-False] 52.4110μs 20.8790μs 47.8950 KOps/s 47.4646 KOps/s $\color{#35bf28}+0.91\%$
test_step_mdp_speed[True-True-True-False-True] 49.5310μs 20.5423μs 48.6801 KOps/s 49.5025 KOps/s $\color{#d91a1a}-1.66\%$
test_step_mdp_speed[True-True-True-False-False] 49.3910μs 11.8641μs 84.2881 KOps/s 85.3733 KOps/s $\color{#d91a1a}-1.27\%$
test_step_mdp_speed[True-True-False-True-True] 69.9710μs 39.0477μs 25.6097 KOps/s 25.7499 KOps/s $\color{#d91a1a}-0.54\%$
test_step_mdp_speed[True-True-False-True-False] 54.8410μs 22.5732μs 44.3003 KOps/s 44.1224 KOps/s $\color{#35bf28}+0.40\%$
test_step_mdp_speed[True-True-False-False-True] 55.1310μs 22.2261μs 44.9922 KOps/s 44.9706 KOps/s $\color{#35bf28}+0.05\%$
test_step_mdp_speed[True-True-False-False-False] 49.3310μs 13.7466μs 72.7454 KOps/s 76.2556 KOps/s $\color{#d91a1a}-4.60\%$
test_step_mdp_speed[True-False-True-True-True] 75.7310μs 40.7690μs 24.5284 KOps/s 24.6836 KOps/s $\color{#d91a1a}-0.63\%$
test_step_mdp_speed[True-False-True-True-False] 59.9110μs 24.6797μs 40.5191 KOps/s 40.5766 KOps/s $\color{#d91a1a}-0.14\%$
test_step_mdp_speed[True-False-True-False-True] 48.4010μs 22.2582μs 44.9273 KOps/s 45.1531 KOps/s $\color{#d91a1a}-0.50\%$
test_step_mdp_speed[True-False-True-False-False] 39.3500μs 13.9189μs 71.8447 KOps/s 72.1332 KOps/s $\color{#d91a1a}-0.40\%$
test_step_mdp_speed[True-False-False-True-True] 92.8320μs 42.5460μs 23.5040 KOps/s 23.4077 KOps/s $\color{#35bf28}+0.41\%$
test_step_mdp_speed[True-False-False-True-False] 64.7110μs 26.6846μs 37.4749 KOps/s 37.6909 KOps/s $\color{#d91a1a}-0.57\%$
test_step_mdp_speed[True-False-False-False-True] 52.8610μs 24.0203μs 41.6315 KOps/s 41.2734 KOps/s $\color{#35bf28}+0.87\%$
test_step_mdp_speed[True-False-False-False-False] 42.6510μs 15.7814μs 63.3657 KOps/s 63.6822 KOps/s $\color{#d91a1a}-0.50\%$
test_step_mdp_speed[False-True-True-True-True] 71.7120μs 40.8022μs 24.5085 KOps/s 24.7250 KOps/s $\color{#d91a1a}-0.88\%$
test_step_mdp_speed[False-True-True-True-False] 59.0410μs 24.8447μs 40.2501 KOps/s 40.1015 KOps/s $\color{#35bf28}+0.37\%$
test_step_mdp_speed[False-True-True-False-True] 55.2910μs 25.6118μs 39.0446 KOps/s 39.1097 KOps/s $\color{#d91a1a}-0.17\%$
test_step_mdp_speed[False-True-True-False-False] 48.3010μs 15.5208μs 64.4295 KOps/s 65.0240 KOps/s $\color{#d91a1a}-0.91\%$
test_step_mdp_speed[False-True-False-True-True] 89.4610μs 42.4972μs 23.5310 KOps/s 23.6095 KOps/s $\color{#d91a1a}-0.33\%$
test_step_mdp_speed[False-True-False-True-False] 56.3210μs 26.4129μs 37.8603 KOps/s 37.7004 KOps/s $\color{#35bf28}+0.42\%$
test_step_mdp_speed[False-True-False-False-True] 3.3743ms 28.0546μs 35.6448 KOps/s 35.9758 KOps/s $\color{#d91a1a}-0.92\%$
test_step_mdp_speed[False-True-False-False-False] 47.9510μs 17.4700μs 57.2409 KOps/s 57.1487 KOps/s $\color{#35bf28}+0.16\%$
test_step_mdp_speed[False-False-True-True-True] 76.3720μs 45.1426μs 22.1520 KOps/s 22.3444 KOps/s $\color{#d91a1a}-0.86\%$
test_step_mdp_speed[False-False-True-True-False] 63.3710μs 28.9868μs 34.4985 KOps/s 34.6998 KOps/s $\color{#d91a1a}-0.58\%$
test_step_mdp_speed[False-False-True-False-True] 64.9120μs 27.6743μs 36.1346 KOps/s 36.8731 KOps/s $\color{#d91a1a}-2.00\%$
test_step_mdp_speed[False-False-True-False-False] 55.3310μs 17.2954μs 57.8188 KOps/s 58.0843 KOps/s $\color{#d91a1a}-0.46\%$
test_step_mdp_speed[False-False-False-True-True] 72.4420μs 45.9467μs 21.7643 KOps/s 21.6802 KOps/s $\color{#35bf28}+0.39\%$
test_step_mdp_speed[False-False-False-True-False] 59.5110μs 30.6668μs 32.6086 KOps/s 32.7065 KOps/s $\color{#d91a1a}-0.30\%$
test_step_mdp_speed[False-False-False-False-True] 62.1620μs 29.2677μs 34.1674 KOps/s 34.7920 KOps/s $\color{#d91a1a}-1.80\%$
test_step_mdp_speed[False-False-False-False-False] 51.2410μs 18.8487μs 53.0539 KOps/s 52.5003 KOps/s $\color{#35bf28}+1.05\%$
test_values[generalized_advantage_estimate-True-True] 24.8486ms 23.9786ms 41.7039 Ops/s 41.2685 Ops/s $\color{#35bf28}+1.05\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1007s 2.9006ms 344.7563 Ops/s 328.0210 Ops/s $\textbf{\color{#35bf28}+5.10\%}$
test_values[td0_return_estimate-False-False] 86.1820μs 66.7271μs 14.9864 KOps/s 15.0805 KOps/s $\color{#d91a1a}-0.62\%$
test_values[td1_return_estimate-False-False] 55.7605ms 53.1881ms 18.8012 Ops/s 18.5834 Ops/s $\color{#35bf28}+1.17\%$
test_values[vec_td1_return_estimate-False-False] 1.2877ms 1.0698ms 934.7395 Ops/s 936.9023 Ops/s $\color{#d91a1a}-0.23\%$
test_values[td_lambda_return_estimate-True-False] 89.8610ms 85.2045ms 11.7365 Ops/s 11.5588 Ops/s $\color{#35bf28}+1.54\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.2686ms 1.0666ms 937.5597 Ops/s 938.0760 Ops/s $\color{#d91a1a}-0.06\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 23.8047ms 23.5771ms 42.4140 Ops/s 41.8447 Ops/s $\color{#35bf28}+1.36\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0307ms 0.7372ms 1.3565 KOps/s 1.3533 KOps/s $\color{#35bf28}+0.23\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7448ms 0.6521ms 1.5334 KOps/s 1.5314 KOps/s $\color{#35bf28}+0.13\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5295ms 1.4689ms 680.7874 Ops/s 681.7394 Ops/s $\color{#d91a1a}-0.14\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7427ms 0.6693ms 1.4942 KOps/s 1.4932 KOps/s $\color{#35bf28}+0.07\%$
test_dqn_speed[False-None] 6.7997ms 1.2741ms 784.8418 Ops/s 770.5968 Ops/s $\color{#35bf28}+1.85\%$
test_dqn_speed[False-backward] 1.8905ms 1.8011ms 555.2063 Ops/s 557.4348 Ops/s $\color{#d91a1a}-0.40\%$
test_dqn_speed[True-None] 0.6824ms 0.5428ms 1.8423 KOps/s 1.7837 KOps/s $\color{#35bf28}+3.29\%$
test_dqn_speed[True-backward] 1.0059ms 0.9833ms 1.0169 KOps/s 937.2264 Ops/s $\textbf{\color{#35bf28}+8.51\%}$
test_dqn_speed[reduce-overhead-None] 0.6844ms 0.5530ms 1.8082 KOps/s 1.7663 KOps/s $\color{#35bf28}+2.37\%$
test_dqn_speed[reduce-overhead-backward] 1.0319ms 0.9904ms 1.0097 KOps/s 1.0056 KOps/s $\color{#35bf28}+0.40\%$
test_ddpg_speed[False-None] 3.8106ms 2.6009ms 384.4839 Ops/s 376.4558 Ops/s $\color{#35bf28}+2.13\%$
test_ddpg_speed[False-backward] 3.8916ms 3.7858ms 264.1461 Ops/s 260.7610 Ops/s $\color{#35bf28}+1.30\%$
test_ddpg_speed[True-None] 1.3093ms 1.2126ms 824.6853 Ops/s 789.6019 Ops/s $\color{#35bf28}+4.44\%$
test_ddpg_speed[True-backward] 2.2503ms 2.1777ms 459.2101 Ops/s 376.3843 Ops/s $\textbf{\color{#35bf28}+22.01\%}$
test_ddpg_speed[reduce-overhead-None] 1.3553ms 1.2186ms 820.6435 Ops/s 787.8367 Ops/s $\color{#35bf28}+4.16\%$
test_ddpg_speed[reduce-overhead-backward] 2.2546ms 2.1774ms 459.2654 Ops/s 452.3348 Ops/s $\color{#35bf28}+1.53\%$
test_sac_speed[False-None] 8.2557ms 7.2894ms 137.1848 Ops/s 133.5360 Ops/s $\color{#35bf28}+2.73\%$
test_sac_speed[False-backward] 14.3735ms 10.6915ms 93.5325 Ops/s 94.5410 Ops/s $\color{#d91a1a}-1.07\%$
test_sac_speed[True-None] 2.0735ms 1.9674ms 508.2724 Ops/s 504.0235 Ops/s $\color{#35bf28}+0.84\%$
test_sac_speed[True-backward] 3.9491ms 3.8342ms 260.8073 Ops/s 247.3312 Ops/s $\textbf{\color{#35bf28}+5.45\%}$
test_sac_speed[reduce-overhead-None] 2.0834ms 1.9718ms 507.1579 Ops/s 507.9868 Ops/s $\color{#d91a1a}-0.16\%$
test_sac_speed[reduce-overhead-backward] 3.9788ms 3.8466ms 259.9718 Ops/s 260.8727 Ops/s $\color{#d91a1a}-0.35\%$
test_redq_speed[False-None] 15.0753ms 10.0027ms 99.9734 Ops/s 92.2440 Ops/s $\textbf{\color{#35bf28}+8.38\%}$
test_redq_speed[False-backward] 21.6403ms 16.8321ms 59.4103 Ops/s 61.2152 Ops/s $\color{#d91a1a}-2.95\%$
test_redq_speed[True-None] 4.0297ms 3.5047ms 285.3294 Ops/s 291.2227 Ops/s $\color{#d91a1a}-2.02\%$
test_redq_speed[True-backward] 8.4991ms 8.2824ms 120.7379 Ops/s 122.8495 Ops/s $\color{#d91a1a}-1.72\%$
test_redq_speed[reduce-overhead-None] 3.6677ms 3.4186ms 292.5184 Ops/s 295.5619 Ops/s $\color{#d91a1a}-1.03\%$
test_redq_speed[reduce-overhead-backward] 8.6351ms 8.3283ms 120.0718 Ops/s 124.3961 Ops/s $\color{#d91a1a}-3.48\%$
test_redq_deprec_speed[False-None] 10.9138ms 10.2344ms 97.7100 Ops/s 93.5287 Ops/s $\color{#35bf28}+4.47\%$
test_redq_deprec_speed[False-backward] 15.4985ms 14.9637ms 66.8283 Ops/s 64.9456 Ops/s $\color{#35bf28}+2.90\%$
test_redq_deprec_speed[True-None] 3.3353ms 3.0894ms 323.6892 Ops/s 313.4699 Ops/s $\color{#35bf28}+3.26\%$
test_redq_deprec_speed[True-backward] 7.1143ms 6.8966ms 144.9981 Ops/s 135.6329 Ops/s $\textbf{\color{#35bf28}+6.90\%}$
test_redq_deprec_speed[reduce-overhead-None] 3.2716ms 3.0987ms 322.7141 Ops/s 320.3949 Ops/s $\color{#35bf28}+0.72\%$
test_redq_deprec_speed[reduce-overhead-backward] 7.0117ms 6.8469ms 146.0512 Ops/s 134.4525 Ops/s $\textbf{\color{#35bf28}+8.63\%}$
test_td3_speed[False-None] 7.4327ms 7.2277ms 138.3558 Ops/s 134.2787 Ops/s $\color{#35bf28}+3.04\%$
test_td3_speed[False-backward] 10.4070ms 9.9587ms 100.4150 Ops/s 95.8268 Ops/s $\color{#35bf28}+4.79\%$
test_td3_speed[True-None] 1.8696ms 1.8511ms 540.2189 Ops/s 527.0503 Ops/s $\color{#35bf28}+2.50\%$
test_td3_speed[True-backward] 3.7030ms 3.5915ms 278.4390 Ops/s 279.3087 Ops/s $\color{#d91a1a}-0.31\%$
test_td3_speed[reduce-overhead-None] 1.8954ms 1.8455ms 541.8456 Ops/s 540.4583 Ops/s $\color{#35bf28}+0.26\%$
test_td3_speed[reduce-overhead-backward] 3.7443ms 3.6053ms 277.3732 Ops/s 278.6226 Ops/s $\color{#d91a1a}-0.45\%$
test_cql_speed[False-None] 27.1241ms 24.3389ms 41.0865 Ops/s 41.5431 Ops/s $\color{#d91a1a}-1.10\%$
test_cql_speed[False-backward] 36.2728ms 33.2317ms 30.0917 Ops/s 30.4242 Ops/s $\color{#d91a1a}-1.09\%$
test_cql_speed[True-None] 10.8391ms 10.5884ms 94.4432 Ops/s 96.1983 Ops/s $\color{#d91a1a}-1.82\%$
test_cql_speed[True-backward] 16.5171ms 16.2024ms 61.7194 Ops/s 60.9693 Ops/s $\color{#35bf28}+1.23\%$
test_cql_speed[reduce-overhead-None] 10.8654ms 10.6213ms 94.1508 Ops/s 96.9431 Ops/s $\color{#d91a1a}-2.88\%$
test_cql_speed[reduce-overhead-backward] 16.6841ms 16.2704ms 61.4612 Ops/s 62.8907 Ops/s $\color{#d91a1a}-2.27\%$
test_a2c_speed[False-None] 5.3971ms 5.1243ms 195.1505 Ops/s 195.1092 Ops/s $\color{#35bf28}+0.02\%$
test_a2c_speed[False-backward] 12.5833ms 11.3567ms 88.0539 Ops/s 88.3724 Ops/s $\color{#d91a1a}-0.36\%$
test_a2c_speed[True-None] 3.1486ms 2.9696ms 336.7476 Ops/s 335.6384 Ops/s $\color{#35bf28}+0.33\%$
test_a2c_speed[True-backward] 8.5290ms 8.2694ms 120.9272 Ops/s 119.6598 Ops/s $\color{#35bf28}+1.06\%$
test_a2c_speed[reduce-overhead-None] 3.1358ms 2.9599ms 337.8464 Ops/s 333.3713 Ops/s $\color{#35bf28}+1.34\%$
test_a2c_speed[reduce-overhead-backward] 8.4560ms 8.1532ms 122.6516 Ops/s 123.6230 Ops/s $\color{#d91a1a}-0.79\%$
test_ppo_speed[False-None] 5.7988ms 5.4969ms 181.9204 Ops/s 178.9917 Ops/s $\color{#35bf28}+1.64\%$
test_ppo_speed[False-backward] 12.1790ms 11.8724ms 84.2287 Ops/s 84.0546 Ops/s $\color{#35bf28}+0.21\%$
test_ppo_speed[True-None] 3.5511ms 3.3835ms 295.5527 Ops/s 289.6848 Ops/s $\color{#35bf28}+2.03\%$
test_ppo_speed[True-backward] 8.1450ms 7.8927ms 126.6995 Ops/s 125.3443 Ops/s $\color{#35bf28}+1.08\%$
test_ppo_speed[reduce-overhead-None] 3.5511ms 3.3809ms 295.7811 Ops/s 297.6263 Ops/s $\color{#d91a1a}-0.62\%$
test_ppo_speed[reduce-overhead-backward] 8.2305ms 7.9927ms 125.1148 Ops/s 124.0299 Ops/s $\color{#35bf28}+0.87\%$
test_reinforce_speed[False-None] 4.5362ms 4.3402ms 230.4059 Ops/s 229.4701 Ops/s $\color{#35bf28}+0.41\%$
test_reinforce_speed[False-backward] 7.3884ms 7.0659ms 141.5239 Ops/s 141.7357 Ops/s $\color{#d91a1a}-0.15\%$
test_reinforce_speed[True-None] 2.2924ms 2.1414ms 466.9921 Ops/s 463.5162 Ops/s $\color{#35bf28}+0.75\%$
test_reinforce_speed[True-backward] 7.0145ms 6.8599ms 145.7749 Ops/s 146.9193 Ops/s $\color{#d91a1a}-0.78\%$
test_reinforce_speed[reduce-overhead-None] 2.3017ms 2.1419ms 466.8675 Ops/s 466.5225 Ops/s $\color{#35bf28}+0.07\%$
test_reinforce_speed[reduce-overhead-backward] 7.1731ms 6.8623ms 145.7228 Ops/s 145.9793 Ops/s $\color{#d91a1a}-0.18\%$
test_iql_speed[False-None] 19.5970ms 18.9772ms 52.6950 Ops/s 51.3207 Ops/s $\color{#35bf28}+2.68\%$
test_iql_speed[False-backward] 30.4199ms 29.1522ms 34.3028 Ops/s 33.3289 Ops/s $\color{#35bf28}+2.92\%$
test_iql_speed[True-None] 7.7488ms 6.5459ms 152.7670 Ops/s 152.3554 Ops/s $\color{#35bf28}+0.27\%$
test_iql_speed[True-backward] 15.1749ms 14.8739ms 67.2320 Ops/s 62.1192 Ops/s $\textbf{\color{#35bf28}+8.23\%}$
test_iql_speed[reduce-overhead-None] 6.8667ms 6.5467ms 152.7487 Ops/s 151.0093 Ops/s $\color{#35bf28}+1.15\%$
test_iql_speed[reduce-overhead-backward] 16.0479ms 14.9076ms 67.0799 Ops/s 66.2397 Ops/s $\color{#35bf28}+1.27\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.3895ms 6.2776ms 159.2954 Ops/s 156.9724 Ops/s $\color{#35bf28}+1.48\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8558ms 0.2552ms 3.9186 KOps/s 3.1441 KOps/s $\textbf{\color{#35bf28}+24.63\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4306ms 0.2337ms 4.2784 KOps/s 3.3622 KOps/s $\textbf{\color{#35bf28}+27.25\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.3104ms 6.0467ms 165.3785 Ops/s 164.2831 Ops/s $\color{#35bf28}+0.67\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.7767ms 0.2689ms 3.7184 KOps/s 3.4867 KOps/s $\textbf{\color{#35bf28}+6.65\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5152ms 0.2669ms 3.7461 KOps/s 3.7185 KOps/s $\color{#35bf28}+0.74\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.3887ms 1.2010ms 832.6717 Ops/s 688.3926 Ops/s $\textbf{\color{#35bf28}+20.96\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.3658ms 1.1535ms 866.9424 Ops/s 724.3632 Ops/s $\textbf{\color{#35bf28}+19.68\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5001ms 6.2880ms 159.0341 Ops/s 159.8498 Ops/s $\color{#d91a1a}-0.51\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.3280ms 0.4992ms 2.0031 KOps/s 2.2372 KOps/s $\textbf{\color{#d91a1a}-10.46\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6811ms 0.4103ms 2.4370 KOps/s 2.3238 KOps/s $\color{#35bf28}+4.87\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.2648ms 6.1547ms 162.4783 Ops/s 163.4508 Ops/s $\color{#d91a1a}-0.59\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.1170ms 0.3309ms 3.0217 KOps/s 3.8447 KOps/s $\textbf{\color{#d91a1a}-21.41\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5293ms 0.2856ms 3.5011 KOps/s 4.1877 KOps/s $\textbf{\color{#d91a1a}-16.40\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.3952ms 6.0792ms 164.4954 Ops/s 163.9290 Ops/s $\color{#35bf28}+0.35\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.0837ms 0.3733ms 2.6785 KOps/s 3.8462 KOps/s $\textbf{\color{#d91a1a}-30.36\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6239ms 0.3361ms 2.9754 KOps/s 3.5080 KOps/s $\textbf{\color{#d91a1a}-15.18\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 10.1796ms 6.6093ms 151.3022 Ops/s 159.4480 Ops/s $\textbf{\color{#d91a1a}-5.11\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.7907ms 0.4551ms 2.1973 KOps/s 2.1815 KOps/s $\color{#35bf28}+0.72\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6759ms 0.4349ms 2.2991 KOps/s 2.1542 KOps/s $\textbf{\color{#35bf28}+6.73\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.4451s 14.5465ms 68.7450 Ops/s 194.4946 Ops/s $\textbf{\color{#d91a1a}-64.65\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 10.1431ms 2.1175ms 472.2488 Ops/s 448.2634 Ops/s $\textbf{\color{#35bf28}+5.35\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.0747ms 1.2422ms 805.0401 Ops/s 795.1416 Ops/s $\color{#35bf28}+1.24\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 6.9674ms 5.3052ms 188.4941 Ops/s 33.4929 Ops/s $\textbf{\color{#35bf28}+462.79\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 10.2097ms 2.0347ms 491.4679 Ops/s 481.6494 Ops/s $\color{#35bf28}+2.04\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 3.4734ms 1.1292ms 885.5663 Ops/s 818.6970 Ops/s $\textbf{\color{#35bf28}+8.17\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.3738s 12.8900ms 77.5792 Ops/s 177.1717 Ops/s $\textbf{\color{#d91a1a}-56.21\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 4.0594ms 2.0265ms 493.4510 Ops/s 446.4715 Ops/s $\textbf{\color{#35bf28}+10.52\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 8.7546ms 1.3592ms 735.7514 Ops/s 726.7856 Ops/s $\color{#35bf28}+1.23\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000-100-True] 48.2226ms 46.3340ms 21.5824 Ops/s 21.3015 Ops/s $\color{#35bf28}+1.32\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000-100-False] 9.9973ms 9.4314ms 106.0290 Ops/s 104.4651 Ops/s $\color{#35bf28}+1.50\%$

vmoens pushed a commit that referenced this pull request Nov 14, 2024
ghstack-source-id: 675a403
Pull Request resolved: #2533

(cherry picked from commit fa64c2f)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants