Skip to content

Conversation

@vmoens
Copy link
Collaborator

@vmoens vmoens commented Sep 8, 2025

No description provided.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 8, 2025
@vmoens vmoens added the CI label Sep 8, 2025
@vmoens vmoens merged commit 6139ec2 into main Sep 8, 2025
65 of 69 checks passed
@github-actions
Copy link

github-actions bot commented Sep 8, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 233. Improved: $\large\color{#35bf28}8$. Worsened: $\large\color{#d91a1a}6$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 43.3610μs 14.5373μs 68.7887 KOps/s 68.2563 KOps/s $\color{#35bf28}+0.78\%$
test_plain_set_stack_nested 36.6610μs 14.6720μs 68.1571 KOps/s 67.7264 KOps/s $\color{#35bf28}+0.64\%$
test_plain_set_nested_inplace 40.8010μs 16.0924μs 62.1412 KOps/s 62.1002 KOps/s $\color{#35bf28}+0.07\%$
test_plain_set_stack_nested_inplace 58.6110μs 16.0221μs 62.4138 KOps/s 61.8555 KOps/s $\color{#35bf28}+0.90\%$
test_items 38.4310μs 5.8429μs 171.1468 KOps/s 166.3643 KOps/s $\color{#35bf28}+2.87\%$
test_items_nested 0.5670ms 0.5136ms 1.9470 KOps/s 1.9083 KOps/s $\color{#35bf28}+2.02\%$
test_items_nested_locked 0.6217ms 0.5190ms 1.9267 KOps/s 1.9132 KOps/s $\color{#35bf28}+0.70\%$
test_items_nested_leaf 0.1169ms 92.0266μs 10.8664 KOps/s 11.0138 KOps/s $\color{#d91a1a}-1.34\%$
test_items_stack_nested 0.5832ms 0.5204ms 1.9215 KOps/s 1.9302 KOps/s $\color{#d91a1a}-0.45\%$
test_items_stack_nested_leaf 0.1401ms 92.5484μs 10.8052 KOps/s 10.9826 KOps/s $\color{#d91a1a}-1.62\%$
test_items_stack_nested_locked 0.5798ms 0.5169ms 1.9347 KOps/s 1.9054 KOps/s $\color{#35bf28}+1.54\%$
test_keys 23.9610μs 4.1382μs 241.6521 KOps/s 240.9602 KOps/s $\color{#35bf28}+0.29\%$
test_keys_nested 0.1552ms 0.1159ms 8.6276 KOps/s 8.6032 KOps/s $\color{#35bf28}+0.28\%$
test_keys_nested_locked 2.0325ms 0.1258ms 7.9480 KOps/s 7.9949 KOps/s $\color{#d91a1a}-0.59\%$
test_keys_nested_leaf 0.1436ms 0.1080ms 9.2621 KOps/s 9.3508 KOps/s $\color{#d91a1a}-0.95\%$
test_keys_stack_nested 0.1547ms 0.1174ms 8.5208 KOps/s 8.5860 KOps/s $\color{#d91a1a}-0.76\%$
test_keys_stack_nested_leaf 0.1573ms 0.1077ms 9.2827 KOps/s 9.3969 KOps/s $\color{#d91a1a}-1.22\%$
test_keys_stack_nested_locked 0.1726ms 0.1247ms 8.0214 KOps/s 7.9374 KOps/s $\color{#35bf28}+1.06\%$
test_values 7.1660μs 1.0264μs 974.3170 KOps/s 1.0077 MOps/s $\color{#d91a1a}-3.31\%$
test_values_nested 77.0920μs 47.0131μs 21.2707 KOps/s 21.4245 KOps/s $\color{#d91a1a}-0.72\%$
test_values_nested_locked 85.8120μs 50.0029μs 19.9988 KOps/s 20.0766 KOps/s $\color{#d91a1a}-0.39\%$
test_values_nested_leaf 0.1313ms 52.9027μs 18.9026 KOps/s 19.0411 KOps/s $\color{#d91a1a}-0.73\%$
test_values_stack_nested 75.4220μs 46.8250μs 21.3561 KOps/s 21.3984 KOps/s $\color{#d91a1a}-0.20\%$
test_values_stack_nested_leaf 0.5772ms 52.6379μs 18.9977 KOps/s 18.9968 KOps/s $+0.00\%$
test_values_stack_nested_locked 77.9420μs 49.8828μs 20.0470 KOps/s 20.1388 KOps/s $\color{#d91a1a}-0.46\%$
test_membership 6.6420μs 0.8052μs 1.2420 MOps/s 1.2488 MOps/s $\color{#d91a1a}-0.54\%$
test_membership_nested 30.9010μs 2.9349μs 340.7241 KOps/s 339.9766 KOps/s $\color{#35bf28}+0.22\%$
test_membership_nested_leaf 36.5210μs 2.9753μs 336.0961 KOps/s 338.4440 KOps/s $\color{#d91a1a}-0.69\%$
test_membership_stacked_nested 28.1010μs 2.9644μs 337.3375 KOps/s 339.1253 KOps/s $\color{#d91a1a}-0.53\%$
test_membership_stacked_nested_leaf 25.7910μs 2.9571μs 338.1669 KOps/s 343.9553 KOps/s $\color{#d91a1a}-1.68\%$
test_membership_nested_last 32.3510μs 4.3525μs 229.7549 KOps/s 230.4443 KOps/s $\color{#d91a1a}-0.30\%$
test_membership_nested_leaf_last 26.8910μs 4.3361μs 230.6240 KOps/s 232.2873 KOps/s $\color{#d91a1a}-0.72\%$
test_membership_stacked_nested_last 26.6900μs 4.3436μs 230.2234 KOps/s 232.9889 KOps/s $\color{#d91a1a}-1.19\%$
test_membership_stacked_nested_leaf_last 36.4600μs 4.3521μs 229.7752 KOps/s 232.0616 KOps/s $\color{#d91a1a}-0.99\%$
test_nested_getleaf 51.7410μs 20.4919μs 48.7998 KOps/s 48.4810 KOps/s $\color{#35bf28}+0.66\%$
test_nested_get 48.1710μs 19.3402μs 51.7057 KOps/s 51.1571 KOps/s $\color{#35bf28}+1.07\%$
test_stacked_getleaf 54.3810μs 20.2727μs 49.3275 KOps/s 48.4513 KOps/s $\color{#35bf28}+1.81\%$
test_stacked_get 90.1520μs 19.3629μs 51.6452 KOps/s 51.2145 KOps/s $\color{#35bf28}+0.84\%$
test_nested_getitemleaf 43.3610μs 20.9671μs 47.6937 KOps/s 47.0313 KOps/s $\color{#35bf28}+1.41\%$
test_nested_getitem 46.7810μs 19.8990μs 50.2538 KOps/s 50.1391 KOps/s $\color{#35bf28}+0.23\%$
test_stacked_getitemleaf 59.7410μs 20.8931μs 47.8627 KOps/s 47.5142 KOps/s $\color{#35bf28}+0.73\%$
test_stacked_getitem 47.8310μs 19.8321μs 50.4233 KOps/s 49.9315 KOps/s $\color{#35bf28}+0.98\%$
test_lock_nested 0.5327ms 0.4585ms 2.1811 KOps/s 2.1912 KOps/s $\color{#d91a1a}-0.46\%$
test_lock_stack_nested 0.5450ms 0.4580ms 2.1835 KOps/s 2.1876 KOps/s $\color{#d91a1a}-0.19\%$
test_unlock_nested 0.4431ms 0.3712ms 2.6941 KOps/s 2.6941 KOps/s $+0.00\%$
test_unlock_stack_nested 0.4121ms 0.3668ms 2.7261 KOps/s 2.7037 KOps/s $\color{#35bf28}+0.83\%$
test_flatten_speed 0.1589ms 0.1165ms 8.5835 KOps/s 8.5096 KOps/s $\color{#35bf28}+0.87\%$
test_unflatten_speed 0.6417ms 0.5686ms 1.7588 KOps/s 1.7365 KOps/s $\color{#35bf28}+1.29\%$
test_common_ops 0.8661ms 0.7282ms 1.3733 KOps/s 1.3785 KOps/s $\color{#d91a1a}-0.38\%$
test_creation 87.4120μs 2.5775μs 387.9704 KOps/s 390.6471 KOps/s $\color{#d91a1a}-0.69\%$
test_creation_empty 35.1110μs 8.5189μs 117.3867 KOps/s 116.1148 KOps/s $\color{#35bf28}+1.10\%$
test_creation_nested_1 40.0010μs 11.4422μs 87.3958 KOps/s 87.5582 KOps/s $\color{#d91a1a}-0.19\%$
test_creation_nested_2 50.2510μs 15.1481μs 66.0150 KOps/s 65.4631 KOps/s $\color{#35bf28}+0.84\%$
test_clone 46.2510μs 12.8960μs 77.5434 KOps/s 76.8457 KOps/s $\color{#35bf28}+0.91\%$
test_getitem[int] 1.1476ms 13.8258μs 72.3283 KOps/s 72.1008 KOps/s $\color{#35bf28}+0.32\%$
test_getitem[slice_int] 0.1439ms 27.9671μs 35.7563 KOps/s 36.0222 KOps/s $\color{#d91a1a}-0.74\%$
test_getitem[range] 0.1677ms 48.2398μs 20.7298 KOps/s 20.6836 KOps/s $\color{#35bf28}+0.22\%$
test_getitem[tuple] 0.1407ms 23.8151μs 41.9902 KOps/s 42.5383 KOps/s $\color{#d91a1a}-1.29\%$
test_getitem[list] 0.1636ms 43.1544μs 23.1726 KOps/s 23.3947 KOps/s $\color{#d91a1a}-0.95\%$
test_setitem_dim[int] 44.5410μs 24.5728μs 40.6954 KOps/s 40.8279 KOps/s $\color{#d91a1a}-0.32\%$
test_setitem_dim[slice_int] 70.8720μs 47.6728μs 20.9763 KOps/s 20.7394 KOps/s $\color{#35bf28}+1.14\%$
test_setitem_dim[range] 0.1165ms 66.6353μs 15.0071 KOps/s 14.8296 KOps/s $\color{#35bf28}+1.20\%$
test_setitem_dim[tuple] 60.0910μs 39.4865μs 25.3251 KOps/s 24.3820 KOps/s $\color{#35bf28}+3.87\%$
test_setitem 52.7810μs 17.5811μs 56.8794 KOps/s 56.4874 KOps/s $\color{#35bf28}+0.69\%$
test_set 50.5810μs 16.7238μs 59.7949 KOps/s 58.6825 KOps/s $\color{#35bf28}+1.90\%$
test_set_shared 0.5000ms 0.1989ms 5.0287 KOps/s 5.0516 KOps/s $\color{#d91a1a}-0.45\%$
test_update 0.3532ms 21.2334μs 47.0955 KOps/s 46.4238 KOps/s $\color{#35bf28}+1.45\%$
test_update_nested 75.8920μs 33.2853μs 30.0433 KOps/s 29.6040 KOps/s $\color{#35bf28}+1.48\%$
test_update__nested 0.4728ms 32.7920μs 30.4952 KOps/s 30.0265 KOps/s $\color{#35bf28}+1.56\%$
test_set_nested 59.9210μs 18.4259μs 54.2713 KOps/s 53.4864 KOps/s $\color{#35bf28}+1.47\%$
test_set_nested_new 54.2710μs 23.1864μs 43.1287 KOps/s 42.4401 KOps/s $\color{#35bf28}+1.62\%$
test_select 81.0620μs 40.2213μs 24.8625 KOps/s 24.7068 KOps/s $\color{#35bf28}+0.63\%$
test_select_nested 0.1045ms 71.1356μs 14.0577 KOps/s 14.0650 KOps/s $\color{#d91a1a}-0.05\%$
test_exclude_nested 0.1308ms 92.4241μs 10.8197 KOps/s 10.7279 KOps/s $\color{#35bf28}+0.86\%$
test_empty[True] 0.4600ms 0.4178ms 2.3937 KOps/s 2.4132 KOps/s $\color{#d91a1a}-0.81\%$
test_empty[False] 8.1952μs 1.2681μs 788.5736 KOps/s 782.2337 KOps/s $\color{#35bf28}+0.81\%$
test_to 94.9520μs 65.7803μs 15.2021 KOps/s 15.3203 KOps/s $\color{#d91a1a}-0.77\%$
test_to_nonblocking 0.1008ms 57.5574μs 17.3739 KOps/s 17.2889 KOps/s $\color{#35bf28}+0.49\%$
test_unbind_speed 0.3459ms 0.3168ms 3.1569 KOps/s 3.1825 KOps/s $\color{#d91a1a}-0.81\%$
test_unbind_speed_stack0 0.3620ms 0.3127ms 3.1977 KOps/s 3.2050 KOps/s $\color{#d91a1a}-0.23\%$
test_unbind_speed_stack1 97.9835ms 0.9254ms 1.0806 KOps/s 1.1721 KOps/s $\textbf{\color{#d91a1a}-7.80\%}$
test_split 1.1706ms 1.1298ms 885.0766 Ops/s 763.5941 Ops/s $\textbf{\color{#35bf28}+15.91\%}$
test_chunk 98.1428ms 1.2115ms 825.3934 Ops/s 919.9837 Ops/s $\textbf{\color{#d91a1a}-10.28\%}$
test_consolidate[False-None] 4.0815ms 3.7634ms 265.7165 Ops/s 241.3274 Ops/s $\textbf{\color{#35bf28}+10.11\%}$
test_consolidate[default-None] 2.2066ms 2.1093ms 474.0942 Ops/s 458.7655 Ops/s $\color{#35bf28}+3.34\%$
test_consolidate[reduce-overhead-None] 2.1861ms 2.1077ms 474.4433 Ops/s 454.8644 Ops/s $\color{#35bf28}+4.30\%$
test_consolidate_njt[False-None] 8.7496ms 8.5466ms 117.0052 Ops/s 117.7364 Ops/s $\color{#d91a1a}-0.62\%$
test_to[False-False-None] 2.0665ms 2.0024ms 499.4055 Ops/s 501.7514 Ops/s $\color{#d91a1a}-0.47\%$
test_to[True-False-None] 2.1579ms 1.8697ms 534.8365 Ops/s 548.4458 Ops/s $\color{#d91a1a}-2.48\%$
test_to[within-False-None] 0.1882s 6.8481ms 146.0248 Ops/s 127.7931 Ops/s $\textbf{\color{#35bf28}+14.27\%}$
test_to[True-default-None] 6.8874ms 6.7016ms 149.2175 Ops/s 146.4585 Ops/s $\color{#35bf28}+1.88\%$
test_to_njt[False-False-None] 8.3142ms 8.1918ms 122.0735 Ops/s 121.7551 Ops/s $\color{#35bf28}+0.26\%$
test_to_njt[True-False-None] 7.1899ms 7.0550ms 141.7427 Ops/s 144.0148 Ops/s $\color{#d91a1a}-1.58\%$
test_to_njt[within-False-None] 15.7387ms 15.6450ms 63.9182 Ops/s 64.5109 Ops/s $\color{#d91a1a}-0.92\%$
test_creation[device0] 0.2794ms 0.1076ms 9.2904 KOps/s 9.3725 KOps/s $\color{#d91a1a}-0.88\%$
test_creation_from_tensor 0.4623ms 0.1101ms 9.0853 KOps/s 9.1919 KOps/s $\color{#d91a1a}-1.16\%$
test_add_one[memmap_tensor0] 0.2016ms 6.6402μs 150.5970 KOps/s 151.0192 KOps/s $\color{#d91a1a}-0.28\%$
test_contiguous[memmap_tensor0] 25.7710μs 0.7362μs 1.3583 MOps/s 1.8714 MOps/s $\textbf{\color{#d91a1a}-27.42\%}$
test_stack[memmap_tensor0] 33.7210μs 4.8365μs 206.7614 KOps/s 208.8671 KOps/s $\color{#d91a1a}-1.01\%$
test_memmaptd_index 1.1226ms 0.2856ms 3.5011 KOps/s 3.5159 KOps/s $\color{#d91a1a}-0.42\%$
test_memmaptd_index_astensor 0.5246ms 0.3742ms 2.6727 KOps/s 2.7009 KOps/s $\color{#d91a1a}-1.05\%$
test_memmaptd_index_op 0.7594ms 0.6152ms 1.6256 KOps/s 1.6500 KOps/s $\color{#d91a1a}-1.48\%$
test_serialize_model 0.1312s 0.1298s 7.7014 Ops/s 7.6864 Ops/s $\color{#35bf28}+0.20\%$
test_serialize_model_pickle 1.3486s 1.1955s 0.8364 Ops/s 0.8167 Ops/s $\color{#35bf28}+2.42\%$
test_serialize_weights 0.1310s 0.1298s 7.7049 Ops/s 6.3803 Ops/s $\textbf{\color{#35bf28}+20.76\%}$
test_serialize_weights_returnearly 0.3617s 61.5277ms 16.2528 Ops/s 15.8603 Ops/s $\color{#35bf28}+2.47\%$
test_serialize_weights_pickle 1.4196s 1.2035s 0.8309 Ops/s 0.8216 Ops/s $\color{#35bf28}+1.14\%$
test_reshape_pytree 0.3642ms 32.8819μs 30.4119 KOps/s 30.5048 KOps/s $\color{#d91a1a}-0.30\%$
test_reshape_td 66.5210μs 39.1507μs 25.5424 KOps/s 25.7117 KOps/s $\color{#d91a1a}-0.66\%$
test_view_pytree 0.2161ms 32.0202μs 31.2302 KOps/s 31.0941 KOps/s $\color{#35bf28}+0.44\%$
test_view_td 74.3510μs 45.7505μs 21.8577 KOps/s 21.8449 KOps/s $\color{#35bf28}+0.06\%$
test_unbind_pytree 0.2356ms 36.5483μs 27.3610 KOps/s 27.2443 KOps/s $\color{#35bf28}+0.43\%$
test_unbind_td 0.1514ms 47.9067μs 20.8739 KOps/s 21.0750 KOps/s $\color{#d91a1a}-0.95\%$
test_split_pytree 0.1857ms 42.4977μs 23.5307 KOps/s 23.4411 KOps/s $\color{#35bf28}+0.38\%$
test_split_td 0.2101ms 61.6364μs 16.2242 KOps/s 15.9854 KOps/s $\color{#35bf28}+1.49\%$
test_add_pytree 0.2337ms 41.8975μs 23.8678 KOps/s 23.7606 KOps/s $\color{#35bf28}+0.45\%$
test_add_td 90.8720μs 51.5673μs 19.3921 KOps/s 19.6078 KOps/s $\color{#d91a1a}-1.10\%$
test_compile_add_one_nested[tensordict-compile] 0.1882ms 0.1370ms 7.2984 KOps/s 7.0969 KOps/s $\color{#35bf28}+2.84\%$
test_compile_add_one_nested[tensordict-eager] 0.2844ms 0.1820ms 5.4957 KOps/s 5.4354 KOps/s $\color{#35bf28}+1.11\%$
test_compile_add_one_nested[pytree-compile] 0.1445ms 0.1061ms 9.4232 KOps/s 9.1275 KOps/s $\color{#35bf28}+3.24\%$
test_compile_add_one_nested[pytree-eager] 0.3557ms 0.1724ms 5.8016 KOps/s 5.7491 KOps/s $\color{#35bf28}+0.91\%$
test_compile_copy_nested[tensordict-compile] 67.9310μs 28.4683μs 35.1268 KOps/s 31.9178 KOps/s $\textbf{\color{#35bf28}+10.05\%}$
test_compile_copy_nested[tensordict-eager] 90.6420μs 48.2591μs 20.7215 KOps/s 20.2391 KOps/s $\color{#35bf28}+2.38\%$
test_compile_copy_nested[pytree-compile] 0.2182ms 13.0458μs 76.6529 KOps/s 75.2989 KOps/s $\color{#35bf28}+1.80\%$
test_compile_copy_nested[pytree-eager] 0.3941ms 71.3291μs 14.0195 KOps/s 13.8240 KOps/s $\color{#35bf28}+1.41\%$
test_compile_add_one_flat[tensordict-compile] 0.2035ms 0.1598ms 6.2567 KOps/s 6.1100 KOps/s $\color{#35bf28}+2.40\%$
test_compile_add_one_flat[tensordict-eager] 0.3276ms 0.2530ms 3.9533 KOps/s 3.9372 KOps/s $\color{#35bf28}+0.41\%$
test_compile_add_one_flat[tensorclass-compile] 0.1588ms 0.1109ms 9.0142 KOps/s 8.8990 KOps/s $\color{#35bf28}+1.29\%$
test_compile_add_one_flat[tensorclass-eager] 0.1310ms 68.2960μs 14.6421 KOps/s 14.7023 KOps/s $\color{#d91a1a}-0.41\%$
test_compile_add_one_flat[pytree-compile] 0.1980ms 0.1553ms 6.4399 KOps/s 6.3493 KOps/s $\color{#35bf28}+1.43\%$
test_compile_add_one_flat[pytree-eager] 0.7014ms 0.4953ms 2.0189 KOps/s 1.9950 KOps/s $\color{#35bf28}+1.20\%$
test_compile_add_self_flat[tensordict-eager] 0.3642ms 0.3048ms 3.2810 KOps/s 3.2525 KOps/s $\color{#35bf28}+0.87\%$
test_compile_add_self_flat[tensordict-compile] 0.2090ms 0.1618ms 6.1794 KOps/s 6.0717 KOps/s $\color{#35bf28}+1.77\%$
test_compile_add_self_flat[tensorclass-eager] 0.1485ms 82.8794μs 12.0657 KOps/s 12.1038 KOps/s $\color{#d91a1a}-0.31\%$
test_compile_add_self_flat[tensorclass-compile] 0.1531ms 0.1127ms 8.8726 KOps/s 8.7264 KOps/s $\color{#35bf28}+1.68\%$
test_compile_add_self_flat[pytree-eager] 0.5983ms 0.4264ms 2.3454 KOps/s 2.3636 KOps/s $\color{#d91a1a}-0.77\%$
test_compile_add_self_flat[pytree-compile] 0.1961ms 0.1563ms 6.3992 KOps/s 6.4123 KOps/s $\color{#d91a1a}-0.20\%$
test_compile_copy_flat[tensordict-compile] 0.1095ms 23.4626μs 42.6210 KOps/s 43.0358 KOps/s $\color{#d91a1a}-0.96\%$
test_compile_copy_flat[tensordict-eager] 77.7220μs 40.4282μs 24.7352 KOps/s 24.6455 KOps/s $\color{#35bf28}+0.36\%$
test_compile_copy_flat[pytree-compile] 51.0210μs 19.0271μs 52.5565 KOps/s 52.2326 KOps/s $\color{#35bf28}+0.62\%$
test_compile_copy_flat[pytree-eager] 0.3665ms 66.1899μs 15.1080 KOps/s 15.1760 KOps/s $\color{#d91a1a}-0.45\%$
test_compile_assign_and_add[tensordict-compile] 1.9248ms 0.4548ms 2.1988 KOps/s 1.9419 KOps/s $\textbf{\color{#35bf28}+13.23\%}$
test_compile_assign_and_add[tensordict-eager] 3.2042ms 3.1162ms 320.9085 Ops/s 317.6616 Ops/s $\color{#35bf28}+1.02\%$
test_compile_assign_and_add[pytree-compile] 1.9056ms 0.5073ms 1.9713 KOps/s 1.9611 KOps/s $\color{#35bf28}+0.52\%$
test_compile_assign_and_add[pytree-eager] 2.7783ms 2.7081ms 369.2562 Ops/s 367.9068 Ops/s $\color{#35bf28}+0.37\%$
test_compile_indexing[tensor-tensordict-compile] 0.1807ms 0.1280ms 7.8111 KOps/s 7.2981 KOps/s $\textbf{\color{#35bf28}+7.03\%}$
test_compile_indexing[tensor-tensordict-eager] 0.2576ms 91.5107μs 10.9277 KOps/s 10.5236 KOps/s $\color{#35bf28}+3.84\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1834ms 0.1252ms 7.9889 KOps/s 8.2115 KOps/s $\color{#d91a1a}-2.71\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2695ms 81.0051μs 12.3449 KOps/s 12.8339 KOps/s $\color{#d91a1a}-3.81\%$
test_compile_indexing[tensor-pytree-compile] 0.1714ms 0.1229ms 8.1372 KOps/s 7.8595 KOps/s $\color{#35bf28}+3.53\%$
test_compile_indexing[tensor-pytree-eager] 0.2974ms 80.5543μs 12.4140 KOps/s 12.3537 KOps/s $\color{#35bf28}+0.49\%$
test_compile_indexing[slice-tensordict-compile] 0.1714ms 0.1185ms 8.4388 KOps/s 8.6827 KOps/s $\color{#d91a1a}-2.81\%$
test_compile_indexing[slice-tensordict-eager] 0.1844ms 25.1358μs 39.7840 KOps/s 39.7584 KOps/s $\color{#35bf28}+0.06\%$
test_compile_indexing[slice-tensorclass-compile] 0.1415ms 0.1098ms 9.1094 KOps/s 9.0632 KOps/s $\color{#35bf28}+0.51\%$
test_compile_indexing[slice-tensorclass-eager] 0.2132ms 22.2214μs 45.0016 KOps/s 43.8865 KOps/s $\color{#35bf28}+2.54\%$
test_compile_indexing[slice-pytree-compile] 0.1474ms 0.1111ms 9.0019 KOps/s 8.9811 KOps/s $\color{#35bf28}+0.23\%$
test_compile_indexing[slice-pytree-eager] 0.2490ms 22.3912μs 44.6604 KOps/s 43.8726 KOps/s $\color{#35bf28}+1.80\%$
test_compile_indexing[int-tensordict-compile] 0.1587ms 0.1169ms 8.5543 KOps/s 8.5516 KOps/s $\color{#35bf28}+0.03\%$
test_compile_indexing[int-tensordict-eager] 0.2479ms 25.3561μs 39.4383 KOps/s 39.3002 KOps/s $\color{#35bf28}+0.35\%$
test_compile_indexing[int-tensorclass-compile] 0.1599ms 0.1154ms 8.6638 KOps/s 8.6791 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_indexing[int-tensorclass-eager] 0.2292ms 22.5082μs 44.4283 KOps/s 43.7997 KOps/s $\color{#35bf28}+1.44\%$
test_compile_indexing[int-pytree-compile] 0.1753ms 0.1137ms 8.7949 KOps/s 8.7850 KOps/s $\color{#35bf28}+0.11\%$
test_compile_indexing[int-pytree-eager] 0.2329ms 22.3531μs 44.7366 KOps/s 43.9720 KOps/s $\color{#35bf28}+1.74\%$
test_mod_add[eager] 89.5120μs 48.9383μs 20.4339 KOps/s 19.9979 KOps/s $\color{#35bf28}+2.18\%$
test_mod_add[compile] 0.1411ms 96.8751μs 10.3226 KOps/s 10.7744 KOps/s $\color{#d91a1a}-4.19\%$
test_mod_add[compile-overhead] 0.3248ms 0.1795ms 5.5695 KOps/s 5.4369 KOps/s $\color{#35bf28}+2.44\%$
test_mod_wrap[eager] 0.3706ms 0.2991ms 3.3434 KOps/s 3.4822 KOps/s $\color{#d91a1a}-3.98\%$
test_mod_wrap[compile] 0.4063ms 0.3320ms 3.0122 KOps/s 3.0441 KOps/s $\color{#d91a1a}-1.05\%$
test_mod_wrap[compile-overhead] 7.5329ms 4.1374ms 241.6958 Ops/s 253.9339 Ops/s $\color{#d91a1a}-4.82\%$
test_mod_wrap_and_backward[eager] 1.7105ms 1.5903ms 628.8086 Ops/s 628.8948 Ops/s $\color{#d91a1a}-0.01\%$
test_mod_wrap_and_backward[compile] 1.6350ms 1.5537ms 643.6048 Ops/s 639.5971 Ops/s $\color{#35bf28}+0.63\%$
test_mod_wrap_and_backward[compile-overhead] 1.5043ms 1.0464ms 955.6480 Ops/s 910.0359 Ops/s $\textbf{\color{#35bf28}+5.01\%}$
test_seq_add[eager] 0.2228ms 0.1495ms 6.6892 KOps/s 6.7735 KOps/s $\color{#d91a1a}-1.24\%$
test_seq_add[compile] 0.2872ms 0.1042ms 9.5996 KOps/s 9.5867 KOps/s $\color{#35bf28}+0.14\%$
test_seq_add[compile-overhead] 0.1847ms 0.1410ms 7.0911 KOps/s 7.2020 KOps/s $\color{#d91a1a}-1.54\%$
test_seq_wrap[eager] 0.5872ms 0.5121ms 1.9528 KOps/s 1.9817 KOps/s $\color{#d91a1a}-1.46\%$
test_seq_wrap[compile] 0.4302ms 0.3557ms 2.8115 KOps/s 2.8906 KOps/s $\color{#d91a1a}-2.74\%$
test_seq_wrap[compile-overhead] 0.2956ms 0.2480ms 4.0327 KOps/s 4.0556 KOps/s $\color{#d91a1a}-0.56\%$
test_func_call_runtime[False-eager] 1.1296ms 0.8487ms 1.1783 KOps/s 1.1960 KOps/s $\color{#d91a1a}-1.48\%$
test_func_call_runtime[False-compile] 0.9544ms 0.8833ms 1.1321 KOps/s 1.1416 KOps/s $\color{#d91a1a}-0.83\%$
test_func_call_runtime[False-compile-overhead] 0.4524ms 0.4020ms 2.4874 KOps/s 2.4929 KOps/s $\color{#d91a1a}-0.22\%$
test_func_call_runtime[True-eager] 1.2937ms 1.0577ms 945.4065 Ops/s 939.7305 Ops/s $\color{#35bf28}+0.60\%$
test_func_call_runtime[True-compile] 1.0871ms 0.9531ms 1.0492 KOps/s 1.1163 KOps/s $\textbf{\color{#d91a1a}-6.01\%}$
test_func_call_runtime[True-compile-overhead] 0.5364ms 0.4259ms 2.3477 KOps/s 2.3639 KOps/s $\color{#d91a1a}-0.68\%$
test_func_call_cm_runtime[False-eager] 0.9027ms 0.8344ms 1.1985 KOps/s 1.1925 KOps/s $\color{#35bf28}+0.50\%$
test_func_call_cm_runtime[False-compile] 0.9865ms 0.9303ms 1.0749 KOps/s 1.1357 KOps/s $\textbf{\color{#d91a1a}-5.35\%}$
test_func_call_cm_runtime[False-compile-overhead] 0.4548ms 0.4043ms 2.4735 KOps/s 2.4897 KOps/s $\color{#d91a1a}-0.65\%$
test_func_call_cm_runtime[True-eager] 1.3208ms 1.2043ms 830.3717 Ops/s 832.1951 Ops/s $\color{#d91a1a}-0.22\%$
test_func_call_cm_runtime[True-compile] 1.0528ms 0.9409ms 1.0628 KOps/s 1.0745 KOps/s $\color{#d91a1a}-1.09\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5202ms 0.4572ms 2.1870 KOps/s 2.2099 KOps/s $\color{#d91a1a}-1.04\%$
test_vmap_func_call_cm_runtime[eager] 2.7907ms 2.2699ms 440.5504 Ops/s 441.7624 Ops/s $\color{#d91a1a}-0.27\%$
test_vmap_func_call_cm_runtime[compile] 1.0155ms 0.9546ms 1.0476 KOps/s 1.0497 KOps/s $\color{#d91a1a}-0.20\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5276ms 0.4536ms 2.2046 KOps/s 2.1946 KOps/s $\color{#35bf28}+0.46\%$
test_distributed 0.5587ms 0.1494ms 6.6914 KOps/s 6.6894 KOps/s $\color{#35bf28}+0.03\%$
test_tdmodule 0.4055ms 27.4101μs 36.4829 KOps/s 36.8080 KOps/s $\color{#d91a1a}-0.88\%$
test_tdmodule_dispatch 78.8220μs 46.7680μs 21.3821 KOps/s 21.4926 KOps/s $\color{#d91a1a}-0.51\%$
test_tdseq 46.1410μs 25.4893μs 39.2322 KOps/s 38.6621 KOps/s $\color{#35bf28}+1.47\%$
test_tdseq_dispatch 72.4720μs 48.1963μs 20.7485 KOps/s 20.7178 KOps/s $\color{#35bf28}+0.15\%$
test_instantiation_functorch 2.0339ms 1.9671ms 508.3672 Ops/s 509.0152 Ops/s $\color{#d91a1a}-0.13\%$
test_exec_functorch 0.2266ms 0.1778ms 5.6241 KOps/s 5.6492 KOps/s $\color{#d91a1a}-0.44\%$
test_exec_functional_call 0.1988ms 0.1552ms 6.4413 KOps/s 6.3284 KOps/s $\color{#35bf28}+1.78\%$
test_exec_td_decorator 0.4343ms 0.2260ms 4.4243 KOps/s 4.4042 KOps/s $\color{#35bf28}+0.46\%$
test_vmap_mlp_speed_decorator[True-True] 0.9127ms 0.7580ms 1.3193 KOps/s 1.3305 KOps/s $\color{#d91a1a}-0.84\%$
test_vmap_mlp_speed_decorator[True-False] 0.9020ms 0.7585ms 1.3184 KOps/s 1.3199 KOps/s $\color{#d91a1a}-0.11\%$
test_vmap_mlp_speed_decorator[False-True] 0.7896ms 0.6480ms 1.5431 KOps/s 1.5539 KOps/s $\color{#d91a1a}-0.69\%$
test_vmap_mlp_speed_decorator[False-False] 0.8078ms 0.6492ms 1.5404 KOps/s 1.5482 KOps/s $\color{#d91a1a}-0.50\%$
test_vmap_transformer_speed_decorator[True-True] 20.2576ms 20.0748ms 49.8136 Ops/s 50.0376 Ops/s $\color{#d91a1a}-0.45\%$
test_vmap_transformer_speed_decorator[True-False] 20.2162ms 20.1184ms 49.7057 Ops/s 50.0088 Ops/s $\color{#d91a1a}-0.61\%$
test_vmap_transformer_speed_decorator[False-True] 20.0277ms 19.9013ms 50.2481 Ops/s 50.5813 Ops/s $\color{#d91a1a}-0.66\%$
test_vmap_transformer_speed_decorator[False-False] 20.0628ms 19.9286ms 50.1791 Ops/s 50.5180 Ops/s $\color{#d91a1a}-0.67\%$
test_to_module_speed[True] 1.4903ms 1.3983ms 715.1680 Ops/s 712.4137 Ops/s $\color{#35bf28}+0.39\%$
test_to_module_speed[False] 1.4811ms 1.3848ms 722.1317 Ops/s 725.4212 Ops/s $\color{#d91a1a}-0.45\%$
test_tc_init 83.9320μs 50.3855μs 19.8470 KOps/s 20.0179 KOps/s $\color{#d91a1a}-0.85\%$
test_tc_init_tensor_only 94.9120μs 14.3269μs 69.7986 KOps/s 69.1816 KOps/s $\color{#35bf28}+0.89\%$
test_tc_init_nested 0.1354ms 99.3928μs 10.0611 KOps/s 10.0726 KOps/s $\color{#d91a1a}-0.11\%$
test_tc_first_layer_tensor 28.1210μs 1.6624μs 601.5370 KOps/s 590.3384 KOps/s $\color{#35bf28}+1.90\%$
test_tc_first_layer_tensor_only 3.8500μs 0.6605μs 1.5139 MOps/s 1.5263 MOps/s $\color{#d91a1a}-0.81\%$
test_tc_first_layer_tensor_set 33.5400μs 4.0115μs 249.2830 KOps/s 249.3624 KOps/s $\color{#d91a1a}-0.03\%$
test_tc_first_layer_tensor_only_set 18.0055μs 2.8950μs 345.4179 KOps/s 339.8841 KOps/s $\color{#35bf28}+1.63\%$
test_tc_first_layer_nontensor 34.5910μs 5.6165μs 178.0478 KOps/s 176.7605 KOps/s $\color{#35bf28}+0.73\%$
test_tc_second_layer_tensor 34.0410μs 4.0711μs 245.6317 KOps/s 244.4600 KOps/s $\color{#35bf28}+0.48\%$
test_tc_second_layer_nontensor 39.4210μs 7.9514μs 125.7639 KOps/s 124.8201 KOps/s $\color{#35bf28}+0.76\%$
test_unbind 0.2336s 13.1926ms 75.8003 Ops/s 107.1149 Ops/s $\textbf{\color{#d91a1a}-29.23\%}$
test_full_like 4.5673ms 4.4153ms 226.4839 Ops/s 229.7766 Ops/s $\color{#d91a1a}-1.43\%$
test_zeros_like 4.5417ms 4.3805ms 228.2859 Ops/s 228.0004 Ops/s $\color{#35bf28}+0.13\%$
test_ones_like 4.5572ms 4.3958ms 227.4919 Ops/s 227.3513 Ops/s $\color{#35bf28}+0.06\%$
test_clone 7.2366ms 6.5875ms 151.8036 Ops/s 151.3825 Ops/s $\color{#35bf28}+0.28\%$
test_squeeze 89.3420μs 13.9196μs 71.8414 KOps/s 72.3222 KOps/s $\color{#d91a1a}-0.66\%$
test_unsqueeze 0.1500ms 0.1035ms 9.6618 KOps/s 9.8182 KOps/s $\color{#d91a1a}-1.59\%$
test_split 0.2305ms 0.1799ms 5.5598 KOps/s 5.7154 KOps/s $\color{#d91a1a}-2.72\%$
test_permute 0.2353ms 0.2000ms 4.9996 KOps/s 5.0488 KOps/s $\color{#d91a1a}-0.97\%$
test_stack 51.5902ms 50.9554ms 19.6250 Ops/s 19.6606 Ops/s $\color{#d91a1a}-0.18\%$
test_cat 51.6551ms 50.9222ms 19.6378 Ops/s 19.6517 Ops/s $\color{#d91a1a}-0.07\%$

@github-actions
Copy link

github-actions bot commented Sep 8, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 233. Improved: $\large\color{#35bf28}25$. Worsened: $\large\color{#d91a1a}3$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 36.6910μs 15.1642μs 65.9449 KOps/s 66.7782 KOps/s $\color{#d91a1a}-1.25\%$
test_plain_set_stack_nested 38.4010μs 15.4435μs 64.7520 KOps/s 66.2603 KOps/s $\color{#d91a1a}-2.28\%$
test_plain_set_nested_inplace 45.4910μs 16.8044μs 59.5082 KOps/s 60.4131 KOps/s $\color{#d91a1a}-1.50\%$
test_plain_set_stack_nested_inplace 44.6310μs 16.7105μs 59.8426 KOps/s 60.5106 KOps/s $\color{#d91a1a}-1.10\%$
test_items 29.2000μs 6.1171μs 163.4762 KOps/s 166.7135 KOps/s $\color{#d91a1a}-1.94\%$
test_items_nested 0.6160ms 0.5472ms 1.8274 KOps/s 1.8726 KOps/s $\color{#d91a1a}-2.42\%$
test_items_nested_locked 0.6175ms 0.5536ms 1.8062 KOps/s 1.8758 KOps/s $\color{#d91a1a}-3.71\%$
test_items_nested_leaf 0.1532ms 95.7135μs 10.4478 KOps/s 10.3769 KOps/s $\color{#35bf28}+0.68\%$
test_items_stack_nested 0.6787ms 0.5503ms 1.8172 KOps/s 1.8586 KOps/s $\color{#d91a1a}-2.22\%$
test_items_stack_nested_leaf 0.1507ms 97.2536μs 10.2824 KOps/s 10.3625 KOps/s $\color{#d91a1a}-0.77\%$
test_items_stack_nested_locked 0.6442ms 0.5446ms 1.8361 KOps/s 1.8668 KOps/s $\color{#d91a1a}-1.65\%$
test_keys 34.5510μs 4.1985μs 238.1795 KOps/s 235.1449 KOps/s $\color{#35bf28}+1.29\%$
test_keys_nested 0.1902ms 0.1211ms 8.2601 KOps/s 8.3538 KOps/s $\color{#d91a1a}-1.12\%$
test_keys_nested_locked 2.1546ms 0.1302ms 7.6788 KOps/s 7.7667 KOps/s $\color{#d91a1a}-1.13\%$
test_keys_nested_leaf 0.1771ms 0.1125ms 8.8912 KOps/s 9.0988 KOps/s $\color{#d91a1a}-2.28\%$
test_keys_stack_nested 0.1980ms 0.1212ms 8.2489 KOps/s 8.3219 KOps/s $\color{#d91a1a}-0.88\%$
test_keys_stack_nested_leaf 0.1840ms 0.1130ms 8.8502 KOps/s 9.1293 KOps/s $\color{#d91a1a}-3.06\%$
test_keys_stack_nested_locked 0.1818ms 0.1300ms 7.6901 KOps/s 7.8671 KOps/s $\color{#d91a1a}-2.25\%$
test_values 5.3520μs 1.0190μs 981.3182 KOps/s 938.1585 KOps/s $\color{#35bf28}+4.60\%$
test_values_nested 76.1710μs 48.6177μs 20.5686 KOps/s 20.6476 KOps/s $\color{#d91a1a}-0.38\%$
test_values_nested_locked 0.1090ms 51.4654μs 19.4305 KOps/s 19.5484 KOps/s $\color{#d91a1a}-0.60\%$
test_values_nested_leaf 76.8720μs 55.7684μs 17.9313 KOps/s 18.4588 KOps/s $\color{#d91a1a}-2.86\%$
test_values_stack_nested 86.4220μs 48.6056μs 20.5738 KOps/s 20.9867 KOps/s $\color{#d91a1a}-1.97\%$
test_values_stack_nested_leaf 77.5310μs 55.4287μs 18.0412 KOps/s 18.5664 KOps/s $\color{#d91a1a}-2.83\%$
test_values_stack_nested_locked 83.8310μs 51.8097μs 19.3014 KOps/s 19.6973 KOps/s $\color{#d91a1a}-2.01\%$
test_membership 4.5652μs 0.8438μs 1.1850 MOps/s 1.1627 MOps/s $\color{#35bf28}+1.92\%$
test_membership_nested 49.3610μs 3.1340μs 319.0859 KOps/s 323.1977 KOps/s $\color{#d91a1a}-1.27\%$
test_membership_nested_leaf 26.4100μs 3.1372μs 318.7557 KOps/s 322.2289 KOps/s $\color{#d91a1a}-1.08\%$
test_membership_stacked_nested 33.5610μs 3.1446μs 318.0061 KOps/s 318.2355 KOps/s $\color{#d91a1a}-0.07\%$
test_membership_stacked_nested_leaf 38.9710μs 3.1500μs 317.4626 KOps/s 319.1481 KOps/s $\color{#d91a1a}-0.53\%$
test_membership_nested_last 29.8700μs 4.6072μs 217.0515 KOps/s 221.3848 KOps/s $\color{#d91a1a}-1.96\%$
test_membership_nested_leaf_last 36.0500μs 4.5567μs 219.4571 KOps/s 219.4107 KOps/s $\color{#35bf28}+0.02\%$
test_membership_stacked_nested_last 31.6700μs 4.5705μs 218.7924 KOps/s 220.0106 KOps/s $\color{#d91a1a}-0.55\%$
test_membership_stacked_nested_leaf_last 40.7300μs 4.5929μs 217.7286 KOps/s 222.1470 KOps/s $\color{#d91a1a}-1.99\%$
test_nested_getleaf 50.1110μs 21.6044μs 46.2868 KOps/s 46.0135 KOps/s $\color{#35bf28}+0.59\%$
test_nested_get 70.6010μs 20.6471μs 48.4329 KOps/s 49.8245 KOps/s $\color{#d91a1a}-2.79\%$
test_stacked_getleaf 53.3110μs 21.5206μs 46.4671 KOps/s 46.5962 KOps/s $\color{#d91a1a}-0.28\%$
test_stacked_get 45.4710μs 20.4845μs 48.8173 KOps/s 47.9813 KOps/s $\color{#35bf28}+1.74\%$
test_nested_getitemleaf 81.3920μs 22.1711μs 45.1038 KOps/s 45.9617 KOps/s $\color{#d91a1a}-1.87\%$
test_nested_getitem 49.3210μs 21.0529μs 47.4994 KOps/s 47.8894 KOps/s $\color{#d91a1a}-0.81\%$
test_stacked_getitemleaf 45.9210μs 21.9790μs 45.4979 KOps/s 45.5896 KOps/s $\color{#d91a1a}-0.20\%$
test_stacked_getitem 56.9710μs 21.1918μs 47.1881 KOps/s 48.2421 KOps/s $\color{#d91a1a}-2.18\%$
test_lock_nested 0.5676ms 0.4722ms 2.1176 KOps/s 2.1059 KOps/s $\color{#35bf28}+0.55\%$
test_lock_stack_nested 0.5467ms 0.4721ms 2.1182 KOps/s 2.0990 KOps/s $\color{#35bf28}+0.91\%$
test_unlock_nested 0.4477ms 0.3828ms 2.6125 KOps/s 2.6113 KOps/s $\color{#35bf28}+0.04\%$
test_unlock_stack_nested 0.4479ms 0.3795ms 2.6353 KOps/s 2.6026 KOps/s $\color{#35bf28}+1.26\%$
test_flatten_speed 0.1676ms 0.1210ms 8.2661 KOps/s 8.1884 KOps/s $\color{#35bf28}+0.95\%$
test_unflatten_speed 0.6520ms 0.5963ms 1.6771 KOps/s 1.6976 KOps/s $\color{#d91a1a}-1.21\%$
test_common_ops 0.9577ms 0.7541ms 1.3260 KOps/s 1.3412 KOps/s $\color{#d91a1a}-1.13\%$
test_creation 50.2510μs 2.7440μs 364.4269 KOps/s 363.6631 KOps/s $\color{#35bf28}+0.21\%$
test_creation_empty 48.2900μs 8.9183μs 112.1286 KOps/s 111.4981 KOps/s $\color{#35bf28}+0.57\%$
test_creation_nested_1 45.3710μs 12.1417μs 82.3609 KOps/s 83.0591 KOps/s $\color{#d91a1a}-0.84\%$
test_creation_nested_2 41.8500μs 15.9086μs 62.8592 KOps/s 62.9599 KOps/s $\color{#d91a1a}-0.16\%$
test_clone 45.9110μs 12.9926μs 76.9668 KOps/s 76.3221 KOps/s $\color{#35bf28}+0.84\%$
test_getitem[int] 1.1523ms 14.1793μs 70.5254 KOps/s 68.5468 KOps/s $\color{#35bf28}+2.89\%$
test_getitem[slice_int] 0.1545ms 28.2159μs 35.4410 KOps/s 33.6743 KOps/s $\textbf{\color{#35bf28}+5.25\%}$
test_getitem[range] 0.1643ms 51.6009μs 19.3795 KOps/s 18.7200 KOps/s $\color{#35bf28}+3.52\%$
test_getitem[tuple] 0.1445ms 24.1392μs 41.4264 KOps/s 39.7543 KOps/s $\color{#35bf28}+4.21\%$
test_getitem[list] 0.1711ms 44.2219μs 22.6132 KOps/s 21.9203 KOps/s $\color{#35bf28}+3.16\%$
test_setitem_dim[int] 46.2200μs 24.4786μs 40.8521 KOps/s 40.5474 KOps/s $\color{#35bf28}+0.75\%$
test_setitem_dim[slice_int] 72.0910μs 49.1798μs 20.3336 KOps/s 19.9882 KOps/s $\color{#35bf28}+1.73\%$
test_setitem_dim[range] 92.9010μs 69.1202μs 14.4675 KOps/s 13.9891 KOps/s $\color{#35bf28}+3.42\%$
test_setitem_dim[tuple] 65.5710μs 42.2930μs 23.6446 KOps/s 23.5812 KOps/s $\color{#35bf28}+0.27\%$
test_setitem 53.1510μs 17.9580μs 55.6855 KOps/s 56.8226 KOps/s $\color{#d91a1a}-2.00\%$
test_set 45.8810μs 16.9262μs 59.0799 KOps/s 59.0098 KOps/s $\color{#35bf28}+0.12\%$
test_set_shared 0.7133ms 0.2019ms 4.9537 KOps/s 4.9993 KOps/s $\color{#d91a1a}-0.91\%$
test_update 0.4183ms 21.4368μs 46.6488 KOps/s 45.8098 KOps/s $\color{#35bf28}+1.83\%$
test_update_nested 83.5110μs 34.4335μs 29.0415 KOps/s 28.2421 KOps/s $\color{#35bf28}+2.83\%$
test_update__nested 0.4785ms 34.5425μs 28.9499 KOps/s 30.0784 KOps/s $\color{#d91a1a}-3.75\%$
test_set_nested 60.3210μs 18.8501μs 53.0501 KOps/s 52.9091 KOps/s $\color{#35bf28}+0.27\%$
test_set_nested_new 62.4510μs 23.8284μs 41.9666 KOps/s 42.3282 KOps/s $\color{#d91a1a}-0.85\%$
test_select 78.0820μs 40.9398μs 24.4261 KOps/s 24.0730 KOps/s $\color{#35bf28}+1.47\%$
test_select_nested 0.1295ms 74.2580μs 13.4666 KOps/s 13.3901 KOps/s $\color{#35bf28}+0.57\%$
test_exclude_nested 0.1315ms 99.6445μs 10.0357 KOps/s 10.1809 KOps/s $\color{#d91a1a}-1.43\%$
test_empty[True] 0.4932ms 0.4376ms 2.2854 KOps/s 2.3339 KOps/s $\color{#d91a1a}-2.08\%$
test_empty[False] 24.2577μs 1.3479μs 741.9110 KOps/s 748.4252 KOps/s $\color{#d91a1a}-0.87\%$
test_to 95.8710μs 68.0885μs 14.6868 KOps/s 14.3747 KOps/s $\color{#35bf28}+2.17\%$
test_to_nonblocking 0.1118ms 59.8074μs 16.7203 KOps/s 16.0636 KOps/s $\color{#35bf28}+4.09\%$
test_unbind_speed 0.3525ms 0.3252ms 3.0754 KOps/s 3.0664 KOps/s $\color{#35bf28}+0.29\%$
test_unbind_speed_stack0 0.3784ms 0.3224ms 3.1016 KOps/s 3.0822 KOps/s $\color{#35bf28}+0.63\%$
test_unbind_speed_stack1 97.6098ms 0.8672ms 1.1531 KOps/s 1.1363 KOps/s $\color{#35bf28}+1.49\%$
test_split 97.5940ms 1.2876ms 776.6108 Ops/s 734.0956 Ops/s $\textbf{\color{#35bf28}+5.79\%}$
test_chunk 97.2858ms 1.2470ms 801.9034 Ops/s 892.5312 Ops/s $\textbf{\color{#d91a1a}-10.15\%}$
test_consolidate[False-None] 4.2069ms 3.9133ms 255.5369 Ops/s 232.8007 Ops/s $\textbf{\color{#35bf28}+9.77\%}$
test_consolidate[default-None] 2.4011ms 2.1937ms 455.8609 Ops/s 435.0066 Ops/s $\color{#35bf28}+4.79\%$
test_consolidate[reduce-overhead-None] 2.2978ms 2.1909ms 456.4372 Ops/s 433.6089 Ops/s $\textbf{\color{#35bf28}+5.26\%}$
test_consolidate_njt[False-None] 9.0758ms 8.8877ms 112.5153 Ops/s 117.3580 Ops/s $\color{#d91a1a}-4.13\%$
test_to[False-False-None] 2.1581ms 2.0673ms 483.7147 Ops/s 492.3819 Ops/s $\color{#d91a1a}-1.76\%$
test_to[True-False-None] 2.1792ms 1.9340ms 517.0727 Ops/s 523.3989 Ops/s $\color{#d91a1a}-1.21\%$
test_to[within-False-None] 6.2966ms 5.9927ms 166.8693 Ops/s 172.1882 Ops/s $\color{#d91a1a}-3.09\%$
test_to[True-default-None] 7.1110ms 6.9865ms 143.1339 Ops/s 142.3756 Ops/s $\color{#35bf28}+0.53\%$
test_to_njt[False-False-None] 8.6764ms 8.5197ms 117.3746 Ops/s 118.0319 Ops/s $\color{#d91a1a}-0.56\%$
test_to_njt[True-False-None] 7.4546ms 7.3289ms 136.4458 Ops/s 139.1505 Ops/s $\color{#d91a1a}-1.94\%$
test_to_njt[within-False-None] 16.3533ms 16.2569ms 61.5122 Ops/s 43.7586 Ops/s $\textbf{\color{#35bf28}+40.57\%}$
test_creation[device0] 0.4072ms 0.1089ms 9.1796 KOps/s 9.1472 KOps/s $\color{#35bf28}+0.35\%$
test_creation_from_tensor 0.4067ms 0.1115ms 8.9662 KOps/s 9.0893 KOps/s $\color{#d91a1a}-1.35\%$
test_add_one[memmap_tensor0] 0.3377ms 6.7550μs 148.0395 KOps/s 153.3494 KOps/s $\color{#d91a1a}-3.46\%$
test_contiguous[memmap_tensor0] 28.1710μs 0.8034μs 1.2447 MOps/s 1.7190 MOps/s $\textbf{\color{#d91a1a}-27.60\%}$
test_stack[memmap_tensor0] 35.9500μs 4.9343μs 202.6616 KOps/s 210.9611 KOps/s $\color{#d91a1a}-3.93\%$
test_memmaptd_index 1.1055ms 0.2922ms 3.4223 KOps/s 3.4701 KOps/s $\color{#d91a1a}-1.38\%$
test_memmaptd_index_astensor 0.5408ms 0.3862ms 2.5893 KOps/s 2.6269 KOps/s $\color{#d91a1a}-1.43\%$
test_memmaptd_index_op 0.7851ms 0.6229ms 1.6055 KOps/s 1.6230 KOps/s $\color{#d91a1a}-1.08\%$
test_serialize_model 0.1309s 0.1300s 7.6911 Ops/s 7.6545 Ops/s $\color{#35bf28}+0.48\%$
test_serialize_model_pickle 1.3659s 1.2141s 0.8237 Ops/s 0.8443 Ops/s $\color{#d91a1a}-2.44\%$
test_serialize_weights 0.1295s 0.1289s 7.7576 Ops/s 7.7318 Ops/s $\color{#35bf28}+0.33\%$
test_serialize_weights_returnearly 0.2405s 58.4036ms 17.1222 Ops/s 15.3998 Ops/s $\textbf{\color{#35bf28}+11.18\%}$
test_serialize_weights_pickle 1.3749s 1.2149s 0.8231 Ops/s 0.8241 Ops/s $\color{#d91a1a}-0.12\%$
test_reshape_pytree 0.3602ms 34.1503μs 29.2823 KOps/s 29.2168 KOps/s $\color{#35bf28}+0.22\%$
test_reshape_td 96.7220μs 40.2791μs 24.8268 KOps/s 24.6665 KOps/s $\color{#35bf28}+0.65\%$
test_view_pytree 0.2304ms 33.4194μs 29.9228 KOps/s 30.0193 KOps/s $\color{#d91a1a}-0.32\%$
test_view_td 78.3320μs 48.0461μs 20.8133 KOps/s 19.6690 KOps/s $\textbf{\color{#35bf28}+5.82\%}$
test_unbind_pytree 0.2400ms 37.6723μs 26.5447 KOps/s 26.2931 KOps/s $\color{#35bf28}+0.96\%$
test_unbind_td 0.1630ms 50.5986μs 19.7634 KOps/s 19.4037 KOps/s $\color{#35bf28}+1.85\%$
test_split_pytree 0.2527ms 44.5788μs 22.4322 KOps/s 21.3864 KOps/s $\color{#35bf28}+4.89\%$
test_split_td 0.1958ms 70.0770μs 14.2700 KOps/s 14.5321 KOps/s $\color{#d91a1a}-1.80\%$
test_add_pytree 0.1979ms 45.6051μs 21.9274 KOps/s 21.2619 KOps/s $\color{#35bf28}+3.13\%$
test_add_td 0.1057ms 60.1159μs 16.6345 KOps/s 16.9722 KOps/s $\color{#d91a1a}-1.99\%$
test_compile_add_one_nested[tensordict-compile] 0.2016ms 0.1388ms 7.2044 KOps/s 6.7100 KOps/s $\textbf{\color{#35bf28}+7.37\%}$
test_compile_add_one_nested[tensordict-eager] 0.4068ms 0.1895ms 5.2767 KOps/s 5.2863 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_add_one_nested[pytree-compile] 0.1478ms 0.1081ms 9.2499 KOps/s 8.6878 KOps/s $\textbf{\color{#35bf28}+6.47\%}$
test_compile_add_one_nested[pytree-eager] 0.3635ms 0.1800ms 5.5566 KOps/s 5.5301 KOps/s $\color{#35bf28}+0.48\%$
test_compile_copy_nested[tensordict-compile] 62.6810μs 29.8278μs 33.5258 KOps/s 30.5630 KOps/s $\textbf{\color{#35bf28}+9.69\%}$
test_compile_copy_nested[tensordict-eager] 96.0620μs 50.9171μs 19.6398 KOps/s 19.7243 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_copy_nested[pytree-compile] 57.2410μs 13.6571μs 73.2222 KOps/s 68.4856 KOps/s $\textbf{\color{#35bf28}+6.92\%}$
test_compile_copy_nested[pytree-eager] 0.4128ms 75.6532μs 13.2182 KOps/s 13.1871 KOps/s $\color{#35bf28}+0.24\%$
test_compile_add_one_flat[tensordict-compile] 0.2480ms 0.1635ms 6.1177 KOps/s 5.5636 KOps/s $\textbf{\color{#35bf28}+9.96\%}$
test_compile_add_one_flat[tensordict-eager] 0.3246ms 0.2586ms 3.8671 KOps/s 3.8600 KOps/s $\color{#35bf28}+0.18\%$
test_compile_add_one_flat[tensorclass-compile] 0.1520ms 0.1135ms 8.8131 KOps/s 8.4387 KOps/s $\color{#35bf28}+4.44\%$
test_compile_add_one_flat[tensorclass-eager] 0.1344ms 68.7421μs 14.5471 KOps/s 14.5293 KOps/s $\color{#35bf28}+0.12\%$
test_compile_add_one_flat[pytree-compile] 0.2005ms 0.1578ms 6.3370 KOps/s 6.0989 KOps/s $\color{#35bf28}+3.90\%$
test_compile_add_one_flat[pytree-eager] 0.7452ms 0.5139ms 1.9458 KOps/s 1.9194 KOps/s $\color{#35bf28}+1.38\%$
test_compile_add_self_flat[tensordict-eager] 0.4480ms 0.3126ms 3.1993 KOps/s 3.1988 KOps/s $\color{#35bf28}+0.02\%$
test_compile_add_self_flat[tensordict-compile] 0.2034ms 0.1659ms 6.0285 KOps/s 5.8062 KOps/s $\color{#35bf28}+3.83\%$
test_compile_add_self_flat[tensorclass-eager] 0.1549ms 84.0771μs 11.8939 KOps/s 11.8469 KOps/s $\color{#35bf28}+0.40\%$
test_compile_add_self_flat[tensorclass-compile] 0.1763ms 0.1152ms 8.6811 KOps/s 8.4001 KOps/s $\color{#35bf28}+3.34\%$
test_compile_add_self_flat[pytree-eager] 0.6157ms 0.4334ms 2.3076 KOps/s 2.2774 KOps/s $\color{#35bf28}+1.32\%$
test_compile_add_self_flat[pytree-compile] 0.2080ms 0.1595ms 6.2694 KOps/s 6.0626 KOps/s $\color{#35bf28}+3.41\%$
test_compile_copy_flat[tensordict-compile] 58.0310μs 22.9560μs 43.5616 KOps/s 41.3333 KOps/s $\textbf{\color{#35bf28}+5.39\%}$
test_compile_copy_flat[tensordict-eager] 72.2310μs 41.8971μs 23.8680 KOps/s 24.3807 KOps/s $\color{#d91a1a}-2.10\%$
test_compile_copy_flat[pytree-compile] 49.2700μs 19.5126μs 51.2488 KOps/s 49.4956 KOps/s $\color{#35bf28}+3.54\%$
test_compile_copy_flat[pytree-eager] 0.3685ms 68.7026μs 14.5555 KOps/s 14.3757 KOps/s $\color{#35bf28}+1.25\%$
test_compile_assign_and_add[tensordict-compile] 2.0053ms 0.5316ms 1.8811 KOps/s 1.8489 KOps/s $\color{#35bf28}+1.74\%$
test_compile_assign_and_add[tensordict-eager] 3.2518ms 3.1508ms 317.3835 Ops/s 310.1776 Ops/s $\color{#35bf28}+2.32\%$
test_compile_assign_and_add[pytree-compile] 1.9535ms 0.5163ms 1.9368 KOps/s 1.8461 KOps/s $\color{#35bf28}+4.91\%$
test_compile_assign_and_add[pytree-eager] 2.9277ms 2.7878ms 358.7014 Ops/s 358.1482 Ops/s $\color{#35bf28}+0.15\%$
test_compile_indexing[tensor-tensordict-compile] 0.2016ms 0.1305ms 7.6611 KOps/s 7.3975 KOps/s $\color{#35bf28}+3.56\%$
test_compile_indexing[tensor-tensordict-eager] 0.2667ms 94.4012μs 10.5931 KOps/s 10.6563 KOps/s $\color{#d91a1a}-0.59\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1704ms 0.1224ms 8.1703 KOps/s 7.6835 KOps/s $\textbf{\color{#35bf28}+6.34\%}$
test_compile_indexing[tensor-tensorclass-eager] 0.2709ms 80.1452μs 12.4774 KOps/s 11.9148 KOps/s $\color{#35bf28}+4.72\%$
test_compile_indexing[tensor-pytree-compile] 0.1859ms 0.1241ms 8.0580 KOps/s 7.6572 KOps/s $\textbf{\color{#35bf28}+5.23\%}$
test_compile_indexing[tensor-pytree-eager] 0.3192ms 80.2622μs 12.4592 KOps/s 11.7417 KOps/s $\textbf{\color{#35bf28}+6.11\%}$
test_compile_indexing[slice-tensordict-compile] 0.1906ms 0.1185ms 8.4391 KOps/s 8.3751 KOps/s $\color{#35bf28}+0.76\%$
test_compile_indexing[slice-tensordict-eager] 0.1931ms 26.0248μs 38.4248 KOps/s 37.5720 KOps/s $\color{#35bf28}+2.27\%$
test_compile_indexing[slice-tensorclass-compile] 0.1520ms 0.1126ms 8.8814 KOps/s 8.6710 KOps/s $\color{#35bf28}+2.43\%$
test_compile_indexing[slice-tensorclass-eager] 0.2093ms 23.3580μs 42.8119 KOps/s 42.8388 KOps/s $\color{#d91a1a}-0.06\%$
test_compile_indexing[slice-pytree-compile] 0.1729ms 0.1133ms 8.8285 KOps/s 8.6373 KOps/s $\color{#35bf28}+2.21\%$
test_compile_indexing[slice-pytree-eager] 0.2665ms 23.3870μs 42.7587 KOps/s 43.0167 KOps/s $\color{#d91a1a}-0.60\%$
test_compile_indexing[int-tensordict-compile] 0.1674ms 0.1195ms 8.3655 KOps/s 8.3370 KOps/s $\color{#35bf28}+0.34\%$
test_compile_indexing[int-tensordict-eager] 0.1993ms 27.2935μs 36.6388 KOps/s 37.4512 KOps/s $\color{#d91a1a}-2.17\%$
test_compile_indexing[int-tensorclass-compile] 0.1757ms 0.1128ms 8.8649 KOps/s 8.6145 KOps/s $\color{#35bf28}+2.91\%$
test_compile_indexing[int-tensorclass-eager] 0.2279ms 23.2889μs 42.9390 KOps/s 42.8180 KOps/s $\color{#35bf28}+0.28\%$
test_compile_indexing[int-pytree-compile] 0.1531ms 0.1143ms 8.7505 KOps/s 8.4340 KOps/s $\color{#35bf28}+3.75\%$
test_compile_indexing[int-pytree-eager] 0.2543ms 24.0473μs 41.5847 KOps/s 43.0981 KOps/s $\color{#d91a1a}-3.51\%$
test_mod_add[eager] 0.1055ms 48.4693μs 20.6316 KOps/s 19.7134 KOps/s $\color{#35bf28}+4.66\%$
test_mod_add[compile] 0.1496ms 94.3123μs 10.6031 KOps/s 9.5255 KOps/s $\textbf{\color{#35bf28}+11.31\%}$
test_mod_add[compile-overhead] 0.3622ms 0.1891ms 5.2887 KOps/s 5.2180 KOps/s $\color{#35bf28}+1.36\%$
test_mod_wrap[eager] 0.3712ms 0.2932ms 3.4102 KOps/s 3.1755 KOps/s $\textbf{\color{#35bf28}+7.39\%}$
test_mod_wrap[compile] 0.4059ms 0.3372ms 2.9656 KOps/s 2.9268 KOps/s $\color{#35bf28}+1.33\%$
test_mod_wrap[compile-overhead] 7.5744ms 4.1610ms 240.3240 Ops/s 244.4686 Ops/s $\color{#d91a1a}-1.70\%$
test_mod_wrap_and_backward[eager] 1.6320ms 1.5240ms 656.1763 Ops/s 659.4620 Ops/s $\color{#d91a1a}-0.50\%$
test_mod_wrap_and_backward[compile] 1.6688ms 1.4971ms 667.9700 Ops/s 662.2595 Ops/s $\color{#35bf28}+0.86\%$
test_mod_wrap_and_backward[compile-overhead] 1.5036ms 1.0128ms 987.3267 Ops/s 919.7811 Ops/s $\textbf{\color{#35bf28}+7.34\%}$
test_seq_add[eager] 0.2080ms 0.1526ms 6.5524 KOps/s 6.2536 KOps/s $\color{#35bf28}+4.78\%$
test_seq_add[compile] 0.1476ms 0.1059ms 9.4423 KOps/s 8.8988 KOps/s $\textbf{\color{#35bf28}+6.11\%}$
test_seq_add[compile-overhead] 0.2048ms 0.1426ms 7.0129 KOps/s 6.7802 KOps/s $\color{#35bf28}+3.43\%$
test_seq_wrap[eager] 0.6072ms 0.5204ms 1.9217 KOps/s 1.8346 KOps/s $\color{#35bf28}+4.75\%$
test_seq_wrap[compile] 0.4256ms 0.3559ms 2.8097 KOps/s 2.6810 KOps/s $\color{#35bf28}+4.80\%$
test_seq_wrap[compile-overhead] 0.3261ms 0.2538ms 3.9400 KOps/s 3.9531 KOps/s $\color{#d91a1a}-0.33\%$
test_func_call_runtime[False-eager] 1.0044ms 0.8448ms 1.1837 KOps/s 1.1222 KOps/s $\textbf{\color{#35bf28}+5.47\%}$
test_func_call_runtime[False-compile] 0.9566ms 0.9040ms 1.1062 KOps/s 1.0679 KOps/s $\color{#35bf28}+3.59\%$
test_func_call_runtime[False-compile-overhead] 0.5193ms 0.4176ms 2.3949 KOps/s 2.3856 KOps/s $\color{#35bf28}+0.39\%$
test_func_call_runtime[True-eager] 1.1618ms 1.0813ms 924.7865 Ops/s 908.8353 Ops/s $\color{#35bf28}+1.76\%$
test_func_call_runtime[True-compile] 1.0480ms 0.9257ms 1.0803 KOps/s 1.0196 KOps/s $\textbf{\color{#35bf28}+5.95\%}$
test_func_call_runtime[True-compile-overhead] 0.4943ms 0.4374ms 2.2860 KOps/s 2.2618 KOps/s $\color{#35bf28}+1.07\%$
test_func_call_cm_runtime[False-eager] 0.9887ms 0.8457ms 1.1824 KOps/s 1.1673 KOps/s $\color{#35bf28}+1.30\%$
test_func_call_cm_runtime[False-compile] 0.9762ms 0.9117ms 1.0968 KOps/s 1.0461 KOps/s $\color{#35bf28}+4.84\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5034ms 0.4190ms 2.3868 KOps/s 2.3849 KOps/s $\color{#35bf28}+0.08\%$
test_func_call_cm_runtime[True-eager] 1.3540ms 1.2326ms 811.2711 Ops/s 807.7839 Ops/s $\color{#35bf28}+0.43\%$
test_func_call_cm_runtime[True-compile] 1.0975ms 0.9594ms 1.0423 KOps/s 1.0210 KOps/s $\color{#35bf28}+2.09\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5312ms 0.4698ms 2.1284 KOps/s 2.1221 KOps/s $\color{#35bf28}+0.30\%$
test_vmap_func_call_cm_runtime[eager] 2.8142ms 2.3121ms 432.5121 Ops/s 429.4254 Ops/s $\color{#35bf28}+0.72\%$
test_vmap_func_call_cm_runtime[compile] 1.0486ms 0.9774ms 1.0231 KOps/s 995.2215 Ops/s $\color{#35bf28}+2.80\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5392ms 0.4678ms 2.1378 KOps/s 2.1102 KOps/s $\color{#35bf28}+1.31\%$
test_distributed 2.5872ms 0.1614ms 6.1963 KOps/s 6.5357 KOps/s $\textbf{\color{#d91a1a}-5.19\%}$
test_tdmodule 0.6884ms 28.1632μs 35.5073 KOps/s 36.5989 KOps/s $\color{#d91a1a}-2.98\%$
test_tdmodule_dispatch 72.2510μs 48.3008μs 20.7036 KOps/s 20.9620 KOps/s $\color{#d91a1a}-1.23\%$
test_tdseq 47.4010μs 26.2628μs 38.0766 KOps/s 37.3130 KOps/s $\color{#35bf28}+2.05\%$
test_tdseq_dispatch 71.3810μs 49.5396μs 20.1859 KOps/s 20.0388 KOps/s $\color{#35bf28}+0.73\%$
test_instantiation_functorch 2.1817ms 2.0694ms 483.2358 Ops/s 487.2931 Ops/s $\color{#d91a1a}-0.83\%$
test_exec_functorch 0.2345ms 0.1835ms 5.4482 KOps/s 5.4995 KOps/s $\color{#d91a1a}-0.93\%$
test_exec_functional_call 0.2767ms 0.1647ms 6.0724 KOps/s 6.2102 KOps/s $\color{#d91a1a}-2.22\%$
test_exec_td_decorator 0.4497ms 0.2385ms 4.1924 KOps/s 4.2474 KOps/s $\color{#d91a1a}-1.30\%$
test_vmap_mlp_speed_decorator[True-True] 0.9227ms 0.7704ms 1.2981 KOps/s 1.2923 KOps/s $\color{#35bf28}+0.45\%$
test_vmap_mlp_speed_decorator[True-False] 0.9198ms 0.7707ms 1.2976 KOps/s 1.2859 KOps/s $\color{#35bf28}+0.90\%$
test_vmap_mlp_speed_decorator[False-True] 0.8285ms 0.6631ms 1.5080 KOps/s 1.4992 KOps/s $\color{#35bf28}+0.58\%$
test_vmap_mlp_speed_decorator[False-False] 0.8034ms 0.6624ms 1.5097 KOps/s 1.5065 KOps/s $\color{#35bf28}+0.21\%$
test_vmap_transformer_speed_decorator[True-True] 20.4677ms 20.3410ms 49.1619 Ops/s 48.9389 Ops/s $\color{#35bf28}+0.46\%$
test_vmap_transformer_speed_decorator[True-False] 20.5175ms 20.3428ms 49.1573 Ops/s 48.8771 Ops/s $\color{#35bf28}+0.57\%$
test_vmap_transformer_speed_decorator[False-True] 20.2771ms 20.1376ms 49.6583 Ops/s 49.3566 Ops/s $\color{#35bf28}+0.61\%$
test_vmap_transformer_speed_decorator[False-False] 20.8718ms 20.1671ms 49.5858 Ops/s 49.3015 Ops/s $\color{#35bf28}+0.58\%$
test_to_module_speed[True] 2.0114ms 1.4736ms 678.6173 Ops/s 676.1113 Ops/s $\color{#35bf28}+0.37\%$
test_to_module_speed[False] 1.9563ms 1.4567ms 686.4691 Ops/s 688.1905 Ops/s $\color{#d91a1a}-0.25\%$
test_tc_init 86.5020μs 51.9947μs 19.2327 KOps/s 19.4078 KOps/s $\color{#d91a1a}-0.90\%$
test_tc_init_tensor_only 48.4800μs 15.1445μs 66.0307 KOps/s 66.8328 KOps/s $\color{#d91a1a}-1.20\%$
test_tc_init_nested 0.1433ms 0.1030ms 9.7071 KOps/s 9.7966 KOps/s $\color{#d91a1a}-0.91\%$
test_tc_first_layer_tensor 25.7500μs 1.7783μs 562.3235 KOps/s 562.0300 KOps/s $\color{#35bf28}+0.05\%$
test_tc_first_layer_tensor_only 3.8201μs 0.6906μs 1.4479 MOps/s 1.4750 MOps/s $\color{#d91a1a}-1.84\%$
test_tc_first_layer_tensor_set 40.3200μs 4.2221μs 236.8503 KOps/s 238.2442 KOps/s $\color{#d91a1a}-0.59\%$
test_tc_first_layer_tensor_only_set 14.1700μs 3.0335μs 329.6534 KOps/s 336.8389 KOps/s $\color{#d91a1a}-2.13\%$
test_tc_first_layer_nontensor 28.2700μs 5.9167μs 169.0137 KOps/s 171.3503 KOps/s $\color{#d91a1a}-1.36\%$
test_tc_second_layer_tensor 36.9400μs 4.2824μs 233.5120 KOps/s 237.9182 KOps/s $\color{#d91a1a}-1.85\%$
test_tc_second_layer_nontensor 72.0110μs 8.3575μs 119.6529 KOps/s 122.3977 KOps/s $\color{#d91a1a}-2.24\%$
test_unbind 0.2488s 13.4206ms 74.5125 Ops/s 70.4754 Ops/s $\textbf{\color{#35bf28}+5.73\%}$
test_full_like 4.6693ms 4.3786ms 228.3824 Ops/s 112.9975 Ops/s $\textbf{\color{#35bf28}+102.11\%}$
test_zeros_like 5.0050ms 4.3760ms 228.5211 Ops/s 230.8509 Ops/s $\color{#d91a1a}-1.01\%$
test_ones_like 4.9803ms 4.3839ms 228.1088 Ops/s 228.6355 Ops/s $\color{#d91a1a}-0.23\%$
test_clone 6.5742ms 6.3939ms 156.3986 Ops/s 156.2706 Ops/s $\color{#35bf28}+0.08\%$
test_squeeze 96.4620μs 14.5348μs 68.8003 KOps/s 69.2076 KOps/s $\color{#d91a1a}-0.59\%$
test_unsqueeze 0.1553ms 0.1066ms 9.3826 KOps/s 9.4588 KOps/s $\color{#d91a1a}-0.81\%$
test_split 0.2362ms 0.1832ms 5.4576 KOps/s 5.5469 KOps/s $\color{#d91a1a}-1.61\%$
test_permute 0.2673ms 0.2124ms 4.7073 KOps/s 4.9203 KOps/s $\color{#d91a1a}-4.33\%$
test_stack 50.6830ms 50.4218ms 19.8327 Ops/s 19.8612 Ops/s $\color{#d91a1a}-0.14\%$
test_cat 42.4609ms 42.2161ms 23.6876 Ops/s 19.9493 Ops/s $\textbf{\color{#35bf28}+18.74\%}$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CI CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants