Collaborator

@vmoens vmoens commented Sep 9, 2024

No description provided.

@facebook-github-bot added the "CLA Signed" label on Sep 9, 2024 (this label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed).
@github-actions

github-actions bot commented Sep 9, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}11$. Worsened: $\large\color{#d91a1a}23$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 59.5510μs 19.5805μs 51.0713 KOps/s 50.5064 KOps/s $\color{#35bf28}+1.12\%$
test_plain_set_stack_nested 55.6640μs 20.0274μs 49.9316 KOps/s 49.8603 KOps/s $\color{#35bf28}+0.14\%$
test_plain_set_nested_inplace 66.8140μs 21.5509μs 46.4018 KOps/s 46.7189 KOps/s $\color{#d91a1a}-0.68\%$
test_plain_set_stack_nested_inplace 94.1760μs 21.3787μs 46.7754 KOps/s 47.5077 KOps/s $\color{#d91a1a}-1.54\%$
test_items 27.1910μs 4.1635μs 240.1828 KOps/s 243.2361 KOps/s $\color{#d91a1a}-1.26\%$
test_items_nested 0.4068ms 0.3418ms 2.9256 KOps/s 3.0441 KOps/s $\color{#d91a1a}-3.90\%$
test_items_nested_locked 0.3929ms 0.3421ms 2.9234 KOps/s 3.0351 KOps/s $\color{#d91a1a}-3.68\%$
test_items_nested_leaf 0.1908ms 85.2456μs 11.7308 KOps/s 11.8224 KOps/s $\color{#d91a1a}-0.77\%$
test_items_stack_nested 0.4189ms 0.3436ms 2.9107 KOps/s 3.0021 KOps/s $\color{#d91a1a}-3.05\%$
test_items_stack_nested_leaf 0.1643ms 86.1726μs 11.6046 KOps/s 11.9396 KOps/s $\color{#d91a1a}-2.81\%$
test_items_stack_nested_locked 0.5010ms 0.3442ms 2.9054 KOps/s 3.0024 KOps/s $\color{#d91a1a}-3.23\%$
test_keys 25.0970μs 3.5403μs 282.4628 KOps/s 283.2967 KOps/s $\color{#d91a1a}-0.29\%$
test_keys_nested 0.1843ms 98.7359μs 10.1280 KOps/s 10.1305 KOps/s $\color{#d91a1a}-0.02\%$
test_keys_nested_locked 1.9100ms 0.1036ms 9.6510 KOps/s 9.6884 KOps/s $\color{#d91a1a}-0.39\%$
test_keys_nested_leaf 0.1591ms 81.3866μs 12.2870 KOps/s 11.9188 KOps/s $\color{#35bf28}+3.09\%$
test_keys_stack_nested 0.1760ms 95.7807μs 10.4405 KOps/s 10.4012 KOps/s $\color{#35bf28}+0.38\%$
test_keys_stack_nested_leaf 0.1566ms 82.0803μs 12.1832 KOps/s 12.4300 KOps/s $\color{#d91a1a}-1.99\%$
test_keys_stack_nested_locked 0.2233ms 0.1043ms 9.5854 KOps/s 9.9655 KOps/s $\color{#d91a1a}-3.81\%$
test_values 23.4130μs 1.2149μs 823.1257 KOps/s 910.8996 KOps/s $\textbf{\color{#d91a1a}-9.64\%}$
test_values_nested 91.6910μs 47.6833μs 20.9717 KOps/s 20.8063 KOps/s $\color{#35bf28}+0.79\%$
test_values_nested_locked 94.0850μs 47.5416μs 21.0342 KOps/s 20.8463 KOps/s $\color{#35bf28}+0.90\%$
test_values_nested_leaf 76.1220μs 41.9195μs 23.8553 KOps/s 23.5215 KOps/s $\color{#35bf28}+1.42\%$
test_values_stack_nested 0.1291ms 47.6815μs 20.9725 KOps/s 20.7609 KOps/s $\color{#35bf28}+1.02\%$
test_values_stack_nested_leaf 89.1160μs 42.7091μs 23.4142 KOps/s 24.0973 KOps/s $\color{#d91a1a}-2.83\%$
test_values_stack_nested_locked 90.7090μs 48.0646μs 20.8053 KOps/s 20.5527 KOps/s $\color{#35bf28}+1.23\%$
test_membership 17.3720μs 0.8548μs 1.1699 MOps/s 1.4383 MOps/s $\textbf{\color{#d91a1a}-18.66\%}$
test_membership_nested 25.6980μs 2.6602μs 375.9124 KOps/s 393.2169 KOps/s $\color{#d91a1a}-4.40\%$
test_membership_nested_leaf 46.4970μs 2.6528μs 376.9590 KOps/s 387.8234 KOps/s $\color{#d91a1a}-2.80\%$
test_membership_stacked_nested 30.2060μs 2.6254μs 380.8930 KOps/s 385.6293 KOps/s $\color{#d91a1a}-1.23\%$
test_membership_stacked_nested_leaf 22.2220μs 2.6410μs 378.6507 KOps/s 386.3033 KOps/s $\color{#d91a1a}-1.98\%$
test_membership_nested_last 35.0760μs 3.8712μs 258.3202 KOps/s 268.4946 KOps/s $\color{#d91a1a}-3.79\%$
test_membership_nested_leaf_last 43.8010μs 3.8713μs 258.3116 KOps/s 266.0248 KOps/s $\color{#d91a1a}-2.90\%$
test_membership_stacked_nested_last 32.1100μs 3.8334μs 260.8620 KOps/s 170.2254 KOps/s $\textbf{\color{#35bf28}+53.25\%}$
test_membership_stacked_nested_leaf_last 20.4470μs 3.8487μs 259.8267 KOps/s 169.5771 KOps/s $\textbf{\color{#35bf28}+53.22\%}$
test_nested_getleaf 45.6250μs 10.8106μs 92.5017 KOps/s 94.0072 KOps/s $\color{#d91a1a}-1.60\%$
test_nested_get 71.1740μs 10.0382μs 99.6198 KOps/s 99.6832 KOps/s $\color{#d91a1a}-0.06\%$
test_stacked_getleaf 43.2010μs 10.7610μs 92.9285 KOps/s 92.3191 KOps/s $\color{#35bf28}+0.66\%$
test_stacked_get 34.7350μs 10.3440μs 96.6749 KOps/s 99.2541 KOps/s $\color{#d91a1a}-2.60\%$
test_nested_getitemleaf 40.1950μs 11.0268μs 90.6884 KOps/s 90.8784 KOps/s $\color{#d91a1a}-0.21\%$
test_nested_getitem 44.4930μs 10.3661μs 96.4683 KOps/s 98.1170 KOps/s $\color{#d91a1a}-1.68\%$
test_stacked_getitemleaf 37.4400μs 11.0773μs 90.2749 KOps/s 90.4944 KOps/s $\color{#d91a1a}-0.24\%$
test_stacked_getitem 68.4460μs 10.2595μs 97.4704 KOps/s 97.6299 KOps/s $\color{#d91a1a}-0.16\%$
test_lock_nested 97.1362ms 0.5910ms 1.6921 KOps/s 2.0951 KOps/s $\textbf{\color{#d91a1a}-19.23\%}$
test_lock_stack_nested 0.7033ms 0.4591ms 2.1780 KOps/s 2.2725 KOps/s $\color{#d91a1a}-4.16\%$
test_unlock_nested 0.1026s 0.5212ms 1.9188 KOps/s 2.4561 KOps/s $\textbf{\color{#d91a1a}-21.88\%}$
test_unlock_stack_nested 0.6128ms 0.3742ms 2.6723 KOps/s 2.7430 KOps/s $\color{#d91a1a}-2.58\%$
test_flatten_speed 0.2465ms 0.1043ms 9.5885 KOps/s 9.5192 KOps/s $\color{#35bf28}+0.73\%$
test_unflatten_speed 0.5990ms 0.4606ms 2.1709 KOps/s 2.2154 KOps/s $\color{#d91a1a}-2.01\%$
test_common_ops 4.9871ms 1.1134ms 898.1236 Ops/s 901.3979 Ops/s $\color{#d91a1a}-0.36\%$
test_creation 61.6850μs 2.1602μs 462.9214 KOps/s 482.6016 KOps/s $\color{#d91a1a}-4.08\%$
test_creation_empty 0.1001ms 16.2252μs 61.6324 KOps/s 58.9320 KOps/s $\color{#35bf28}+4.58\%$
test_creation_nested_1 58.5800μs 19.2244μs 52.0171 KOps/s 50.1674 KOps/s $\color{#35bf28}+3.69\%$
test_creation_nested_2 70.4820μs 23.3672μs 42.7951 KOps/s 41.2938 KOps/s $\color{#35bf28}+3.64\%$
test_clone 0.2158ms 17.3472μs 57.6461 KOps/s 57.1815 KOps/s $\color{#35bf28}+0.81\%$
test_getitem[int] 1.1381ms 16.5982μs 60.2473 KOps/s 58.0362 KOps/s $\color{#35bf28}+3.81\%$
test_getitem[slice_int] 0.1629ms 30.8625μs 32.4018 KOps/s 33.5959 KOps/s $\color{#d91a1a}-3.55\%$
test_getitem[range] 0.2061ms 59.1013μs 16.9201 KOps/s 17.2618 KOps/s $\color{#d91a1a}-1.98\%$
test_getitem[tuple] 0.1522ms 25.6251μs 39.0243 KOps/s 39.8266 KOps/s $\color{#d91a1a}-2.01\%$
test_getitem[list] 0.1908ms 53.5422μs 18.6769 KOps/s 18.4273 KOps/s $\color{#35bf28}+1.35\%$
test_setitem_dim[int] 81.1620μs 38.8837μs 25.7177 KOps/s 26.1760 KOps/s $\color{#d91a1a}-1.75\%$
test_setitem_dim[slice_int] 0.1085ms 67.4184μs 14.8327 KOps/s 14.9177 KOps/s $\color{#d91a1a}-0.57\%$
test_setitem_dim[range] 0.1433ms 92.0924μs 10.8587 KOps/s 10.8460 KOps/s $\color{#35bf28}+0.12\%$
test_setitem_dim[tuple] 0.1010ms 55.5306μs 18.0081 KOps/s 17.8016 KOps/s $\color{#35bf28}+1.16\%$
test_setitem 0.2302ms 28.7382μs 34.7969 KOps/s 34.2579 KOps/s $\color{#35bf28}+1.57\%$
test_set 0.2503ms 28.4309μs 35.1730 KOps/s 34.6527 KOps/s $\color{#35bf28}+1.50\%$
test_set_shared 3.5864ms 0.2233ms 4.4786 KOps/s 4.6017 KOps/s $\color{#d91a1a}-2.68\%$
test_update 0.3175ms 34.7860μs 28.7472 KOps/s 28.6172 KOps/s $\color{#35bf28}+0.45\%$
test_update_nested 1.1339ms 45.0683μs 22.1886 KOps/s 21.8401 KOps/s $\color{#35bf28}+1.60\%$
test_update__nested 0.2014ms 34.4564μs 29.0222 KOps/s 28.7393 KOps/s $\color{#35bf28}+0.98\%$
test_set_nested 0.2106ms 30.3892μs 32.9065 KOps/s 31.7806 KOps/s $\color{#35bf28}+3.54\%$
test_set_nested_new 0.1928ms 35.4155μs 28.2362 KOps/s 26.2918 KOps/s $\textbf{\color{#35bf28}+7.40\%}$
test_select 0.2491ms 53.0071μs 18.8654 KOps/s 18.5190 KOps/s $\color{#35bf28}+1.87\%$
test_select_nested 0.1255ms 61.0555μs 16.3785 KOps/s 16.3947 KOps/s $\color{#d91a1a}-0.10\%$
test_exclude_nested 0.1622ms 76.5498μs 13.0634 KOps/s 13.0467 KOps/s $\color{#35bf28}+0.13\%$
test_empty[True] 0.5301ms 0.3206ms 3.1196 KOps/s 3.1745 KOps/s $\color{#d91a1a}-1.73\%$
test_empty[False] 12.3105μs 1.2224μs 818.0380 KOps/s 870.3258 KOps/s $\textbf{\color{#d91a1a}-6.01\%}$
test_unbind_speed 0.5424ms 0.2976ms 3.3602 KOps/s 3.3989 KOps/s $\color{#d91a1a}-1.14\%$
test_unbind_speed_stack0 0.4762ms 0.2983ms 3.3522 KOps/s 3.5316 KOps/s $\textbf{\color{#d91a1a}-5.08\%}$
test_unbind_speed_stack1 0.1113s 0.8672ms 1.1531 KOps/s 1.3987 KOps/s $\textbf{\color{#d91a1a}-17.56\%}$
test_split 0.1066s 2.2880ms 437.0703 Ops/s 446.6236 Ops/s $\color{#d91a1a}-2.14\%$
test_chunk 3.2941ms 2.0827ms 480.1417 Ops/s 448.5219 Ops/s $\textbf{\color{#35bf28}+7.05\%}$
test_creation[device0] 0.2981ms 0.1211ms 8.2574 KOps/s 8.4806 KOps/s $\color{#d91a1a}-2.63\%$
test_creation_from_tensor 0.3168ms 0.1202ms 8.3221 KOps/s 8.4545 KOps/s $\color{#d91a1a}-1.57\%$
test_add_one[memmap_tensor0] 0.4814ms 7.4567μs 134.1073 KOps/s 131.8616 KOps/s $\color{#35bf28}+1.70\%$
test_contiguous[memmap_tensor0] 41.9880μs 2.0045μs 498.8776 KOps/s 533.0490 KOps/s $\textbf{\color{#d91a1a}-6.41\%}$
test_stack[memmap_tensor0] 0.1450ms 6.0337μs 165.7371 KOps/s 164.2608 KOps/s $\color{#35bf28}+0.90\%$
test_memmaptd_index 1.4103ms 0.4199ms 2.3815 KOps/s 2.5081 KOps/s $\textbf{\color{#d91a1a}-5.05\%}$
test_memmaptd_index_astensor 1.2134ms 0.5067ms 1.9735 KOps/s 2.0981 KOps/s $\textbf{\color{#d91a1a}-5.94\%}$
test_memmaptd_index_op 2.1082ms 1.0320ms 969.0027 Ops/s 992.3289 Ops/s $\color{#d91a1a}-2.35\%$
test_serialize_model 0.2285s 0.1358s 7.3640 Ops/s 8.4150 Ops/s $\textbf{\color{#d91a1a}-12.49\%}$
test_serialize_model_pickle 0.4747s 0.4018s 2.4890 Ops/s 2.4971 Ops/s $\color{#d91a1a}-0.32\%$
test_serialize_weights 0.1391s 0.1200s 8.3306 Ops/s 7.6195 Ops/s $\textbf{\color{#35bf28}+9.33\%}$
test_serialize_weights_returnearly 0.2764s 0.1760s 5.6811 Ops/s 6.2575 Ops/s $\textbf{\color{#d91a1a}-9.21\%}$
test_serialize_weights_pickle 0.5435s 0.4134s 2.4192 Ops/s 2.1958 Ops/s $\textbf{\color{#35bf28}+10.18\%}$
test_serialize_weights_filesystem 0.1488s 0.1460s 6.8476 Ops/s 6.9956 Ops/s $\color{#d91a1a}-2.12\%$
test_serialize_model_filesystem 0.1559s 0.1497s 6.6815 Ops/s 5.9629 Ops/s $\textbf{\color{#35bf28}+12.05\%}$
test_reshape_pytree 76.2310μs 38.7647μs 25.7967 KOps/s 26.0940 KOps/s $\color{#d91a1a}-1.14\%$
test_reshape_td 0.1492ms 45.7253μs 21.8697 KOps/s 21.5474 KOps/s $\color{#35bf28}+1.50\%$
test_view_pytree 0.1752ms 39.9550μs 25.0282 KOps/s 25.7795 KOps/s $\color{#d91a1a}-2.91\%$
test_view_td 0.1147ms 52.1771μs 19.1655 KOps/s 19.9227 KOps/s $\color{#d91a1a}-3.80\%$
test_unbind_pytree 0.1023ms 36.4718μs 27.4184 KOps/s 27.4198 KOps/s $-0.00\%$
test_unbind_td 0.3360ms 44.4546μs 22.4948 KOps/s 21.9910 KOps/s $\color{#35bf28}+2.29\%$
test_split_pytree 79.8390μs 37.8652μs 26.4094 KOps/s 26.4340 KOps/s $\color{#d91a1a}-0.09\%$
test_split_td 0.4859ms 58.9631μs 16.9597 KOps/s 17.2846 KOps/s $\color{#d91a1a}-1.88\%$
test_add_pytree 0.1012ms 44.7037μs 22.3695 KOps/s 22.5017 KOps/s $\color{#d91a1a}-0.59\%$
test_add_td 0.1849ms 77.1820μs 12.9564 KOps/s 12.5380 KOps/s $\color{#35bf28}+3.34\%$
test_compile_add_one_nested[tensordict-compile] 0.1224ms 58.4185μs 17.1179 KOps/s 17.5199 KOps/s $\color{#d91a1a}-2.29\%$
test_compile_add_one_nested[tensordict-eager] 0.3737ms 0.1932ms 5.1751 KOps/s 5.3056 KOps/s $\color{#d91a1a}-2.46\%$
test_compile_add_one_nested[pytree-compile] 0.1403ms 57.7489μs 17.3164 KOps/s 17.6825 KOps/s $\color{#d91a1a}-2.07\%$
test_compile_add_one_nested[pytree-eager] 0.3251ms 0.1407ms 7.1093 KOps/s 7.3285 KOps/s $\color{#d91a1a}-2.99\%$
test_compile_copy_nested[tensordict-compile] 92.2010μs 20.5618μs 48.6338 KOps/s 48.7208 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_copy_nested[tensordict-eager] 0.1323ms 68.2919μs 14.6430 KOps/s 14.6993 KOps/s $\color{#d91a1a}-0.38\%$
test_compile_copy_nested[pytree-compile] 0.1669ms 75.5966μs 13.2281 KOps/s 13.0619 KOps/s $\color{#35bf28}+1.27\%$
test_compile_copy_nested[pytree-eager] 0.1441ms 68.1856μs 14.6659 KOps/s 14.9063 KOps/s $\color{#d91a1a}-1.61\%$
test_compile_add_one_flat[tensordict-compile] 0.3066ms 0.1817ms 5.5043 KOps/s 5.6714 KOps/s $\color{#d91a1a}-2.95\%$
test_compile_add_one_flat[tensordict-eager] 0.3375ms 0.1996ms 5.0100 KOps/s 5.2985 KOps/s $\textbf{\color{#d91a1a}-5.45\%}$
test_compile_add_one_flat[tensorclass-compile] 0.1874ms 47.3662μs 21.1121 KOps/s 22.2482 KOps/s $\textbf{\color{#d91a1a}-5.11\%}$
test_compile_add_one_flat[tensorclass-eager] 0.6736ms 71.0643μs 14.0718 KOps/s 14.1029 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_add_one_flat[pytree-compile] 0.3196ms 0.1801ms 5.5526 KOps/s 5.6710 KOps/s $\color{#d91a1a}-2.09\%$
test_compile_add_one_flat[pytree-eager] 0.5823ms 0.2961ms 3.3774 KOps/s 3.5628 KOps/s $\textbf{\color{#d91a1a}-5.20\%}$
test_compile_add_self_flat[tensordict-eager] 0.3979ms 0.2098ms 4.7667 KOps/s 4.8378 KOps/s $\color{#d91a1a}-1.47\%$
test_compile_add_self_flat[tensordict-compile] 0.3648ms 0.1839ms 5.4383 KOps/s 5.6288 KOps/s $\color{#d91a1a}-3.38\%$
test_compile_add_self_flat[tensorclass-eager] 0.1385ms 66.6612μs 15.0012 KOps/s 15.5474 KOps/s $\color{#d91a1a}-3.51\%$
test_compile_add_self_flat[tensorclass-compile] 0.1513ms 50.5426μs 19.7853 KOps/s 23.3099 KOps/s $\textbf{\color{#d91a1a}-15.12\%}$
test_compile_add_self_flat[pytree-eager] 0.4640ms 0.2426ms 4.1216 KOps/s 4.3508 KOps/s $\textbf{\color{#d91a1a}-5.27\%}$
test_compile_add_self_flat[pytree-compile] 0.3840ms 0.1859ms 5.3797 KOps/s 5.5978 KOps/s $\color{#d91a1a}-3.90\%$
test_compile_copy_flat[tensordict-compile] 0.2181ms 0.1073ms 9.3234 KOps/s 9.6881 KOps/s $\color{#d91a1a}-3.76\%$
test_compile_copy_flat[tensordict-eager] 0.1199ms 58.2878μs 17.1563 KOps/s 16.9970 KOps/s $\color{#35bf28}+0.94\%$
test_compile_copy_flat[pytree-compile] 0.1497ms 76.8159μs 13.0181 KOps/s 13.0821 KOps/s $\color{#d91a1a}-0.49\%$
test_compile_copy_flat[pytree-eager] 0.1374ms 68.4094μs 14.6179 KOps/s 14.5705 KOps/s $\color{#35bf28}+0.32\%$
test_compile_assign_and_add[tensordict-compile] 0.4265ms 0.2016ms 4.9602 KOps/s 4.9267 KOps/s $\color{#35bf28}+0.68\%$
test_compile_assign_and_add[tensordict-eager] 2.7218ms 1.7237ms 580.1604 Ops/s 587.0589 Ops/s $\color{#d91a1a}-1.18\%$
test_compile_assign_and_add[pytree-compile] 0.3354ms 0.1950ms 5.1292 KOps/s 5.0704 KOps/s $\color{#35bf28}+1.16\%$
test_compile_assign_and_add[pytree-eager] 1.9305ms 1.1429ms 874.9754 Ops/s 916.8663 Ops/s $\color{#d91a1a}-4.57\%$
test_compile_assign_and_add_stack[compile] 0.5585ms 0.4260ms 2.3473 KOps/s 2.2706 KOps/s $\color{#35bf28}+3.37\%$
test_compile_assign_and_add_stack[eager] 4.9191ms 3.7500ms 266.6655 Ops/s 271.4374 Ops/s $\color{#d91a1a}-1.76\%$
test_compile_indexing[tensor-tensordict-compile] 0.1021ms 34.9508μs 28.6116 KOps/s 27.7960 KOps/s $\color{#35bf28}+2.93\%$
test_compile_indexing[tensor-tensordict-eager] 1.1488ms 48.1841μs 20.7537 KOps/s 20.4932 KOps/s $\color{#35bf28}+1.27\%$
test_compile_indexing[tensor-tensorclass-compile] 75.2200μs 30.5119μs 32.7741 KOps/s 32.8081 KOps/s $\color{#d91a1a}-0.10\%$
test_compile_indexing[tensor-tensorclass-eager] 76.0110μs 28.2395μs 35.4114 KOps/s 35.2023 KOps/s $\color{#35bf28}+0.59\%$
test_compile_indexing[tensor-pytree-compile] 78.8570μs 31.0036μs 32.2544 KOps/s 30.8446 KOps/s $\color{#35bf28}+4.57\%$
test_compile_indexing[tensor-pytree-eager] 82.4440μs 29.1150μs 34.3466 KOps/s 35.5539 KOps/s $\color{#d91a1a}-3.40\%$
test_compile_indexing[slice-tensordict-compile] 0.1597ms 76.4174μs 13.0860 KOps/s 13.1367 KOps/s $\color{#d91a1a}-0.39\%$
test_compile_indexing[slice-tensordict-eager] 0.5859ms 27.6931μs 36.1100 KOps/s 36.1070 KOps/s $+0.01\%$
test_compile_indexing[slice-tensorclass-compile] 0.1521ms 69.3734μs 14.4147 KOps/s 14.3110 KOps/s $\color{#35bf28}+0.72\%$
test_compile_indexing[slice-tensorclass-eager] 69.9810μs 23.4829μs 42.5842 KOps/s 42.3190 KOps/s $\color{#35bf28}+0.63\%$
test_compile_indexing[slice-pytree-compile] 0.2034ms 70.2461μs 14.2357 KOps/s 14.7157 KOps/s $\color{#d91a1a}-3.26\%$
test_compile_indexing[slice-pytree-eager] 58.4290μs 23.0509μs 43.3823 KOps/s 41.9240 KOps/s $\color{#35bf28}+3.48\%$
test_compile_indexing[int-tensordict-compile] 0.1985ms 76.4009μs 13.0888 KOps/s 13.5583 KOps/s $\color{#d91a1a}-3.46\%$
test_compile_indexing[int-tensordict-eager] 1.0849ms 27.8213μs 35.9436 KOps/s 36.9451 KOps/s $\color{#d91a1a}-2.71\%$
test_compile_indexing[int-tensorclass-compile] 0.1532ms 70.0474μs 14.2761 KOps/s 14.7741 KOps/s $\color{#d91a1a}-3.37\%$
test_compile_indexing[int-tensorclass-eager] 68.5680μs 22.9389μs 43.5941 KOps/s 42.6407 KOps/s $\color{#35bf28}+2.24\%$
test_compile_indexing[int-pytree-compile] 0.1975ms 70.0304μs 14.2795 KOps/s 14.6874 KOps/s $\color{#d91a1a}-2.78\%$
test_compile_indexing[int-pytree-eager] 70.9430μs 23.0430μs 43.3972 KOps/s 43.3676 KOps/s $\color{#35bf28}+0.07\%$
test_mod_add[eager] 71.7340μs 23.0506μs 43.3829 KOps/s 43.6399 KOps/s $\color{#d91a1a}-0.59\%$
test_mod_add[compile] 0.1034ms 38.5784μs 25.9213 KOps/s 24.4637 KOps/s $\textbf{\color{#35bf28}+5.96\%}$
test_mod_add[compile-overhead] 0.1130ms 38.4335μs 26.0190 KOps/s 24.6941 KOps/s $\textbf{\color{#35bf28}+5.37\%}$
test_mod_wrap[eager] 0.4247ms 0.2071ms 4.8295 KOps/s 4.7364 KOps/s $\color{#35bf28}+1.97\%$
test_mod_wrap[compile] 0.5212ms 0.2345ms 4.2647 KOps/s 4.1823 KOps/s $\color{#35bf28}+1.97\%$
test_mod_wrap[compile-overhead] 0.4238ms 0.2308ms 4.3333 KOps/s 4.2178 KOps/s $\color{#35bf28}+2.74\%$
test_mod_wrap_and_backward[eager] 13.7993ms 11.9643ms 83.5823 Ops/s 91.7370 Ops/s $\textbf{\color{#d91a1a}-8.89\%}$
test_mod_wrap_and_backward[compile] 14.3704ms 12.5222ms 79.8584 Ops/s 89.8958 Ops/s $\textbf{\color{#d91a1a}-11.17\%}$
test_mod_wrap_and_backward[compile-overhead] 17.7771ms 11.8806ms 84.1709 Ops/s 90.3789 Ops/s $\textbf{\color{#d91a1a}-6.87\%}$
test_seq_add[eager] 0.2194ms 86.0054μs 11.6272 KOps/s 11.9432 KOps/s $\color{#d91a1a}-2.65\%$
test_seq_add[compile] 0.1486ms 64.5851μs 15.4835 KOps/s 15.4313 KOps/s $\color{#35bf28}+0.34\%$
test_seq_add[compile-overhead] 0.1338ms 62.0506μs 16.1159 KOps/s 15.7732 KOps/s $\color{#35bf28}+2.17\%$
test_seq_wrap[eager] 0.6152ms 0.3780ms 2.6452 KOps/s 2.7096 KOps/s $\color{#d91a1a}-2.37\%$
test_seq_wrap[compile] 0.5395ms 0.2751ms 3.6351 KOps/s 3.6724 KOps/s $\color{#d91a1a}-1.02\%$
test_seq_wrap[compile-overhead] 0.4033ms 0.2706ms 3.6955 KOps/s 3.6490 KOps/s $\color{#35bf28}+1.28\%$
test_func_call_runtime[False-eager] 0.7987ms 0.5321ms 1.8795 KOps/s 1.8855 KOps/s $\color{#d91a1a}-0.32\%$
test_func_call_runtime[False-compile] 0.9915ms 0.5113ms 1.9558 KOps/s 1.9729 KOps/s $\color{#d91a1a}-0.86\%$
test_func_call_runtime[False-compile-overhead] 0.8254ms 0.5123ms 1.9520 KOps/s 1.9653 KOps/s $\color{#d91a1a}-0.68\%$
test_func_call_runtime[True-eager] 1.0777ms 0.7480ms 1.3369 KOps/s 1.3339 KOps/s $\color{#35bf28}+0.23\%$
test_func_call_runtime[True-compile] 0.6700ms 0.5182ms 1.9297 KOps/s 1.9373 KOps/s $\color{#d91a1a}-0.39\%$
test_func_call_runtime[True-compile-overhead] 0.7153ms 0.5230ms 1.9121 KOps/s 1.9132 KOps/s $\color{#d91a1a}-0.06\%$
test_func_call_cm_runtime[False-eager] 1.0223ms 0.5315ms 1.8814 KOps/s 1.9221 KOps/s $\color{#d91a1a}-2.12\%$
test_func_call_cm_runtime[False-compile] 0.7892ms 0.5157ms 1.9392 KOps/s 1.9667 KOps/s $\color{#d91a1a}-1.40\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6742ms 0.5092ms 1.9637 KOps/s 1.9656 KOps/s $\color{#d91a1a}-0.10\%$
test_func_call_cm_runtime[True-eager] 1.1329ms 0.8818ms 1.1341 KOps/s 1.1340 KOps/s $+0.01\%$
test_func_call_cm_runtime[True-compile] 1.0282ms 0.7522ms 1.3293 KOps/s 1.3302 KOps/s $\color{#d91a1a}-0.07\%$
test_func_call_cm_runtime[True-compile-overhead] 1.3260ms 0.7506ms 1.3322 KOps/s 1.3320 KOps/s $\color{#35bf28}+0.02\%$
test_vmap_func_call_cm_runtime[eager] 4.4275ms 1.9023ms 525.6699 Ops/s 522.1019 Ops/s $\color{#35bf28}+0.68\%$
test_vmap_func_call_cm_runtime[compile] 3.2193ms 1.9715ms 507.2382 Ops/s 512.4731 Ops/s $\color{#d91a1a}-1.02\%$
test_vmap_func_call_cm_runtime[compile-overhead] 3.0035ms 1.9607ms 510.0287 Ops/s 512.3353 Ops/s $\color{#d91a1a}-0.45\%$
test_distributed 0.2906ms 0.1269ms 7.8831 KOps/s 7.7129 KOps/s $\color{#35bf28}+2.21\%$
test_tdmodule 42.6490μs 15.6454μs 63.9164 KOps/s 60.9932 KOps/s $\color{#35bf28}+4.79\%$
test_tdmodule_dispatch 62.2760μs 33.2933μs 30.0361 KOps/s 29.4622 KOps/s $\color{#35bf28}+1.95\%$
test_tdseq 49.2320μs 19.0451μs 52.5068 KOps/s 53.3752 KOps/s $\color{#d91a1a}-1.63\%$
test_tdseq_dispatch 69.4690μs 39.1592μs 25.5368 KOps/s 26.0489 KOps/s $\color{#d91a1a}-1.97\%$
test_instantiation_functorch 1.9682ms 1.6597ms 602.5104 Ops/s 628.4174 Ops/s $\color{#d91a1a}-4.12\%$
test_instantiation_td 3.4757ms 1.2139ms 823.8159 Ops/s 834.7495 Ops/s $\color{#d91a1a}-1.31\%$
test_exec_functorch 0.3424ms 0.1878ms 5.3251 KOps/s 5.4223 KOps/s $\color{#d91a1a}-1.79\%$
test_exec_functional_call 0.3294ms 0.1756ms 5.6958 KOps/s 5.6095 KOps/s $\color{#35bf28}+1.54\%$
test_exec_td 0.9338ms 0.1708ms 5.8549 KOps/s 5.8620 KOps/s $\color{#d91a1a}-0.12\%$
test_exec_td_decorator 0.9892ms 0.2250ms 4.4448 KOps/s 4.3789 KOps/s $\color{#35bf28}+1.50\%$
test_vmap_mlp_speed[True-True] 0.9776ms 0.6300ms 1.5872 KOps/s 1.5774 KOps/s $\color{#35bf28}+0.62\%$
test_vmap_mlp_speed[True-False] 0.9175ms 0.6292ms 1.5894 KOps/s 1.5720 KOps/s $\color{#35bf28}+1.11\%$
test_vmap_mlp_speed[False-True] 0.8916ms 0.4959ms 2.0166 KOps/s 2.0122 KOps/s $\color{#35bf28}+0.22\%$
test_vmap_mlp_speed[False-False] 0.6890ms 0.4910ms 2.0367 KOps/s 1.9937 KOps/s $\color{#35bf28}+2.15\%$
test_vmap_mlp_speed_decorator[True-True] 1.6035ms 0.6142ms 1.6282 KOps/s 1.5952 KOps/s $\color{#35bf28}+2.07\%$
test_vmap_mlp_speed_decorator[True-False] 0.8347ms 0.6103ms 1.6386 KOps/s 1.5891 KOps/s $\color{#35bf28}+3.12\%$
test_vmap_mlp_speed_decorator[False-True] 0.8663ms 0.5100ms 1.9607 KOps/s 1.9392 KOps/s $\color{#35bf28}+1.11\%$
test_vmap_mlp_speed_decorator[False-False] 0.7602ms 0.5088ms 1.9654 KOps/s 1.9464 KOps/s $\color{#35bf28}+0.98\%$
test_to_module_speed[True] 1.5810ms 1.3051ms 766.2309 Ops/s 757.9427 Ops/s $\color{#35bf28}+1.09\%$
test_to_module_speed[False] 1.4658ms 1.2697ms 787.6067 Ops/s 787.7855 Ops/s $\color{#d91a1a}-0.02\%$
test_tc_init 0.1022ms 42.8725μs 23.3250 KOps/s 24.0137 KOps/s $\color{#d91a1a}-2.87\%$
test_tc_init_nested 0.7709ms 84.5088μs 11.8331 KOps/s 12.0710 KOps/s $\color{#d91a1a}-1.97\%$
test_tc_first_layer_tensor 40.3250μs 1.5849μs 630.9393 KOps/s 643.4912 KOps/s $\color{#d91a1a}-1.95\%$
test_tc_first_layer_nontensor 41.4170μs 4.8135μs 207.7509 KOps/s 201.1544 KOps/s $\color{#35bf28}+3.28\%$
test_tc_second_layer_tensor 55.3330μs 2.9709μs 336.6034 KOps/s 342.5170 KOps/s $\color{#d91a1a}-1.73\%$
test_tc_second_layer_nontensor 45.3370μs 6.1914μs 161.5141 KOps/s 160.9164 KOps/s $\color{#35bf28}+0.37\%$
test_unbind 0.5175s 14.5183ms 68.8785 Ops/s 68.6673 Ops/s $\color{#35bf28}+0.31\%$
test_full_like 11.6612ms 8.7149ms 114.7458 Ops/s 100.4292 Ops/s $\textbf{\color{#35bf28}+14.26\%}$
test_zeros_like 15.1384ms 7.3431ms 136.1817 Ops/s 281.3661 Ops/s $\textbf{\color{#d91a1a}-51.60\%}$
test_ones_like 14.0123ms 8.1008ms 123.4449 Ops/s 254.7185 Ops/s $\textbf{\color{#d91a1a}-51.54\%}$
test_clone 15.6680ms 9.9190ms 100.8164 Ops/s 148.9623 Ops/s $\textbf{\color{#d91a1a}-32.32\%}$
test_squeeze 84.9680μs 12.3798μs 80.7767 KOps/s 78.3414 KOps/s $\color{#35bf28}+3.11\%$
test_unsqueeze 0.1758ms 93.1412μs 10.7364 KOps/s 11.1011 KOps/s $\color{#d91a1a}-3.29\%$
test_split 0.5128ms 0.1929ms 5.1837 KOps/s 5.0041 KOps/s $\color{#35bf28}+3.59\%$
test_permute 0.4647ms 0.2218ms 4.5081 KOps/s 4.5586 KOps/s $\color{#d91a1a}-1.11\%$
test_stack 33.1898ms 27.3764ms 36.5278 Ops/s 33.6550 Ops/s $\textbf{\color{#35bf28}+8.54\%}$
test_cat 31.4926ms 27.8940ms 35.8501 Ops/s 34.5311 Ops/s $\color{#35bf28}+3.82\%$
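
For readers checking the numbers: the Change column appears to be the relative difference between the Ops and Ops on Repo HEAD columns, and Ops appears to be the reciprocal of Mean. The Improved and Worsened totals match the number of bolded rows, which in this table are the rows whose change exceeds roughly 5% in magnitude. The sketch below reproduces that arithmetic for three CPU rows. The helper names and the 5% threshold are assumptions inferred from the table, not the benchmark workflow's actual code.

```python
# Sketch only: helper names and the 5% threshold are assumptions inferred
# from the table above, not taken from the benchmark workflow itself.


def ops_per_second(mean_seconds: float) -> float:
    """Throughput implied by a mean run time (the 'Ops' column)."""
    return 1.0 / mean_seconds


def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change of the PR's throughput relative to repo HEAD."""
    return (ops_pr / ops_head - 1.0) * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a row the way the summary counts seem to (assumed threshold)."""
    if change_pct > threshold_pct:
        return "improved"
    if change_pct < -threshold_pct:
        return "worsened"
    return "within noise"


# (Ops, Ops on Repo HEAD) copied from three CPU rows.
# Both values in a row share the same unit, so the ratio is unit-free.
rows = {
    "test_plain_set_nested": (51.0713, 50.5064),  # KOps/s
    "test_membership": (1.1699, 1.4383),          # MOps/s
    "test_zeros_like": (136.1817, 281.3661),      # Ops/s
}

for name, (ops_pr, ops_head) in rows.items():
    change = relative_change(ops_pr, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Because the Mean values in the table are rounded, recomputing Ops from Mean can differ from the reported Ops in the last digit.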

@github-actions

github-actions bot commented Sep 9, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}6$. Worsened: $\large\color{#d91a1a}16$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1248ms 14.8749μs 67.2275 KOps/s 68.9106 KOps/s $\color{#d91a1a}-2.44\%$
test_plain_set_stack_nested 41.1310μs 14.8114μs 67.5158 KOps/s 69.4973 KOps/s $\color{#d91a1a}-2.85\%$
test_plain_set_nested_inplace 0.1857ms 15.7252μs 63.5924 KOps/s 65.1430 KOps/s $\color{#d91a1a}-2.38\%$
test_plain_set_stack_nested_inplace 44.4810μs 15.8520μs 63.0835 KOps/s 65.3035 KOps/s $\color{#d91a1a}-3.40\%$
test_items 23.6710μs 2.8218μs 354.3854 KOps/s 352.0583 KOps/s $\color{#35bf28}+0.66\%$
test_items_nested 0.3671ms 0.3204ms 3.1213 KOps/s 3.1498 KOps/s $\color{#d91a1a}-0.91\%$
test_items_nested_locked 0.3844ms 0.3229ms 3.0966 KOps/s 3.1151 KOps/s $\color{#d91a1a}-0.59\%$
test_items_nested_leaf 91.5920μs 62.7333μs 15.9405 KOps/s 15.8864 KOps/s $\color{#35bf28}+0.34\%$
test_items_stack_nested 0.4728ms 0.3275ms 3.0535 KOps/s 3.1388 KOps/s $\color{#d91a1a}-2.72\%$
test_items_stack_nested_leaf 88.2120μs 64.2213μs 15.5712 KOps/s 15.7901 KOps/s $\color{#d91a1a}-1.39\%$
test_items_stack_nested_locked 0.3791ms 0.3260ms 3.0677 KOps/s 3.0944 KOps/s $\color{#d91a1a}-0.86\%$
test_keys 23.6310μs 3.3886μs 295.1056 KOps/s 295.9035 KOps/s $\color{#d91a1a}-0.27\%$
test_keys_nested 85.8420μs 54.8111μs 18.2445 KOps/s 18.0811 KOps/s $\color{#35bf28}+0.90\%$
test_keys_nested_locked 2.6924ms 59.6896μs 16.7533 KOps/s 16.4297 KOps/s $\color{#35bf28}+1.97\%$
test_keys_nested_leaf 0.2158ms 45.5882μs 21.9355 KOps/s 21.5271 KOps/s $\color{#35bf28}+1.90\%$
test_keys_stack_nested 0.2457ms 53.3027μs 18.7608 KOps/s 18.1454 KOps/s $\color{#35bf28}+3.39\%$
test_keys_stack_nested_leaf 0.2487ms 46.8858μs 21.3284 KOps/s 21.5754 KOps/s $\color{#d91a1a}-1.14\%$
test_keys_stack_nested_locked 0.2050ms 60.5353μs 16.5193 KOps/s 16.8640 KOps/s $\color{#d91a1a}-2.04\%$
test_values 4.2800μs 0.8025μs 1.2461 MOps/s 1.2446 MOps/s $\color{#35bf28}+0.12\%$
test_values_nested 0.2154ms 27.3075μs 36.6200 KOps/s 36.5982 KOps/s $\color{#35bf28}+0.06\%$
test_values_nested_locked 54.0810μs 29.6711μs 33.7028 KOps/s 34.1960 KOps/s $\color{#d91a1a}-1.44\%$
test_values_nested_leaf 57.9610μs 23.9642μs 41.7288 KOps/s 41.4066 KOps/s $\color{#35bf28}+0.78\%$
test_values_stack_nested 90.9720μs 27.9084μs 35.8315 KOps/s 35.5074 KOps/s $\color{#35bf28}+0.91\%$
test_values_stack_nested_leaf 50.7610μs 24.6273μs 40.6054 KOps/s 39.7745 KOps/s $\color{#35bf28}+2.09\%$
test_values_stack_nested_locked 0.1162ms 30.0273μs 33.3031 KOps/s 33.3458 KOps/s $\color{#d91a1a}-0.13\%$
test_membership 1.6776μs 0.4792μs 2.0867 MOps/s 2.1021 MOps/s $\color{#d91a1a}-0.73\%$
test_membership_nested 18.8305μs 1.7919μs 558.0647 KOps/s 566.0778 KOps/s $\color{#d91a1a}-1.42\%$
test_membership_nested_leaf 10.9570μs 1.7383μs 575.2690 KOps/s 579.1523 KOps/s $\color{#d91a1a}-0.67\%$
test_membership_stacked_nested 29.4110μs 1.8148μs 551.0136 KOps/s 558.3540 KOps/s $\color{#d91a1a}-1.31\%$
test_membership_stacked_nested_leaf 28.0900μs 1.7855μs 560.0589 KOps/s 556.3377 KOps/s $\color{#35bf28}+0.67\%$
test_membership_nested_last 31.6510μs 2.5985μs 384.8329 KOps/s 385.7854 KOps/s $\color{#d91a1a}-0.25\%$
test_membership_nested_leaf_last 32.1000μs 2.6345μs 379.5834 KOps/s 386.5984 KOps/s $\color{#d91a1a}-1.81\%$
test_membership_stacked_nested_last 33.3610μs 3.2438μs 308.2836 KOps/s 387.5550 KOps/s $\textbf{\color{#d91a1a}-20.45\%}$
test_membership_stacked_nested_leaf_last 35.6710μs 3.2142μs 311.1159 KOps/s 390.7668 KOps/s $\textbf{\color{#d91a1a}-20.38\%}$
test_nested_getleaf 36.2810μs 6.1191μs 163.4239 KOps/s 164.9591 KOps/s $\color{#d91a1a}-0.93\%$
test_nested_get 31.5600μs 5.7807μs 172.9887 KOps/s 173.9493 KOps/s $\color{#d91a1a}-0.55\%$
test_stacked_getleaf 39.5910μs 6.0659μs 164.8551 KOps/s 165.5940 KOps/s $\color{#d91a1a}-0.45\%$
test_stacked_get 29.0400μs 5.6357μs 177.4401 KOps/s 176.9499 KOps/s $\color{#35bf28}+0.28\%$
test_nested_getitemleaf 30.5310μs 6.1406μs 162.8499 KOps/s 162.9813 KOps/s $\color{#d91a1a}-0.08\%$
test_nested_getitem 30.2910μs 5.7376μs 174.2900 KOps/s 174.6931 KOps/s $\color{#d91a1a}-0.23\%$
test_stacked_getitemleaf 31.1100μs 6.1173μs 163.4696 KOps/s 165.4915 KOps/s $\color{#d91a1a}-1.22\%$
test_stacked_getitem 37.6310μs 5.6523μs 176.9191 KOps/s 177.3978 KOps/s $\color{#d91a1a}-0.27\%$
test_lock_nested 3.2926ms 0.4127ms 2.4229 KOps/s 2.4468 KOps/s $\color{#d91a1a}-0.97\%$
test_lock_stack_nested 0.5071ms 0.3803ms 2.6298 KOps/s 2.7164 KOps/s $\color{#d91a1a}-3.19\%$
test_unlock_nested 0.7818ms 0.3526ms 2.8364 KOps/s 2.8567 KOps/s $\color{#d91a1a}-0.71\%$
test_unlock_stack_nested 0.4520ms 0.3191ms 3.1341 KOps/s 3.2369 KOps/s $\color{#d91a1a}-3.18\%$
test_flatten_speed 0.2577ms 80.1686μs 12.4737 KOps/s 12.6990 KOps/s $\color{#d91a1a}-1.77\%$
test_unflatten_speed 0.3202ms 0.2848ms 3.5109 KOps/s 3.5280 KOps/s $\color{#d91a1a}-0.49\%$
test_common_ops 1.6979ms 1.2671ms 789.1826 Ops/s 805.1936 Ops/s $\color{#d91a1a}-1.99\%$
test_creation 34.5010μs 1.5020μs 665.7953 KOps/s 681.2213 KOps/s $\color{#d91a1a}-2.26\%$
test_creation_empty 47.1110μs 17.4584μs 57.2790 KOps/s 61.5662 KOps/s $\textbf{\color{#d91a1a}-6.96\%}$
test_creation_nested_1 65.6810μs 18.9195μs 52.8556 KOps/s 55.5292 KOps/s $\color{#d91a1a}-4.81\%$
test_creation_nested_2 0.2026ms 21.6747μs 46.1367 KOps/s 48.2788 KOps/s $\color{#d91a1a}-4.44\%$
test_clone 0.2388ms 28.7817μs 34.7443 KOps/s 35.7818 KOps/s $\color{#d91a1a}-2.90\%$
test_getitem[int] 1.3547ms 15.5530μs 64.2964 KOps/s 65.0581 KOps/s $\color{#d91a1a}-1.17\%$
test_getitem[slice_int] 0.1725ms 27.4658μs 36.4089 KOps/s 37.3609 KOps/s $\color{#d91a1a}-2.55\%$
test_getitem[range] 0.2698ms 0.1074ms 9.3075 KOps/s 9.3122 KOps/s $\color{#d91a1a}-0.05\%$
test_getitem[tuple] 98.8858ms 29.6311μs 33.7484 KOps/s 42.8135 KOps/s $\textbf{\color{#d91a1a}-21.17\%}$
test_getitem[list] 0.2529ms 96.5129μs 10.3613 KOps/s 10.5172 KOps/s $\color{#d91a1a}-1.48\%$
test_setitem_dim[int] 0.1857ms 51.5049μs 19.4156 KOps/s 19.9948 KOps/s $\color{#d91a1a}-2.90\%$
test_setitem_dim[slice_int] 0.2077ms 74.1988μs 13.4773 KOps/s 13.5440 KOps/s $\color{#d91a1a}-0.49\%$
test_setitem_dim[range] 0.2747ms 0.1334ms 7.4976 KOps/s 7.5606 KOps/s $\color{#d91a1a}-0.83\%$
test_setitem_dim[tuple] 0.1349ms 66.9341μs 14.9401 KOps/s 15.1280 KOps/s $\color{#d91a1a}-1.24\%$
test_setitem 0.2218ms 41.9827μs 23.8193 KOps/s 24.2355 KOps/s $\color{#d91a1a}-1.72\%$
test_set 0.1948ms 41.5856μs 24.0468 KOps/s 24.8134 KOps/s $\color{#d91a1a}-3.09\%$
test_set_shared 0.3135ms 50.5755μs 19.7724 KOps/s 20.1794 KOps/s $\color{#d91a1a}-2.02\%$
test_update 0.2029ms 51.7076μs 19.3395 KOps/s 20.2748 KOps/s $\color{#d91a1a}-4.61\%$
test_update_nested 0.3498ms 58.0910μs 17.2144 KOps/s 17.8914 KOps/s $\color{#d91a1a}-3.78\%$
test_update__nested 0.2085ms 58.2589μs 17.1648 KOps/s 17.4129 KOps/s $\color{#d91a1a}-1.43\%$
test_set_nested 0.1948ms 43.9003μs 22.7789 KOps/s 23.7334 KOps/s $\color{#d91a1a}-4.02\%$
test_set_nested_new 0.1951ms 47.6868μs 20.9702 KOps/s 21.9006 KOps/s $\color{#d91a1a}-4.25\%$
test_select 0.2094ms 61.0265μs 16.3863 KOps/s 17.0127 KOps/s $\color{#d91a1a}-3.68\%$
test_select_nested 0.5233ms 42.4614μs 23.5508 KOps/s 23.9938 KOps/s $\color{#d91a1a}-1.85\%$
test_exclude_nested 97.1520μs 59.7800μs 16.7280 KOps/s 16.8627 KOps/s $\color{#d91a1a}-0.80\%$
test_empty[True] 0.2827ms 0.2471ms 4.0478 KOps/s 4.0504 KOps/s $\color{#d91a1a}-0.07\%$
test_empty[False] 3.6561μs 0.7399μs 1.3515 MOps/s 1.4118 MOps/s $\color{#d91a1a}-4.27\%$
test_to 93.0820μs 25.2383μs 39.6223 KOps/s 40.8035 KOps/s $\color{#d91a1a}-2.90\%$
test_to_nonblocking 53.6010μs 24.5213μs 40.7808 KOps/s 42.7863 KOps/s $\color{#d91a1a}-4.69\%$
test_unbind_speed 1.1044ms 0.2768ms 3.6128 KOps/s 3.6634 KOps/s $\color{#d91a1a}-1.38\%$
test_unbind_speed_stack0 0.3166ms 0.2723ms 3.6723 KOps/s 3.7597 KOps/s $\color{#d91a1a}-2.33\%$
test_unbind_speed_stack1 97.7814ms 0.7101ms 1.4083 KOps/s 1.4393 KOps/s $\color{#d91a1a}-2.15\%$
test_split 0.1015s 2.2171ms 451.0466 Ops/s 458.0612 Ops/s $\color{#d91a1a}-1.53\%$
test_chunk 0.1006s 2.2241ms 449.6210 Ops/s 459.6788 Ops/s $\color{#d91a1a}-2.19\%$
test_creation[device0] 0.3497ms 0.1257ms 7.9562 KOps/s 7.9598 KOps/s $\color{#d91a1a}-0.05\%$
test_creation_from_tensor 0.3476ms 0.1276ms 7.8396 KOps/s 7.8323 KOps/s $\color{#35bf28}+0.09\%$
test_add_one[memmap_tensor0] 0.1336ms 8.3442μs 119.8439 KOps/s 121.3736 KOps/s $\color{#d91a1a}-1.26\%$
test_contiguous[memmap_tensor0] 30.2010μs 2.2597μs 442.5303 KOps/s 452.3759 KOps/s $\color{#d91a1a}-2.18\%$
test_stack[memmap_tensor0] 32.8110μs 6.5653μs 152.3168 KOps/s 158.7394 KOps/s $\color{#d91a1a}-4.05\%$
test_memmaptd_index 1.3017ms 0.4238ms 2.3594 KOps/s 2.3769 KOps/s $\color{#d91a1a}-0.73\%$
test_memmaptd_index_astensor 0.7186ms 0.4728ms 2.1148 KOps/s 2.1145 KOps/s $\color{#35bf28}+0.01\%$
test_memmaptd_index_op 1.4265ms 1.0257ms 974.9326 Ops/s 988.7392 Ops/s $\color{#d91a1a}-1.40\%$
test_serialize_model 0.1304s 0.1298s 7.7024 Ops/s 7.6790 Ops/s $\color{#35bf28}+0.30\%$
test_serialize_model_pickle 1.3472s 1.2128s 0.8245 Ops/s 0.8237 Ops/s $\color{#35bf28}+0.10\%$
test_serialize_weights 0.1299s 0.1290s 7.7543 Ops/s 7.7316 Ops/s $\color{#35bf28}+0.29\%$
test_serialize_weights_returnearly 55.0636ms 47.2688ms 21.1556 Ops/s 15.6155 Ops/s $\textbf{\color{#35bf28}+35.48\%}$
test_serialize_weights_pickle 1.3564s 1.2145s 0.8234 Ops/s 0.8206 Ops/s $\color{#35bf28}+0.33\%$
test_reshape_pytree 0.1462ms 35.8822μs 27.8689 KOps/s 28.7396 KOps/s $\color{#d91a1a}-3.03\%$
test_reshape_td 0.1414ms 41.7484μs 23.9530 KOps/s 25.0145 KOps/s $\color{#d91a1a}-4.24\%$
test_view_pytree 0.1278ms 35.6816μs 28.0256 KOps/s 28.4900 KOps/s $\color{#d91a1a}-1.63\%$
test_view_td 0.1697ms 46.8016μs 21.3668 KOps/s 21.9913 KOps/s $\color{#d91a1a}-2.84\%$
test_unbind_pytree 0.1597ms 34.1160μs 29.3117 KOps/s 29.8465 KOps/s $\color{#d91a1a}-1.79\%$
test_unbind_td 0.3689ms 42.3218μs 23.6285 KOps/s 24.0077 KOps/s $\color{#d91a1a}-1.58\%$
test_split_pytree 0.3275ms 46.6207μs 21.4497 KOps/s 21.9521 KOps/s $\color{#d91a1a}-2.29\%$
test_split_td 0.2120ms 55.8519μs 17.9045 KOps/s 18.0093 KOps/s $\color{#d91a1a}-0.58\%$
test_add_pytree 0.2138ms 56.2293μs 17.7843 KOps/s 18.1707 KOps/s $\color{#d91a1a}-2.13\%$
test_add_td 0.2407ms 91.1571μs 10.9701 KOps/s 11.1736 KOps/s $\color{#d91a1a}-1.82\%$
test_compile_add_one_nested[tensordict-compile] 0.4006ms 0.2047ms 4.8847 KOps/s 4.7612 KOps/s $\color{#35bf28}+2.59\%$
test_compile_add_one_nested[tensordict-eager] 0.3149ms 0.1563ms 6.3978 KOps/s 6.4178 KOps/s $\color{#d91a1a}-0.31\%$
test_compile_add_one_nested[pytree-compile] 0.3100ms 0.1442ms 6.9360 KOps/s 6.7636 KOps/s $\color{#35bf28}+2.55\%$
test_compile_add_one_nested[pytree-eager] 0.3626ms 0.1803ms 5.5469 KOps/s 5.6673 KOps/s $\color{#d91a1a}-2.12\%$
test_compile_copy_nested[tensordict-compile] 0.2151ms 20.8658μs 47.9254 KOps/s 48.0142 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_copy_nested[tensordict-eager] 0.2261ms 42.9571μs 23.2790 KOps/s 23.1935 KOps/s $\color{#35bf28}+0.37\%$
test_compile_copy_nested[pytree-compile] 0.2214ms 64.4761μs 15.5096 KOps/s 15.3917 KOps/s $\color{#35bf28}+0.77\%$
test_compile_copy_nested[pytree-eager] 0.2004ms 50.0624μs 19.9751 KOps/s 20.1333 KOps/s $\color{#d91a1a}-0.79\%$
test_compile_add_one_flat[tensordict-compile] 0.4295ms 0.3196ms 3.1285 KOps/s 3.1438 KOps/s $\color{#d91a1a}-0.48\%$
test_compile_add_one_flat[tensordict-eager] 0.4245ms 0.2091ms 4.7815 KOps/s 4.8015 KOps/s $\color{#d91a1a}-0.42\%$
test_compile_add_one_flat[tensorclass-compile] 0.2707ms 0.1269ms 7.8815 KOps/s 7.7739 KOps/s $\color{#35bf28}+1.38\%$
test_compile_add_one_flat[tensorclass-eager] 0.2127ms 61.1961μs 16.3409 KOps/s 16.7711 KOps/s $\color{#d91a1a}-2.57\%$
test_compile_add_one_flat[pytree-compile] 0.4458ms 0.3200ms 3.1253 KOps/s 3.1483 KOps/s $\color{#d91a1a}-0.73\%$
test_compile_add_one_flat[pytree-eager] 0.8073ms 0.6257ms 1.5981 KOps/s 1.7117 KOps/s $\textbf{\color{#d91a1a}-6.64\%}$
test_compile_add_self_flat[tensordict-eager] 0.3827ms 0.2494ms 4.0091 KOps/s 4.0291 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_add_self_flat[tensordict-compile] 0.4455ms 0.3217ms 3.1087 KOps/s 3.1096 KOps/s $\color{#d91a1a}-0.03\%$
test_compile_add_self_flat[tensorclass-eager] 0.2214ms 71.3162μs 14.0221 KOps/s 14.2146 KOps/s $\color{#d91a1a}-1.35\%$
test_compile_add_self_flat[tensorclass-compile] 0.2705ms 0.1280ms 7.8139 KOps/s 7.5744 KOps/s $\color{#35bf28}+3.16\%$
test_compile_add_self_flat[pytree-eager] 0.6871ms 0.5194ms 1.9254 KOps/s 1.9738 KOps/s $\color{#d91a1a}-2.45\%$
test_compile_add_self_flat[pytree-compile] 0.4679ms 0.3185ms 3.1399 KOps/s 3.1338 KOps/s $\color{#35bf28}+0.19\%$
test_compile_copy_flat[tensordict-compile] 0.1808ms 18.6812μs 53.5297 KOps/s 51.3280 KOps/s $\color{#35bf28}+4.29\%$
test_compile_copy_flat[tensordict-eager] 60.8110μs 26.6777μs 37.4844 KOps/s 36.8846 KOps/s $\color{#35bf28}+1.63\%$
test_compile_copy_flat[pytree-compile] 0.1853ms 69.6579μs 14.3559 KOps/s 14.2982 KOps/s $\color{#35bf28}+0.40\%$
test_compile_copy_flat[pytree-eager] 79.4310μs 51.5103μs 19.4136 KOps/s 19.3787 KOps/s $\color{#35bf28}+0.18\%$
test_compile_assign_and_add[tensordict-compile] 2.3341ms 0.8070ms 1.2392 KOps/s 1.1447 KOps/s $\textbf{\color{#35bf28}+8.25\%}$
test_compile_assign_and_add[tensordict-eager] 3.4430ms 3.1788ms 314.5871 Ops/s 322.9717 Ops/s $\color{#d91a1a}-2.60\%$
test_compile_assign_and_add[pytree-compile] 2.2892ms 0.7981ms 1.2530 KOps/s 1.1609 KOps/s $\textbf{\color{#35bf28}+7.93\%}$
test_compile_assign_and_add[pytree-eager] 3.6389ms 3.2281ms 309.7759 Ops/s 332.5546 Ops/s $\textbf{\color{#d91a1a}-6.85\%}$
test_compile_indexing[tensor-tensordict-compile] 0.3028ms 0.1127ms 8.8720 KOps/s 8.8378 KOps/s $\color{#35bf28}+0.39\%$
test_compile_indexing[tensor-tensordict-eager] 0.2466ms 65.0728μs 15.3674 KOps/s 15.4243 KOps/s $\color{#d91a1a}-0.37\%$
test_compile_indexing[tensor-tensorclass-compile] 0.3116ms 0.1026ms 9.7425 KOps/s 9.4164 KOps/s $\color{#35bf28}+3.46\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2550ms 45.6600μs 21.9010 KOps/s 22.9164 KOps/s $\color{#d91a1a}-4.43\%$
test_compile_indexing[tensor-pytree-compile] 0.2970ms 0.1095ms 9.1354 KOps/s 9.7615 KOps/s $\textbf{\color{#d91a1a}-6.41\%}$
test_compile_indexing[tensor-pytree-eager] 0.2244ms 45.5997μs 21.9300 KOps/s 23.8467 KOps/s $\textbf{\color{#d91a1a}-8.04\%}$
test_compile_indexing[slice-tensordict-compile] 0.2865ms 0.1367ms 7.3136 KOps/s 7.3561 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_indexing[slice-tensordict-eager] 0.1616ms 25.4016μs 39.3675 KOps/s 39.4392 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_indexing[slice-tensorclass-compile] 0.3172ms 0.1317ms 7.5939 KOps/s 7.7394 KOps/s $\color{#d91a1a}-1.88\%$
test_compile_indexing[slice-tensorclass-eager] 97.4820μs 20.8856μs 47.8800 KOps/s 49.0836 KOps/s $\color{#d91a1a}-2.45\%$
test_compile_indexing[slice-pytree-compile] 0.3011ms 0.1334ms 7.4956 KOps/s 7.6779 KOps/s $\color{#d91a1a}-2.37\%$
test_compile_indexing[slice-pytree-eager] 0.1946ms 21.0602μs 47.4829 KOps/s 48.3725 KOps/s $\color{#d91a1a}-1.84\%$
test_compile_indexing[int-tensordict-compile] 0.3330ms 0.1454ms 6.8759 KOps/s 7.3247 KOps/s $\textbf{\color{#d91a1a}-6.13\%}$
test_compile_indexing[int-tensordict-eager] 0.5349ms 24.8252μs 40.2817 KOps/s 40.0306 KOps/s $\color{#35bf28}+0.63\%$
test_compile_indexing[int-tensorclass-compile] 0.3136ms 0.1379ms 7.2517 KOps/s 7.6448 KOps/s $\textbf{\color{#d91a1a}-5.14\%}$
test_compile_indexing[int-tensorclass-eager] 0.1519ms 21.0834μs 47.4306 KOps/s 48.6631 KOps/s $\color{#d91a1a}-2.53\%$
test_compile_indexing[int-pytree-compile] 0.3082ms 0.1379ms 7.2534 KOps/s 7.6566 KOps/s $\textbf{\color{#d91a1a}-5.27\%}$
test_compile_indexing[int-pytree-eager] 0.1671ms 21.0263μs 47.5595 KOps/s 40.5087 KOps/s $\textbf{\color{#35bf28}+17.41\%}$
test_mod_add[eager] 0.2138ms 34.7562μs 28.7718 KOps/s 31.9202 KOps/s $\textbf{\color{#d91a1a}-9.86\%}$
test_mod_add[compile] 0.2598ms 75.4323μs 13.2569 KOps/s 14.4094 KOps/s $\textbf{\color{#d91a1a}-8.00\%}$
test_mod_add[compile-overhead] 0.2585ms 0.1325ms 7.5477 KOps/s 6.6820 KOps/s $\textbf{\color{#35bf28}+12.96\%}$
test_mod_wrap[eager] 0.4244ms 0.2517ms 3.9734 KOps/s 4.2242 KOps/s $\textbf{\color{#d91a1a}-5.94\%}$
test_mod_wrap[compile] 0.4463ms 0.2798ms 3.5743 KOps/s 3.5377 KOps/s $\color{#35bf28}+1.03\%$
test_mod_wrap[compile-overhead] 7.8061ms 4.1308ms 242.0851 Ops/s 249.3734 Ops/s $\color{#d91a1a}-2.92\%$
test_mod_wrap_and_backward[eager] 1.5831ms 1.2999ms 769.2718 Ops/s 720.9330 Ops/s $\textbf{\color{#35bf28}+6.71\%}$
test_mod_wrap_and_backward[compile] 2.6990ms 1.2952ms 772.0948 Ops/s 756.0161 Ops/s $\color{#35bf28}+2.13\%$
test_mod_wrap_and_backward[compile-overhead] 1.2968ms 0.8814ms 1.1346 KOps/s 1.1099 KOps/s $\color{#35bf28}+2.22\%$
test_seq_add[eager] 0.4892ms 97.7719μs 10.2279 KOps/s 10.3955 KOps/s $\color{#d91a1a}-1.61\%$
test_seq_add[compile] 0.5985ms 81.4574μs 12.2764 KOps/s 12.5347 KOps/s $\color{#d91a1a}-2.06\%$
test_seq_add[compile-overhead] 0.2658ms 0.1130ms 8.8483 KOps/s 8.6556 KOps/s $\color{#35bf28}+2.23\%$
test_seq_wrap[eager] 0.7967ms 0.3734ms 2.6780 KOps/s 2.5904 KOps/s $\color{#35bf28}+3.38\%$
test_seq_wrap[compile] 0.4461ms 0.2998ms 3.3353 KOps/s 3.2107 KOps/s $\color{#35bf28}+3.88\%$
test_seq_wrap[compile-overhead] 0.3491ms 0.2050ms 4.8784 KOps/s 4.8547 KOps/s $\color{#35bf28}+0.49\%$
test_func_call_runtime[False-eager] 0.8992ms 0.7101ms 1.4083 KOps/s 1.3955 KOps/s $\color{#35bf28}+0.92\%$
test_func_call_runtime[False-compile] 1.0116ms 0.7742ms 1.2917 KOps/s 1.2795 KOps/s $\color{#35bf28}+0.96\%$
test_func_call_runtime[False-compile-overhead] 0.4890ms 0.3477ms 2.8762 KOps/s 2.8625 KOps/s $\color{#35bf28}+0.48\%$
test_func_call_runtime[True-eager] 1.0411ms 0.8638ms 1.1577 KOps/s 1.1174 KOps/s $\color{#35bf28}+3.60\%$
test_func_call_runtime[True-compile] 0.9627ms 0.8092ms 1.2358 KOps/s 1.2218 KOps/s $\color{#35bf28}+1.14\%$
test_func_call_runtime[True-compile-overhead] 0.5082ms 0.3804ms 2.6288 KOps/s 2.6346 KOps/s $\color{#d91a1a}-0.22\%$
test_func_call_cm_runtime[False-eager] 0.8794ms 0.7054ms 1.4176 KOps/s 1.4252 KOps/s $\color{#d91a1a}-0.53\%$
test_func_call_cm_runtime[False-compile] 0.9853ms 0.7761ms 1.2886 KOps/s 1.2651 KOps/s $\color{#35bf28}+1.85\%$
test_func_call_cm_runtime[False-compile-overhead] 1.1867ms 0.3499ms 2.8577 KOps/s 2.8562 KOps/s $\color{#35bf28}+0.05\%$
test_func_call_cm_runtime[True-eager] 1.1941ms 0.9931ms 1.0070 KOps/s 1.0250 KOps/s $\color{#d91a1a}-1.75\%$
test_func_call_cm_runtime[True-compile] 1.0054ms 0.8298ms 1.2050 KOps/s 1.1885 KOps/s $\color{#35bf28}+1.39\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5341ms 0.4038ms 2.4764 KOps/s 2.4553 KOps/s $\color{#35bf28}+0.86\%$
test_vmap_func_call_cm_runtime[eager] 2.6037ms 1.9957ms 501.0730 Ops/s 498.3665 Ops/s $\color{#35bf28}+0.54\%$
test_vmap_func_call_cm_runtime[compile] 1.0938ms 0.8650ms 1.1561 KOps/s 1.1671 KOps/s $\color{#d91a1a}-0.94\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5425ms 0.4090ms 2.4447 KOps/s 2.4366 KOps/s $\color{#35bf28}+0.34\%$
test_distributed 4.6046ms 0.2437ms 4.1033 KOps/s 8.5551 KOps/s $\textbf{\color{#d91a1a}-52.04\%}$
test_tdmodule 0.1381ms 15.4088μs 64.8980 KOps/s 67.7662 KOps/s $\color{#d91a1a}-4.23\%$
test_tdmodule_dispatch 67.2110μs 31.1128μs 32.1411 KOps/s 32.5804 KOps/s $\color{#d91a1a}-1.35\%$
test_tdseq 35.0510μs 16.0079μs 62.4692 KOps/s 63.4949 KOps/s $\color{#d91a1a}-1.62\%$
test_tdseq_dispatch 52.8210μs 33.1137μs 30.1989 KOps/s 31.1005 KOps/s $\color{#d91a1a}-2.90\%$
test_instantiation_functorch 2.0186ms 1.8238ms 548.3127 Ops/s 546.8873 Ops/s $\color{#35bf28}+0.26\%$
test_instantiation_td 1.8091ms 1.1832ms 845.1315 Ops/s 839.9280 Ops/s $\color{#35bf28}+0.62\%$
test_exec_functorch 0.3541ms 0.2059ms 4.8573 KOps/s 4.8586 KOps/s $\color{#d91a1a}-0.03\%$
test_exec_functional_call 0.3514ms 0.2025ms 4.9381 KOps/s 4.9708 KOps/s $\color{#d91a1a}-0.66\%$
test_exec_td 0.3422ms 0.2061ms 4.8516 KOps/s 4.8044 KOps/s $\color{#35bf28}+0.98\%$
test_exec_td_decorator 0.5423ms 0.2496ms 4.0064 KOps/s 3.9671 KOps/s $\color{#35bf28}+0.99\%$
test_vmap_mlp_speed[True-True] 0.8284ms 0.6647ms 1.5045 KOps/s 1.4823 KOps/s $\color{#35bf28}+1.50\%$
test_vmap_mlp_speed[True-False] 0.8546ms 0.6579ms 1.5199 KOps/s 1.4865 KOps/s $\color{#35bf28}+2.25\%$
test_vmap_mlp_speed[False-True] 0.7210ms 0.5478ms 1.8256 KOps/s 1.7914 KOps/s $\color{#35bf28}+1.91\%$
test_vmap_mlp_speed[False-False] 0.7226ms 0.5499ms 1.8185 KOps/s 1.8047 KOps/s $\color{#35bf28}+0.76\%$
test_vmap_mlp_speed_decorator[True-True] 1.2819ms 0.6512ms 1.5356 KOps/s 1.5282 KOps/s $\color{#35bf28}+0.49\%$
test_vmap_mlp_speed_decorator[True-False] 0.7973ms 0.6502ms 1.5379 KOps/s 1.5441 KOps/s $\color{#d91a1a}-0.40\%$
test_vmap_mlp_speed_decorator[False-True] 0.7449ms 0.5672ms 1.7632 KOps/s 1.7746 KOps/s $\color{#d91a1a}-0.64\%$
test_vmap_mlp_speed_decorator[False-False] 0.7468ms 0.5669ms 1.7639 KOps/s 1.7741 KOps/s $\color{#d91a1a}-0.57\%$
test_vmap_transformer_speed[True-True] 8.2496ms 7.9887ms 125.1768 Ops/s 124.8297 Ops/s $\color{#35bf28}+0.28\%$
test_vmap_transformer_speed[True-False] 8.2947ms 7.9775ms 125.3526 Ops/s 125.5927 Ops/s $\color{#d91a1a}-0.19\%$
test_vmap_transformer_speed[False-True] 8.3980ms 7.9388ms 125.9630 Ops/s 128.8030 Ops/s $\color{#d91a1a}-2.20\%$
test_vmap_transformer_speed[False-False] 8.0526ms 7.8014ms 128.1814 Ops/s 128.5977 Ops/s $\color{#d91a1a}-0.32\%$
test_vmap_transformer_speed_decorator[True-True] 19.5781ms 18.8323ms 53.1003 Ops/s 53.6710 Ops/s $\color{#d91a1a}-1.06\%$
test_vmap_transformer_speed_decorator[True-False] 19.0028ms 18.7419ms 53.3563 Ops/s 53.6218 Ops/s $\color{#d91a1a}-0.50\%$
test_vmap_transformer_speed_decorator[False-True] 19.4607ms 18.7221ms 53.4128 Ops/s 53.0444 Ops/s $\color{#35bf28}+0.69\%$
test_vmap_transformer_speed_decorator[False-False] 18.8811ms 18.5859ms 53.8044 Ops/s 53.5908 Ops/s $\color{#35bf28}+0.40\%$
test_to_module_speed[True] 1.4919ms 0.9399ms 1.0639 KOps/s 1.0702 KOps/s $\color{#d91a1a}-0.59\%$
test_to_module_speed[False] 1.2741ms 0.9122ms 1.0963 KOps/s 1.1035 KOps/s $\color{#d91a1a}-0.65\%$
test_tc_init 62.4220μs 35.1375μs 28.4596 KOps/s 27.7593 KOps/s $\color{#35bf28}+2.52\%$
test_tc_init_nested 0.1880ms 71.7077μs 13.9455 KOps/s 13.8423 KOps/s $\color{#35bf28}+0.75\%$
test_tc_first_layer_tensor 12.5503μs 0.6652μs 1.5032 MOps/s 1.4998 MOps/s $\color{#35bf28}+0.23\%$
test_tc_first_layer_nontensor 37.2610μs 2.2269μs 449.0647 KOps/s 452.1151 KOps/s $\color{#d91a1a}-0.67\%$
test_tc_second_layer_tensor 8.8102μs 1.3495μs 741.0343 KOps/s 733.8272 KOps/s $\color{#35bf28}+0.98\%$
test_tc_second_layer_nontensor 34.4110μs 2.9029μs 344.4798 KOps/s 342.3245 KOps/s $\color{#35bf28}+0.63\%$
test_unbind 0.2016s 12.3696ms 80.8431 Ops/s 93.0557 Ops/s $\textbf{\color{#d91a1a}-13.12\%}$
test_full_like 0.7623ms 0.5762ms 1.7354 KOps/s 1.7292 KOps/s $\color{#35bf28}+0.36\%$
test_zeros_like 0.3407ms 0.1982ms 5.0451 KOps/s 5.0433 KOps/s $\color{#35bf28}+0.03\%$
test_ones_like 0.3533ms 0.1983ms 5.0437 KOps/s 5.0463 KOps/s $\color{#d91a1a}-0.05\%$
test_clone 0.5570ms 0.4141ms 2.4150 KOps/s 2.4092 KOps/s $\color{#35bf28}+0.24\%$
test_squeeze 0.1408ms 9.6254μs 103.8921 KOps/s 101.4889 KOps/s $\color{#35bf28}+2.37\%$
test_unsqueeze 0.2195ms 72.1342μs 13.8630 KOps/s 14.0527 KOps/s $\color{#d91a1a}-1.35\%$
test_split 0.4456ms 0.1553ms 6.4386 KOps/s 6.4295 KOps/s $\color{#35bf28}+0.14\%$
test_permute 0.3156ms 0.1766ms 5.6640 KOps/s 5.6855 KOps/s $\color{#d91a1a}-0.38\%$
test_stack 1.3555ms 0.8844ms 1.1307 KOps/s 1.1225 KOps/s $\color{#35bf28}+0.73\%$
test_cat 1.3764ms 1.2329ms 811.0718 Ops/s 811.1889 Ops/s $\color{#d91a1a}-0.01\%$

@vmoens vmoens added the Quality label Sep 9, 2024
@vmoens vmoens merged commit f9e76e8 into main Sep 9, 2024
@vmoens vmoens deleted the fix-split-update_-tc branch September 9, 2024 17:19