Skip to content

Conversation

@vmoens
Copy link
Collaborator

@vmoens vmoens commented Oct 24, 2025

No description provided.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 24, 2025
@github-actions
Copy link

github-actions bot commented Oct 24, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 233. Improved: $\large\color{#35bf28}8$. Worsened: $\large\color{#d91a1a}5$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 41.4410μs 14.9920μs 66.7023 KOps/s 66.4242 KOps/s $\color{#35bf28}+0.42\%$
test_plain_set_stack_nested 37.7110μs 15.2790μs 65.4491 KOps/s 65.7087 KOps/s $\color{#d91a1a}-0.40\%$
test_plain_set_nested_inplace 46.8610μs 16.4532μs 60.7785 KOps/s 59.6174 KOps/s $\color{#35bf28}+1.95\%$
test_plain_set_stack_nested_inplace 45.2010μs 16.4348μs 60.8466 KOps/s 59.7696 KOps/s $\color{#35bf28}+1.80\%$
test_items 35.6100μs 5.8434μs 171.1319 KOps/s 169.9330 KOps/s $\color{#35bf28}+0.71\%$
test_items_nested 0.5886ms 0.5294ms 1.8888 KOps/s 1.8593 KOps/s $\color{#35bf28}+1.59\%$
test_items_nested_locked 0.7211ms 0.5313ms 1.8823 KOps/s 1.8505 KOps/s $\color{#35bf28}+1.72\%$
test_items_nested_leaf 0.1345ms 95.6812μs 10.4514 KOps/s 10.2291 KOps/s $\color{#35bf28}+2.17\%$
test_items_stack_nested 0.5953ms 0.5316ms 1.8811 KOps/s 1.8349 KOps/s $\color{#35bf28}+2.52\%$
test_items_stack_nested_leaf 0.2069ms 96.7466μs 10.3363 KOps/s 10.4194 KOps/s $\color{#d91a1a}-0.80\%$
test_items_stack_nested_locked 0.5949ms 0.5332ms 1.8756 KOps/s 1.8332 KOps/s $\color{#35bf28}+2.32\%$
test_keys 34.4810μs 4.2222μs 236.8416 KOps/s 237.2921 KOps/s $\color{#d91a1a}-0.19\%$
test_keys_nested 0.2859ms 0.1201ms 8.3293 KOps/s 8.2861 KOps/s $\color{#35bf28}+0.52\%$
test_keys_nested_locked 2.0662ms 0.1299ms 7.6974 KOps/s 7.7247 KOps/s $\color{#d91a1a}-0.35\%$
test_keys_nested_leaf 0.1536ms 0.1115ms 8.9696 KOps/s 9.0711 KOps/s $\color{#d91a1a}-1.12\%$
test_keys_stack_nested 0.1930ms 0.1204ms 8.3038 KOps/s 8.3181 KOps/s $\color{#d91a1a}-0.17\%$
test_keys_stack_nested_leaf 0.1527ms 0.1103ms 9.0649 KOps/s 9.1286 KOps/s $\color{#d91a1a}-0.70\%$
test_keys_stack_nested_locked 0.3008ms 0.1282ms 7.7999 KOps/s 7.7131 KOps/s $\color{#35bf28}+1.13\%$
test_values 8.1602μs 1.0298μs 971.0879 KOps/s 938.7874 KOps/s $\color{#35bf28}+3.44\%$
test_values_nested 0.5401ms 47.6233μs 20.9981 KOps/s 20.9306 KOps/s $\color{#35bf28}+0.32\%$
test_values_nested_locked 80.7610μs 50.9957μs 19.6095 KOps/s 19.5755 KOps/s $\color{#35bf28}+0.17\%$
test_values_nested_leaf 0.1938ms 54.1244μs 18.4760 KOps/s 18.3750 KOps/s $\color{#35bf28}+0.55\%$
test_values_stack_nested 0.1108ms 47.6247μs 20.9975 KOps/s 20.9099 KOps/s $\color{#35bf28}+0.42\%$
test_values_stack_nested_leaf 85.1810μs 53.7885μs 18.5913 KOps/s 18.5168 KOps/s $\color{#35bf28}+0.40\%$
test_values_stack_nested_locked 73.8210μs 51.2837μs 19.4994 KOps/s 19.5937 KOps/s $\color{#d91a1a}-0.48\%$
test_membership 6.5600μs 0.8559μs 1.1684 MOps/s 1.1862 MOps/s $\color{#d91a1a}-1.50\%$
test_membership_nested 28.8110μs 3.1412μs 318.3463 KOps/s 314.3996 KOps/s $\color{#35bf28}+1.26\%$
test_membership_nested_leaf 31.6010μs 3.1325μs 319.2348 KOps/s 313.2777 KOps/s $\color{#35bf28}+1.90\%$
test_membership_stacked_nested 34.1810μs 3.1647μs 315.9833 KOps/s 311.7072 KOps/s $\color{#35bf28}+1.37\%$
test_membership_stacked_nested_leaf 25.0200μs 3.1660μs 315.8605 KOps/s 311.9584 KOps/s $\color{#35bf28}+1.25\%$
test_membership_nested_last 25.2400μs 4.5914μs 217.7992 KOps/s 213.4111 KOps/s $\color{#35bf28}+2.06\%$
test_membership_nested_leaf_last 30.3410μs 4.5989μs 217.4425 KOps/s 212.3680 KOps/s $\color{#35bf28}+2.39\%$
test_membership_stacked_nested_last 39.0910μs 4.6016μs 217.3147 KOps/s 213.9973 KOps/s $\color{#35bf28}+1.55\%$
test_membership_stacked_nested_leaf_last 54.1010μs 4.6168μs 216.5981 KOps/s 213.9538 KOps/s $\color{#35bf28}+1.24\%$
test_nested_getleaf 50.8510μs 21.6819μs 46.1214 KOps/s 46.2803 KOps/s $\color{#d91a1a}-0.34\%$
test_nested_get 0.1729ms 20.6883μs 48.3366 KOps/s 50.2028 KOps/s $\color{#d91a1a}-3.72\%$
test_stacked_getleaf 46.7810μs 21.7138μs 46.0536 KOps/s 46.5885 KOps/s $\color{#d91a1a}-1.15\%$
test_stacked_get 51.7810μs 20.6068μs 48.5277 KOps/s 47.8412 KOps/s $\color{#35bf28}+1.43\%$
test_nested_getitemleaf 57.6710μs 22.5802μs 44.2866 KOps/s 44.0279 KOps/s $\color{#35bf28}+0.59\%$
test_nested_getitem 45.6700μs 21.3818μs 46.7688 KOps/s 46.8548 KOps/s $\color{#d91a1a}-0.18\%$
test_stacked_getitemleaf 50.0710μs 22.2643μs 44.9150 KOps/s 45.0634 KOps/s $\color{#d91a1a}-0.33\%$
test_stacked_getitem 49.5910μs 21.3271μs 46.8887 KOps/s 47.6119 KOps/s $\color{#d91a1a}-1.52\%$
test_lock_nested 0.5987ms 0.4700ms 2.1277 KOps/s 2.1445 KOps/s $\color{#d91a1a}-0.79\%$
test_lock_stack_nested 0.5428ms 0.4742ms 2.1089 KOps/s 2.1068 KOps/s $\color{#35bf28}+0.10\%$
test_unlock_nested 0.4937ms 0.3797ms 2.6340 KOps/s 2.6498 KOps/s $\color{#d91a1a}-0.60\%$
test_unlock_stack_nested 0.4370ms 0.3782ms 2.6440 KOps/s 2.6344 KOps/s $\color{#35bf28}+0.36\%$
test_flatten_speed 0.1706ms 0.1222ms 8.1817 KOps/s 8.1884 KOps/s $\color{#d91a1a}-0.08\%$
test_unflatten_speed 0.7487ms 0.5958ms 1.6785 KOps/s 1.6860 KOps/s $\color{#d91a1a}-0.45\%$
test_common_ops 0.8940ms 0.7371ms 1.3566 KOps/s 1.3502 KOps/s $\color{#35bf28}+0.47\%$
test_creation 65.2920μs 2.7552μs 362.9506 KOps/s 364.2268 KOps/s $\color{#d91a1a}-0.35\%$
test_creation_empty 38.7310μs 9.1543μs 109.2379 KOps/s 109.4960 KOps/s $\color{#d91a1a}-0.24\%$
test_creation_nested_1 64.8710μs 12.2229μs 81.8136 KOps/s 81.6822 KOps/s $\color{#35bf28}+0.16\%$
test_creation_nested_2 51.3600μs 16.1693μs 61.8455 KOps/s 62.1011 KOps/s $\color{#d91a1a}-0.41\%$
test_clone 45.1710μs 13.5313μs 73.9025 KOps/s 73.9521 KOps/s $\color{#d91a1a}-0.07\%$
test_getitem[int] 1.1848ms 14.0670μs 71.0882 KOps/s 71.2786 KOps/s $\color{#d91a1a}-0.27\%$
test_getitem[slice_int] 0.1403ms 25.1727μs 39.7256 KOps/s 39.8829 KOps/s $\color{#d91a1a}-0.39\%$
test_getitem[range] 0.1790ms 60.1507μs 16.6249 KOps/s 16.7384 KOps/s $\color{#d91a1a}-0.68\%$
test_getitem[tuple] 0.1406ms 24.4157μs 40.9572 KOps/s 41.2708 KOps/s $\color{#d91a1a}-0.76\%$
test_getitem[list] 0.2061ms 52.8037μs 18.9381 KOps/s 18.3664 KOps/s $\color{#35bf28}+3.11\%$
test_setitem_dim[int] 44.8900μs 24.1499μs 41.4081 KOps/s 40.9285 KOps/s $\color{#35bf28}+1.17\%$
test_setitem_dim[slice_int] 67.1310μs 44.3027μs 22.5720 KOps/s 22.2477 KOps/s $\color{#35bf28}+1.46\%$
test_setitem_dim[range] 0.1159ms 86.3955μs 11.5747 KOps/s 11.5350 KOps/s $\color{#35bf28}+0.34\%$
test_setitem_dim[tuple] 92.0510μs 41.3394μs 24.1900 KOps/s 22.9655 KOps/s $\textbf{\color{#35bf28}+5.33\%}$
test_setitem 50.7310μs 18.4720μs 54.1360 KOps/s 54.0972 KOps/s $\color{#35bf28}+0.07\%$
test_set 60.7610μs 17.4284μs 57.3776 KOps/s 57.1801 KOps/s $\color{#35bf28}+0.35\%$
test_set_shared 0.5551ms 0.2034ms 4.9154 KOps/s 4.9708 KOps/s $\color{#d91a1a}-1.11\%$
test_update 0.3867ms 22.8301μs 43.8018 KOps/s 44.1396 KOps/s $\color{#d91a1a}-0.77\%$
test_update_nested 65.7310μs 35.5796μs 28.1060 KOps/s 28.7033 KOps/s $\color{#d91a1a}-2.08\%$
test_update__nested 0.4805ms 34.3416μs 29.1192 KOps/s 28.0782 KOps/s $\color{#35bf28}+3.71\%$
test_set_nested 54.5710μs 19.3277μs 51.7392 KOps/s 50.8343 KOps/s $\color{#35bf28}+1.78\%$
test_set_nested_new 59.1810μs 24.8691μs 40.2106 KOps/s 40.5400 KOps/s $\color{#d91a1a}-0.81\%$
test_select 0.2147ms 42.3447μs 23.6157 KOps/s 23.5473 KOps/s $\color{#35bf28}+0.29\%$
test_select_nested 0.1122ms 74.7336μs 13.3809 KOps/s 13.4920 KOps/s $\color{#d91a1a}-0.82\%$
test_exclude_nested 0.2521ms 98.3575μs 10.1670 KOps/s 9.9852 KOps/s $\color{#35bf28}+1.82\%$
test_empty[True] 0.6229ms 0.4362ms 2.2925 KOps/s 2.2575 KOps/s $\color{#35bf28}+1.55\%$
test_empty[False] 7.7550μs 1.3286μs 752.6721 KOps/s 743.1142 KOps/s $\color{#35bf28}+1.29\%$
test_to 0.1030ms 72.8025μs 13.7358 KOps/s 13.2498 KOps/s $\color{#35bf28}+3.67\%$
test_to_nonblocking 0.1227ms 66.5948μs 15.0162 KOps/s 14.8960 KOps/s $\color{#35bf28}+0.81\%$
test_unbind_speed 0.3994ms 0.3249ms 3.0777 KOps/s 3.1519 KOps/s $\color{#d91a1a}-2.36\%$
test_unbind_speed_stack0 0.4257ms 0.3212ms 3.1129 KOps/s 3.1668 KOps/s $\color{#d91a1a}-1.70\%$
test_unbind_speed_stack1 98.5806ms 0.9295ms 1.0758 KOps/s 1.1601 KOps/s $\textbf{\color{#d91a1a}-7.26\%}$
test_split 1.3063ms 1.1612ms 861.1737 Ops/s 868.3685 Ops/s $\color{#d91a1a}-0.83\%$
test_chunk 98.7753ms 1.2295ms 813.3612 Ops/s 762.0693 Ops/s $\textbf{\color{#35bf28}+6.73\%}$
test_consolidate[False-None] 4.0812ms 3.8779ms 257.8696 Ops/s 256.2867 Ops/s $\color{#35bf28}+0.62\%$
test_consolidate[default-None] 2.4485ms 2.0590ms 485.6655 Ops/s 463.8541 Ops/s $\color{#35bf28}+4.70\%$
test_consolidate[reduce-overhead-None] 2.4115ms 1.9912ms 502.2162 Ops/s 478.6888 Ops/s $\color{#35bf28}+4.91\%$
test_consolidate_njt[False-None] 0.1806s 10.4828ms 95.3944 Ops/s 78.9810 Ops/s $\textbf{\color{#35bf28}+20.78\%}$
test_to[False-False-None] 2.5026ms 2.0876ms 479.0255 Ops/s 475.9594 Ops/s $\color{#35bf28}+0.64\%$
test_to[True-False-None] 2.2671ms 1.8844ms 530.6786 Ops/s 527.9682 Ops/s $\color{#35bf28}+0.51\%$
test_to[within-False-None] 6.2560ms 5.8186ms 171.8636 Ops/s 169.9608 Ops/s $\color{#35bf28}+1.12\%$
test_to[True-default-None] 12.3183ms 11.8350ms 84.4952 Ops/s 84.4150 Ops/s $\color{#35bf28}+0.10\%$
test_to_njt[False-False-None] 8.7621ms 8.6142ms 116.0868 Ops/s 116.2974 Ops/s $\color{#d91a1a}-0.18\%$
test_to_njt[True-False-None] 7.5586ms 7.2834ms 137.2988 Ops/s 133.7848 Ops/s $\color{#35bf28}+2.63\%$
test_to_njt[within-False-None] 17.1154ms 16.4057ms 60.9543 Ops/s 60.3625 Ops/s $\color{#35bf28}+0.98\%$
test_creation[device0] 0.4065ms 0.1105ms 9.0506 KOps/s 9.1585 KOps/s $\color{#d91a1a}-1.18\%$
test_creation_from_tensor 0.4358ms 0.1116ms 8.9573 KOps/s 9.0867 KOps/s $\color{#d91a1a}-1.42\%$
test_add_one[memmap_tensor0] 0.3738ms 7.0331μs 142.1846 KOps/s 144.7274 KOps/s $\color{#d91a1a}-1.76\%$
test_contiguous[memmap_tensor0] 13.0000μs 0.7230μs 1.3831 MOps/s 2.0061 MOps/s $\textbf{\color{#d91a1a}-31.05\%}$
test_stack[memmap_tensor0] 31.8610μs 4.7650μs 209.8615 KOps/s 206.8957 KOps/s $\color{#35bf28}+1.43\%$
test_memmaptd_index 1.0745ms 0.2800ms 3.5718 KOps/s 3.5846 KOps/s $\color{#d91a1a}-0.36\%$
test_memmaptd_index_astensor 0.5264ms 0.3727ms 2.6829 KOps/s 2.7008 KOps/s $\color{#d91a1a}-0.67\%$
test_memmaptd_index_op 0.9503ms 0.6301ms 1.5871 KOps/s 1.6153 KOps/s $\color{#d91a1a}-1.75\%$
test_serialize_model 0.3121s 0.1567s 6.3821 Ops/s 7.5836 Ops/s $\textbf{\color{#d91a1a}-15.84\%}$
test_serialize_model_pickle 1.3483s 1.1938s 0.8377 Ops/s 0.8371 Ops/s $\color{#35bf28}+0.07\%$
test_serialize_weights 0.1313s 0.1306s 7.6586 Ops/s 7.6446 Ops/s $\color{#35bf28}+0.18\%$
test_serialize_weights_returnearly 0.3927s 69.2548ms 14.4394 Ops/s 19.0313 Ops/s $\textbf{\color{#d91a1a}-24.13\%}$
test_serialize_weights_pickle 1.3715s 1.2149s 0.8231 Ops/s 0.8405 Ops/s $\color{#d91a1a}-2.07\%$
test_reshape_pytree 0.5634ms 33.1824μs 30.1365 KOps/s 30.2694 KOps/s $\color{#d91a1a}-0.44\%$
test_reshape_td 63.6110μs 38.3971μs 26.0436 KOps/s 25.8801 KOps/s $\color{#35bf28}+0.63\%$
test_view_pytree 0.2228ms 32.9783μs 30.3229 KOps/s 30.6175 KOps/s $\color{#d91a1a}-0.96\%$
test_view_td 78.9820μs 45.1900μs 22.1288 KOps/s 22.2430 KOps/s $\color{#d91a1a}-0.51\%$
test_unbind_pytree 0.2352ms 37.6880μs 26.5336 KOps/s 26.6549 KOps/s $\color{#d91a1a}-0.45\%$
test_unbind_td 74.5810μs 48.1859μs 20.7529 KOps/s 20.5290 KOps/s $\color{#35bf28}+1.09\%$
test_split_pytree 0.2190ms 44.3881μs 22.5286 KOps/s 22.2474 KOps/s $\color{#35bf28}+1.26\%$
test_split_td 0.1199ms 67.0256μs 14.9197 KOps/s 14.8739 KOps/s $\color{#35bf28}+0.31\%$
test_add_pytree 0.2341ms 44.8191μs 22.3119 KOps/s 21.9727 KOps/s $\color{#35bf28}+1.54\%$
test_add_td 99.2410μs 55.5944μs 17.9874 KOps/s 18.0448 KOps/s $\color{#d91a1a}-0.32\%$
test_compile_add_one_nested[tensordict-compile] 0.2598ms 0.1802ms 5.5507 KOps/s 5.7004 KOps/s $\color{#d91a1a}-2.63\%$
test_compile_add_one_nested[tensordict-eager] 0.2512ms 0.1939ms 5.1570 KOps/s 5.1456 KOps/s $\color{#35bf28}+0.22\%$
test_compile_add_one_nested[pytree-compile] 0.2279ms 0.1502ms 6.6578 KOps/s 6.4644 KOps/s $\color{#35bf28}+2.99\%$
test_compile_add_one_nested[pytree-eager] 0.4338ms 0.1913ms 5.2269 KOps/s 5.3478 KOps/s $\color{#d91a1a}-2.26\%$
test_compile_copy_nested[tensordict-compile] 76.8710μs 27.7115μs 36.0861 KOps/s 38.2312 KOps/s $\textbf{\color{#d91a1a}-5.61\%}$
test_compile_copy_nested[tensordict-eager] 88.7820μs 52.7787μs 18.9470 KOps/s 18.9096 KOps/s $\color{#35bf28}+0.20\%$
test_compile_copy_nested[pytree-compile] 0.2834ms 14.5666μs 68.6503 KOps/s 67.4615 KOps/s $\color{#35bf28}+1.76\%$
test_compile_copy_nested[pytree-eager] 0.3806ms 75.8358μs 13.1864 KOps/s 12.9643 KOps/s $\color{#35bf28}+1.71\%$
test_compile_add_one_flat[tensordict-compile] 0.3150ms 0.2069ms 4.8327 KOps/s 4.7603 KOps/s $\color{#35bf28}+1.52\%$
test_compile_add_one_flat[tensordict-eager] 0.4709ms 0.2628ms 3.8052 KOps/s 3.8239 KOps/s $\color{#d91a1a}-0.49\%$
test_compile_add_one_flat[tensorclass-compile] 0.2166ms 0.1533ms 6.5245 KOps/s 6.4027 KOps/s $\color{#35bf28}+1.90\%$
test_compile_add_one_flat[tensorclass-eager] 0.2399ms 71.8522μs 13.9175 KOps/s 14.1885 KOps/s $\color{#d91a1a}-1.91\%$
test_compile_add_one_flat[pytree-compile] 0.2720ms 0.2027ms 4.9331 KOps/s 4.8797 KOps/s $\color{#35bf28}+1.10\%$
test_compile_add_one_flat[pytree-eager] 0.7898ms 0.5343ms 1.8716 KOps/s 1.8539 KOps/s $\color{#35bf28}+0.96\%$
test_compile_add_self_flat[tensordict-eager] 0.4738ms 0.3131ms 3.1936 KOps/s 3.1870 KOps/s $\color{#35bf28}+0.21\%$
test_compile_add_self_flat[tensordict-compile] 0.2530ms 0.2082ms 4.8020 KOps/s 4.5783 KOps/s $\color{#35bf28}+4.89\%$
test_compile_add_self_flat[tensorclass-eager] 0.1397ms 87.2722μs 11.4584 KOps/s 11.5027 KOps/s $\color{#d91a1a}-0.39\%$
test_compile_add_self_flat[tensorclass-compile] 0.2305ms 0.1550ms 6.4534 KOps/s 6.2214 KOps/s $\color{#35bf28}+3.73\%$
test_compile_add_self_flat[pytree-eager] 0.6464ms 0.4443ms 2.2507 KOps/s 2.2174 KOps/s $\color{#35bf28}+1.50\%$
test_compile_add_self_flat[pytree-compile] 0.3401ms 0.2024ms 4.9402 KOps/s 4.7108 KOps/s $\color{#35bf28}+4.87\%$
test_compile_copy_flat[tensordict-compile] 0.5499ms 23.2616μs 42.9893 KOps/s 40.9485 KOps/s $\color{#35bf28}+4.98\%$
test_compile_copy_flat[tensordict-eager] 71.7710μs 41.4218μs 24.1419 KOps/s 23.5778 KOps/s $\color{#35bf28}+2.39\%$
test_compile_copy_flat[pytree-compile] 0.2619ms 19.8098μs 50.4800 KOps/s 49.1600 KOps/s $\color{#35bf28}+2.69\%$
test_compile_copy_flat[pytree-eager] 0.3584ms 69.8772μs 14.3108 KOps/s 14.2180 KOps/s $\color{#35bf28}+0.65\%$
test_compile_assign_and_add[tensordict-compile] 2.0752ms 0.2125ms 4.7054 KOps/s 4.5399 KOps/s $\color{#35bf28}+3.65\%$
test_compile_assign_and_add[tensordict-eager] 3.5100ms 3.3750ms 296.2966 Ops/s 300.0359 Ops/s $\color{#d91a1a}-1.25\%$
test_compile_assign_and_add[pytree-compile] 2.0526ms 0.2097ms 4.7694 KOps/s 4.7266 KOps/s $\color{#35bf28}+0.91\%$
test_compile_assign_and_add[pytree-eager] 3.0653ms 2.9509ms 338.8821 Ops/s 346.6495 Ops/s $\color{#d91a1a}-2.24\%$
test_compile_indexing[tensor-tensordict-compile] 0.2409ms 0.1421ms 7.0384 KOps/s 6.9025 KOps/s $\color{#35bf28}+1.97\%$
test_compile_indexing[tensor-tensordict-eager] 0.2991ms 66.4621μs 15.0462 KOps/s 14.9694 KOps/s $\color{#35bf28}+0.51\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1734ms 0.1350ms 7.4082 KOps/s 7.2800 KOps/s $\color{#35bf28}+1.76\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2510ms 47.4929μs 21.0558 KOps/s 21.7317 KOps/s $\color{#d91a1a}-3.11\%$
test_compile_indexing[tensor-pytree-compile] 0.2147ms 0.1356ms 7.3735 KOps/s 7.2336 KOps/s $\color{#35bf28}+1.93\%$
test_compile_indexing[tensor-pytree-eager] 0.2294ms 47.1326μs 21.2168 KOps/s 21.8448 KOps/s $\color{#d91a1a}-2.88\%$
test_compile_indexing[slice-tensordict-compile] 0.2667ms 86.9971μs 11.4946 KOps/s 11.0186 KOps/s $\color{#35bf28}+4.32\%$
test_compile_indexing[slice-tensordict-eager] 0.2072ms 27.9410μs 35.7897 KOps/s 36.0744 KOps/s $\color{#d91a1a}-0.79\%$
test_compile_indexing[slice-tensorclass-compile] 0.2086ms 81.5085μs 12.2687 KOps/s 12.0923 KOps/s $\color{#35bf28}+1.46\%$
test_compile_indexing[slice-tensorclass-eager] 0.2402ms 24.0466μs 41.5859 KOps/s 41.6743 KOps/s $\color{#d91a1a}-0.21\%$
test_compile_indexing[slice-pytree-compile] 0.1231ms 82.7518μs 12.0843 KOps/s 12.0156 KOps/s $\color{#35bf28}+0.57\%$
test_compile_indexing[slice-pytree-eager] 0.2756ms 24.0377μs 41.6013 KOps/s 42.0237 KOps/s $\color{#d91a1a}-1.01\%$
test_compile_indexing[int-tensordict-compile] 0.1247ms 88.0086μs 11.3625 KOps/s 11.0708 KOps/s $\color{#35bf28}+2.64\%$
test_compile_indexing[int-tensordict-eager] 0.2167ms 27.7739μs 36.0050 KOps/s 36.6887 KOps/s $\color{#d91a1a}-1.86\%$
test_compile_indexing[int-tensorclass-compile] 0.2440ms 82.1558μs 12.1720 KOps/s 11.9631 KOps/s $\color{#35bf28}+1.75\%$
test_compile_indexing[int-tensorclass-eager] 0.2338ms 24.0330μs 41.6094 KOps/s 41.6321 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_indexing[int-pytree-compile] 0.1435ms 81.7368μs 12.2344 KOps/s 11.9023 KOps/s $\color{#35bf28}+2.79\%$
test_compile_indexing[int-pytree-eager] 0.2374ms 24.0477μs 41.5840 KOps/s 41.9061 KOps/s $\color{#d91a1a}-0.77\%$
test_mod_add[eager] 0.1052ms 51.9120μs 19.2634 KOps/s 19.8091 KOps/s $\color{#d91a1a}-2.75\%$
test_mod_add[compile] 0.1988ms 0.1539ms 6.4960 KOps/s 6.4633 KOps/s $\color{#35bf28}+0.51\%$
test_mod_add[compile-overhead] 0.2924ms 0.1978ms 5.0548 KOps/s 4.9652 KOps/s $\color{#35bf28}+1.81\%$
test_mod_wrap[eager] 0.3898ms 0.3142ms 3.1825 KOps/s 3.2148 KOps/s $\color{#d91a1a}-1.00\%$
test_mod_wrap[compile] 0.5710ms 0.4108ms 2.4344 KOps/s 2.4558 KOps/s $\color{#d91a1a}-0.87\%$
test_mod_wrap[compile-overhead] 7.6543ms 4.0046ms 249.7102 Ops/s 251.5819 Ops/s $\color{#d91a1a}-0.74\%$
test_mod_wrap_and_backward[eager] 1.7251ms 1.5971ms 626.1276 Ops/s 628.6910 Ops/s $\color{#d91a1a}-0.41\%$
test_mod_wrap_and_backward[compile] 1.8660ms 1.6189ms 617.7189 Ops/s 617.3394 Ops/s $\color{#35bf28}+0.06\%$
test_mod_wrap_and_backward[compile-overhead] 1.4106ms 0.9899ms 1.0102 KOps/s 1.0011 KOps/s $\color{#35bf28}+0.91\%$
test_seq_add[eager] 0.2401ms 0.1679ms 5.9559 KOps/s 6.2163 KOps/s $\color{#d91a1a}-4.19\%$
test_seq_add[compile] 0.2481ms 0.1626ms 6.1488 KOps/s 6.1246 KOps/s $\color{#35bf28}+0.40\%$
test_seq_add[compile-overhead] 0.2488ms 0.2023ms 4.9440 KOps/s 4.8946 KOps/s $\color{#35bf28}+1.01\%$
test_seq_wrap[eager] 0.6299ms 0.5543ms 1.8040 KOps/s 1.8035 KOps/s $\color{#35bf28}+0.03\%$
test_seq_wrap[compile] 0.5475ms 0.4213ms 2.3737 KOps/s 2.3650 KOps/s $\color{#35bf28}+0.37\%$
test_seq_wrap[compile-overhead] 0.3773ms 0.3134ms 3.1907 KOps/s 3.1415 KOps/s $\color{#35bf28}+1.57\%$
test_func_call_runtime[False-eager] 0.9666ms 0.8960ms 1.1161 KOps/s 1.1161 KOps/s $+0.00\%$
test_func_call_runtime[False-compile] 1.0346ms 0.9559ms 1.0461 KOps/s 1.0448 KOps/s $\color{#35bf28}+0.12\%$
test_func_call_runtime[False-compile-overhead] 0.5528ms 0.4972ms 2.0114 KOps/s 1.9837 KOps/s $\color{#35bf28}+1.40\%$
test_func_call_runtime[True-eager] 1.2046ms 1.1351ms 880.9880 Ops/s 875.3654 Ops/s $\color{#35bf28}+0.64\%$
test_func_call_runtime[True-compile] 1.0968ms 0.9774ms 1.0231 KOps/s 1.0238 KOps/s $\color{#d91a1a}-0.06\%$
test_func_call_runtime[True-compile-overhead] 0.5866ms 0.5185ms 1.9285 KOps/s 1.8996 KOps/s $\color{#35bf28}+1.52\%$
test_func_call_cm_runtime[False-eager] 1.1055ms 0.9169ms 1.0906 KOps/s 1.1186 KOps/s $\color{#d91a1a}-2.50\%$
test_func_call_cm_runtime[False-compile] 1.0297ms 0.9587ms 1.0431 KOps/s 1.0442 KOps/s $\color{#d91a1a}-0.11\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5675ms 0.5001ms 1.9996 KOps/s 1.9774 KOps/s $\color{#35bf28}+1.12\%$
test_func_call_cm_runtime[True-eager] 1.3831ms 1.2806ms 780.8661 Ops/s 770.5774 Ops/s $\color{#35bf28}+1.34\%$
test_func_call_cm_runtime[True-compile] 1.0944ms 1.0151ms 985.1282 Ops/s 989.8623 Ops/s $\color{#d91a1a}-0.48\%$
test_func_call_cm_runtime[True-compile-overhead] 0.6286ms 0.5557ms 1.7995 KOps/s 1.7913 KOps/s $\color{#35bf28}+0.46\%$
test_vmap_func_call_cm_runtime[eager] 2.8911ms 2.3996ms 416.7385 Ops/s 417.0775 Ops/s $\color{#d91a1a}-0.08\%$
test_vmap_func_call_cm_runtime[compile] 1.1587ms 1.0276ms 973.0998 Ops/s 984.8800 Ops/s $\color{#d91a1a}-1.20\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.6016ms 0.5497ms 1.8191 KOps/s 1.7918 KOps/s $\color{#35bf28}+1.52\%$
test_distributed 0.4891ms 0.1532ms 6.5275 KOps/s 6.3815 KOps/s $\color{#35bf28}+2.29\%$
test_tdmodule 0.3975ms 28.1978μs 35.4638 KOps/s 35.0767 KOps/s $\color{#35bf28}+1.10\%$
test_tdmodule_dispatch 81.8510μs 48.6125μs 20.5708 KOps/s 20.4968 KOps/s $\color{#35bf28}+0.36\%$
test_tdseq 45.8310μs 26.8380μs 37.2606 KOps/s 37.4508 KOps/s $\color{#d91a1a}-0.51\%$
test_tdseq_dispatch 80.0510μs 51.0399μs 19.5925 KOps/s 19.6866 KOps/s $\color{#d91a1a}-0.48\%$
test_instantiation_functorch 2.2240ms 2.1007ms 476.0212 Ops/s 480.1550 Ops/s $\color{#d91a1a}-0.86\%$
test_exec_functorch 0.2376ms 0.1878ms 5.3259 KOps/s 5.3964 KOps/s $\color{#d91a1a}-1.31\%$
test_exec_functional_call 0.2306ms 0.1670ms 5.9891 KOps/s 5.9871 KOps/s $\color{#35bf28}+0.03\%$
test_exec_td_decorator 0.4678ms 0.2434ms 4.1079 KOps/s 4.0953 KOps/s $\color{#35bf28}+0.31\%$
test_vmap_mlp_speed_decorator[True-True] 0.9802ms 0.8184ms 1.2219 KOps/s 1.2236 KOps/s $\color{#d91a1a}-0.14\%$
test_vmap_mlp_speed_decorator[True-False] 0.9918ms 0.8177ms 1.2229 KOps/s 1.2272 KOps/s $\color{#d91a1a}-0.35\%$
test_vmap_mlp_speed_decorator[False-True] 0.9099ms 0.7051ms 1.4183 KOps/s 1.4356 KOps/s $\color{#d91a1a}-1.20\%$
test_vmap_mlp_speed_decorator[False-False] 0.8518ms 0.7058ms 1.4168 KOps/s 1.4271 KOps/s $\color{#d91a1a}-0.72\%$
test_vmap_transformer_speed_decorator[True-True] 21.0144ms 20.7961ms 48.0860 Ops/s 48.0018 Ops/s $\color{#35bf28}+0.18\%$
test_vmap_transformer_speed_decorator[True-False] 20.9408ms 20.7756ms 48.1333 Ops/s 47.9119 Ops/s $\color{#35bf28}+0.46\%$
test_vmap_transformer_speed_decorator[False-True] 20.9919ms 20.6036ms 48.5351 Ops/s 48.4364 Ops/s $\color{#35bf28}+0.20\%$
test_vmap_transformer_speed_decorator[False-False] 21.2002ms 20.5851ms 48.5787 Ops/s 48.3134 Ops/s $\color{#35bf28}+0.55\%$
test_to_module_speed[True] 2.0478ms 1.4756ms 677.6931 Ops/s 670.8481 Ops/s $\color{#35bf28}+1.02\%$
test_to_module_speed[False] 2.0546ms 1.4453ms 691.9155 Ops/s 682.0226 Ops/s $\color{#35bf28}+1.45\%$
test_tc_init 0.1978ms 52.8026μs 18.9385 KOps/s 19.1947 KOps/s $\color{#d91a1a}-1.34\%$
test_tc_init_tensor_only 37.0100μs 15.1507μs 66.0033 KOps/s 65.2244 KOps/s $\color{#35bf28}+1.19\%$
test_tc_init_nested 0.1392ms 0.1040ms 9.6189 KOps/s 9.7231 KOps/s $\color{#d91a1a}-1.07\%$
test_tc_first_layer_tensor 82.0010μs 1.7845μs 560.3842 KOps/s 551.4037 KOps/s $\color{#35bf28}+1.63\%$
test_tc_first_layer_tensor_only 16.5477μs 0.6684μs 1.4961 MOps/s 1.4645 MOps/s $\color{#35bf28}+2.16\%$
test_tc_first_layer_tensor_set 34.5600μs 4.1635μs 240.1816 KOps/s 237.8911 KOps/s $\color{#35bf28}+0.96\%$
test_tc_first_layer_tensor_only_set 28.5605μs 2.9944μs 333.9514 KOps/s 327.7957 KOps/s $\color{#35bf28}+1.88\%$
test_tc_first_layer_nontensor 23.4110μs 5.9807μs 167.2038 KOps/s 166.0246 KOps/s $\color{#35bf28}+0.71\%$
test_tc_second_layer_tensor 22.9710μs 4.3550μs 229.6234 KOps/s 231.2049 KOps/s $\color{#d91a1a}-0.68\%$
test_tc_second_layer_nontensor 39.4900μs 8.4938μs 117.7328 KOps/s 117.9863 KOps/s $\color{#d91a1a}-0.21\%$
test_unbind 0.2790s 13.4727ms 74.2243 Ops/s 52.9750 Ops/s $\textbf{\color{#35bf28}+40.11\%}$
test_full_like 5.5853ms 4.3924ms 227.6649 Ops/s 94.4908 Ops/s $\textbf{\color{#35bf28}+140.94\%}$
test_zeros_like 4.4979ms 4.3708ms 228.7891 Ops/s 94.7576 Ops/s $\textbf{\color{#35bf28}+141.45\%}$
test_ones_like 4.5696ms 4.3781ms 228.4103 Ops/s 94.5255 Ops/s $\textbf{\color{#35bf28}+141.64\%}$
test_clone 7.1167ms 6.4156ms 155.8698 Ops/s 82.2217 Ops/s $\textbf{\color{#35bf28}+89.57\%}$
test_squeeze 0.1905ms 14.7432μs 67.8279 KOps/s 68.0735 KOps/s $\color{#d91a1a}-0.36\%$
test_unsqueeze 0.1604ms 0.1094ms 9.1406 KOps/s 9.1087 KOps/s $\color{#35bf28}+0.35\%$
test_split 0.3604ms 0.1848ms 5.4100 KOps/s 5.3702 KOps/s $\color{#35bf28}+0.74\%$
test_permute 0.3755ms 0.2054ms 4.8682 KOps/s 4.6531 KOps/s $\color{#35bf28}+4.62\%$
test_stack 53.0902ms 51.3955ms 19.4569 Ops/s 19.4517 Ops/s $\color{#35bf28}+0.03\%$
test_cat 51.6144ms 51.2728ms 19.5035 Ops/s 19.4468 Ops/s $\color{#35bf28}+0.29\%$

@github-actions
Copy link

github-actions bot commented Oct 24, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 233. Improved: $\large\color{#35bf28}9$. Worsened: $\large\color{#d91a1a}10$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 30.5100μs 14.8427μs 67.3733 KOps/s 67.6462 KOps/s $\color{#d91a1a}-0.40\%$
test_plain_set_stack_nested 45.3410μs 14.9726μs 66.7888 KOps/s 66.4557 KOps/s $\color{#35bf28}+0.50\%$
test_plain_set_nested_inplace 46.2210μs 16.7065μs 59.8569 KOps/s 59.9485 KOps/s $\color{#d91a1a}-0.15\%$
test_plain_set_stack_nested_inplace 45.1310μs 16.4201μs 60.9009 KOps/s 60.5700 KOps/s $\color{#35bf28}+0.55\%$
test_items 32.3410μs 5.7887μs 172.7496 KOps/s 172.9723 KOps/s $\color{#d91a1a}-0.13\%$
test_items_nested 0.5819ms 0.5386ms 1.8565 KOps/s 1.8993 KOps/s $\color{#d91a1a}-2.25\%$
test_items_nested_locked 0.5920ms 0.5317ms 1.8807 KOps/s 1.8869 KOps/s $\color{#d91a1a}-0.33\%$
test_items_nested_leaf 0.1298ms 97.5064μs 10.2557 KOps/s 10.4034 KOps/s $\color{#d91a1a}-1.42\%$
test_items_stack_nested 0.5674ms 0.5267ms 1.8987 KOps/s 1.8603 KOps/s $\color{#35bf28}+2.06\%$
test_items_stack_nested_leaf 0.1411ms 96.9176μs 10.3180 KOps/s 10.4846 KOps/s $\color{#d91a1a}-1.59\%$
test_items_stack_nested_locked 0.6004ms 0.5366ms 1.8636 KOps/s 1.8573 KOps/s $\color{#35bf28}+0.34\%$
test_keys 26.8100μs 4.2294μs 236.4383 KOps/s 235.6501 KOps/s $\color{#35bf28}+0.33\%$
test_keys_nested 0.1613ms 0.1199ms 8.3391 KOps/s 8.3714 KOps/s $\color{#d91a1a}-0.39\%$
test_keys_nested_locked 2.1431ms 0.1302ms 7.6828 KOps/s 7.7848 KOps/s $\color{#d91a1a}-1.31\%$
test_keys_nested_leaf 0.1485ms 0.1098ms 9.1069 KOps/s 9.1906 KOps/s $\color{#d91a1a}-0.91\%$
test_keys_stack_nested 0.1652ms 0.1201ms 8.3246 KOps/s 8.3952 KOps/s $\color{#d91a1a}-0.84\%$
test_keys_stack_nested_leaf 0.1549ms 0.1099ms 9.0954 KOps/s 9.1196 KOps/s $\color{#d91a1a}-0.27\%$
test_keys_stack_nested_locked 0.1772ms 0.1295ms 7.7196 KOps/s 7.7804 KOps/s $\color{#d91a1a}-0.78\%$
test_values 6.8142μs 1.0275μs 973.2422 KOps/s 984.8391 KOps/s $\color{#d91a1a}-1.18\%$
test_values_nested 74.9310μs 48.0081μs 20.8298 KOps/s 20.8826 KOps/s $\color{#d91a1a}-0.25\%$
test_values_nested_locked 86.9610μs 51.1161μs 19.5633 KOps/s 19.4622 KOps/s $\color{#35bf28}+0.52\%$
test_values_nested_leaf 89.4820μs 55.0345μs 18.1704 KOps/s 18.4800 KOps/s $\color{#d91a1a}-1.68\%$
test_values_stack_nested 92.0010μs 48.0529μs 20.8104 KOps/s 20.9334 KOps/s $\color{#d91a1a}-0.59\%$
test_values_stack_nested_leaf 90.0620μs 54.7819μs 18.2542 KOps/s 18.4611 KOps/s $\color{#d91a1a}-1.12\%$
test_values_stack_nested_locked 85.6410μs 51.5881μs 19.3843 KOps/s 19.5735 KOps/s $\color{#d91a1a}-0.97\%$
test_membership 5.1617μs 0.8589μs 1.1643 MOps/s 1.1583 MOps/s $\color{#35bf28}+0.52\%$
test_membership_nested 56.7710μs 3.0631μs 326.4712 KOps/s 314.5349 KOps/s $\color{#35bf28}+3.79\%$
test_membership_nested_leaf 38.5000μs 3.1665μs 315.8100 KOps/s 315.3145 KOps/s $\color{#35bf28}+0.16\%$
test_membership_stacked_nested 26.3900μs 3.1690μs 315.5593 KOps/s 314.0615 KOps/s $\color{#35bf28}+0.48\%$
test_membership_stacked_nested_leaf 23.7500μs 3.2157μs 310.9789 KOps/s 316.1800 KOps/s $\color{#d91a1a}-1.64\%$
test_membership_nested_last 33.5500μs 4.6456μs 215.2568 KOps/s 214.7195 KOps/s $\color{#35bf28}+0.25\%$
test_membership_nested_leaf_last 34.7100μs 4.6549μs 214.8254 KOps/s 214.5842 KOps/s $\color{#35bf28}+0.11\%$
test_membership_stacked_nested_last 29.5000μs 4.6444μs 215.3112 KOps/s 216.0410 KOps/s $\color{#d91a1a}-0.34\%$
test_membership_stacked_nested_leaf_last 23.8510μs 4.6422μs 215.4135 KOps/s 216.2692 KOps/s $\color{#d91a1a}-0.40\%$
test_nested_getleaf 48.2710μs 21.3745μs 46.7847 KOps/s 46.4542 KOps/s $\color{#35bf28}+0.71\%$
test_nested_get 48.1510μs 20.0790μs 49.8034 KOps/s 49.1326 KOps/s $\color{#35bf28}+1.37\%$
test_stacked_getleaf 54.2410μs 21.2023μs 47.1647 KOps/s 46.1926 KOps/s $\color{#35bf28}+2.10\%$
test_stacked_get 50.2110μs 20.2693μs 49.3358 KOps/s 48.5178 KOps/s $\color{#35bf28}+1.69\%$
test_nested_getitemleaf 49.1010μs 21.8833μs 45.6969 KOps/s 44.7588 KOps/s $\color{#35bf28}+2.10\%$
test_nested_getitem 71.2120μs 20.7512μs 48.1900 KOps/s 47.0124 KOps/s $\color{#35bf28}+2.50\%$
test_stacked_getitemleaf 50.9010μs 21.8977μs 45.6668 KOps/s 45.0527 KOps/s $\color{#35bf28}+1.36\%$
test_stacked_getitem 49.0110μs 20.8911μs 47.8673 KOps/s 48.1641 KOps/s $\color{#d91a1a}-0.62\%$
test_lock_nested 0.5418ms 0.4712ms 2.1222 KOps/s 2.1207 KOps/s $\color{#35bf28}+0.07\%$
test_lock_stack_nested 0.5166ms 0.4754ms 2.1034 KOps/s 2.0714 KOps/s $\color{#35bf28}+1.55\%$
test_unlock_nested 0.4656ms 0.3815ms 2.6213 KOps/s 2.6055 KOps/s $\color{#35bf28}+0.61\%$
test_unlock_stack_nested 0.4225ms 0.3794ms 2.6358 KOps/s 2.5892 KOps/s $\color{#35bf28}+1.80\%$
test_flatten_speed 0.1596ms 0.1231ms 8.1205 KOps/s 8.2258 KOps/s $\color{#d91a1a}-1.28\%$
test_unflatten_speed 0.6624ms 0.5894ms 1.6967 KOps/s 1.6717 KOps/s $\color{#35bf28}+1.50\%$
test_common_ops 0.9185ms 0.7348ms 1.3609 KOps/s 1.3427 KOps/s $\color{#35bf28}+1.35\%$
test_creation 72.5610μs 2.7825μs 359.3938 KOps/s 363.8784 KOps/s $\color{#d91a1a}-1.23\%$
test_creation_empty 46.5800μs 9.0719μs 110.2300 KOps/s 109.7263 KOps/s $\color{#35bf28}+0.46\%$
test_creation_nested_1 35.7800μs 12.2455μs 81.6625 KOps/s 81.4618 KOps/s $\color{#35bf28}+0.25\%$
test_creation_nested_2 47.5300μs 16.1315μs 61.9906 KOps/s 61.6269 KOps/s $\color{#35bf28}+0.59\%$
test_clone 52.6310μs 13.5036μs 74.0544 KOps/s 73.3664 KOps/s $\color{#35bf28}+0.94\%$
test_getitem[int] 1.2198ms 14.1151μs 70.8461 KOps/s 70.4776 KOps/s $\color{#35bf28}+0.52\%$
test_getitem[slice_int] 0.1439ms 25.0565μs 39.9098 KOps/s 39.7459 KOps/s $\color{#35bf28}+0.41\%$
test_getitem[range] 0.1668ms 59.7935μs 16.7242 KOps/s 16.8139 KOps/s $\color{#d91a1a}-0.53\%$
test_getitem[tuple] 0.1448ms 24.2708μs 41.2018 KOps/s 41.5386 KOps/s $\color{#d91a1a}-0.81\%$
test_getitem[list] 0.1821ms 53.0042μs 18.8664 KOps/s 18.5714 KOps/s $\color{#35bf28}+1.59\%$
test_setitem_dim[int] 64.3210μs 24.1268μs 41.4477 KOps/s 40.9638 KOps/s $\color{#35bf28}+1.18\%$
test_setitem_dim[slice_int] 66.4410μs 44.4994μs 22.4722 KOps/s 22.5681 KOps/s $\color{#d91a1a}-0.43\%$
test_setitem_dim[range] 0.1413ms 87.3136μs 11.4530 KOps/s 11.6979 KOps/s $\color{#d91a1a}-2.09\%$
test_setitem_dim[tuple] 76.5020μs 41.3688μs 24.1728 KOps/s 24.2063 KOps/s $\color{#d91a1a}-0.14\%$
test_setitem 45.4310μs 18.1120μs 55.2120 KOps/s 54.0591 KOps/s $\color{#35bf28}+2.13\%$
test_set 52.2110μs 17.4090μs 57.4415 KOps/s 57.3336 KOps/s $\color{#35bf28}+0.19\%$
test_set_shared 0.5114ms 0.2001ms 4.9964 KOps/s 4.9851 KOps/s $\color{#35bf28}+0.23\%$
test_update 0.2024ms 22.2053μs 45.0343 KOps/s 45.1139 KOps/s $\color{#d91a1a}-0.18\%$
test_update_nested 71.5210μs 34.5308μs 28.9596 KOps/s 28.2082 KOps/s $\color{#35bf28}+2.66\%$
test_update__nested 0.4854ms 34.4835μs 28.9993 KOps/s 27.9644 KOps/s $\color{#35bf28}+3.70\%$
test_set_nested 55.5010μs 19.0098μs 52.6046 KOps/s 51.3197 KOps/s $\color{#35bf28}+2.50\%$
test_set_nested_new 57.0810μs 24.7630μs 40.3829 KOps/s 39.6595 KOps/s $\color{#35bf28}+1.82\%$
test_select 90.3820μs 41.5510μs 24.0668 KOps/s 23.4426 KOps/s $\color{#35bf28}+2.66\%$
test_select_nested 0.1540ms 73.8528μs 13.5405 KOps/s 13.4232 KOps/s $\color{#35bf28}+0.87\%$
test_exclude_nested 0.1343ms 98.7498μs 10.1266 KOps/s 10.0550 KOps/s $\color{#35bf28}+0.71\%$
test_empty[True] 0.4752ms 0.4360ms 2.2938 KOps/s 2.2832 KOps/s $\color{#35bf28}+0.46\%$
test_empty[False] 13.6877μs 1.3144μs 760.8054 KOps/s 748.3292 KOps/s $\color{#35bf28}+1.67\%$
test_to 0.1015ms 71.3288μs 14.0196 KOps/s 13.1379 KOps/s $\textbf{\color{#35bf28}+6.71\%}$
test_to_nonblocking 0.1136ms 65.5615μs 15.2529 KOps/s 15.1753 KOps/s $\color{#35bf28}+0.51\%$
test_unbind_speed 0.3570ms 0.3257ms 3.0701 KOps/s 3.0908 KOps/s $\color{#d91a1a}-0.67\%$
test_unbind_speed_stack0 0.3741ms 0.3227ms 3.0991 KOps/s 3.1329 KOps/s $\color{#d91a1a}-1.08\%$
test_unbind_speed_stack1 98.7036ms 0.9275ms 1.0782 KOps/s 1.1604 KOps/s $\textbf{\color{#d91a1a}-7.08\%}$
test_split 1.2146ms 1.1598ms 862.1847 Ops/s 653.4448 Ops/s $\textbf{\color{#35bf28}+31.94\%}$
test_chunk 97.2213ms 1.2168ms 821.8089 Ops/s 916.6004 Ops/s $\textbf{\color{#d91a1a}-10.34\%}$
test_consolidate[False-None] 3.9635ms 3.8974ms 256.5789 Ops/s 256.9968 Ops/s $\color{#d91a1a}-0.16\%$
test_consolidate[default-None] 2.2762ms 2.1041ms 475.2653 Ops/s 451.3463 Ops/s $\textbf{\color{#35bf28}+5.30\%}$
test_consolidate[reduce-overhead-None] 2.1261ms 2.0236ms 494.1728 Ops/s 465.1498 Ops/s $\textbf{\color{#35bf28}+6.24\%}$
test_consolidate_njt[False-None] 0.1786s 10.4697ms 95.5136 Ops/s 110.0324 Ops/s $\textbf{\color{#d91a1a}-13.20\%}$
test_to[False-False-None] 2.1965ms 2.0994ms 476.3220 Ops/s 474.7909 Ops/s $\color{#35bf28}+0.32\%$
test_to[True-False-None] 2.1763ms 1.9179ms 521.4108 Ops/s 528.4150 Ops/s $\color{#d91a1a}-1.33\%$
test_to[within-False-None] 6.1518ms 5.8777ms 170.1345 Ops/s 169.8734 Ops/s $\color{#35bf28}+0.15\%$
test_to[True-default-None] 11.7790ms 11.6313ms 85.9751 Ops/s 83.8217 Ops/s $\color{#35bf28}+2.57\%$
test_to_njt[False-False-None] 8.6667ms 8.5972ms 116.3175 Ops/s 114.3387 Ops/s $\color{#35bf28}+1.73\%$
test_to_njt[True-False-None] 7.6915ms 7.4893ms 133.5234 Ops/s 128.8256 Ops/s $\color{#35bf28}+3.65\%$
test_to_njt[within-False-None] 16.2559ms 16.1056ms 62.0904 Ops/s 60.4871 Ops/s $\color{#35bf28}+2.65\%$
test_creation[device0] 0.4055ms 0.1128ms 8.8627 KOps/s 9.2310 KOps/s $\color{#d91a1a}-3.99\%$
test_creation_from_tensor 0.4587ms 0.1112ms 8.9906 KOps/s 8.8725 KOps/s $\color{#35bf28}+1.33\%$
test_add_one[memmap_tensor0] 0.3159ms 6.6167μs 151.1329 KOps/s 148.0020 KOps/s $\color{#35bf28}+2.12\%$
test_contiguous[memmap_tensor0] 25.5800μs 0.7133μs 1.4020 MOps/s 2.0238 MOps/s $\textbf{\color{#d91a1a}-30.72\%}$
test_stack[memmap_tensor0] 30.6000μs 4.6852μs 213.4368 KOps/s 217.0035 KOps/s $\color{#d91a1a}-1.64\%$
test_memmaptd_index 1.0989ms 0.2819ms 3.5467 KOps/s 3.5137 KOps/s $\color{#35bf28}+0.94\%$
test_memmaptd_index_astensor 0.5343ms 0.3758ms 2.6611 KOps/s 2.6380 KOps/s $\color{#35bf28}+0.88\%$
test_memmaptd_index_op 0.8598ms 0.6171ms 1.6205 KOps/s 1.5939 KOps/s $\color{#35bf28}+1.67\%$
test_serialize_model 0.3095s 0.1574s 6.3548 Ops/s 7.5744 Ops/s $\textbf{\color{#d91a1a}-16.10\%}$
test_serialize_model_pickle 1.3753s 1.2158s 0.8225 Ops/s 0.8253 Ops/s $\color{#d91a1a}-0.34\%$
test_serialize_weights 0.1318s 0.1309s 7.6404 Ops/s 7.6317 Ops/s $\color{#35bf28}+0.11\%$
test_serialize_weights_returnearly 0.3991s 70.6018ms 14.1639 Ops/s 12.6530 Ops/s $\textbf{\color{#35bf28}+11.94\%}$
test_serialize_weights_pickle 1.3683s 1.2155s 0.8227 Ops/s 0.8229 Ops/s $\color{#d91a1a}-0.02\%$
test_reshape_pytree 0.3530ms 33.2837μs 30.0447 KOps/s 30.2841 KOps/s $\color{#d91a1a}-0.79\%$
test_reshape_td 74.2110μs 38.9652μs 25.6639 KOps/s 26.1966 KOps/s $\color{#d91a1a}-2.03\%$
test_view_pytree 0.2145ms 33.2340μs 30.0897 KOps/s 30.4654 KOps/s $\color{#d91a1a}-1.23\%$
test_view_td 76.6820μs 46.3784μs 21.5618 KOps/s 22.2517 KOps/s $\color{#d91a1a}-3.10\%$
test_unbind_pytree 0.2290ms 37.3366μs 26.7834 KOps/s 26.4810 KOps/s $\color{#35bf28}+1.14\%$
test_unbind_td 87.8310μs 49.1800μs 20.3334 KOps/s 20.5525 KOps/s $\color{#d91a1a}-1.07\%$
test_split_pytree 0.2530ms 44.3198μs 22.5633 KOps/s 22.7534 KOps/s $\color{#d91a1a}-0.84\%$
test_split_td 0.1278ms 66.0638μs 15.1369 KOps/s 15.4694 KOps/s $\color{#d91a1a}-2.15\%$
test_add_pytree 0.1749ms 45.0238μs 22.2105 KOps/s 22.4121 KOps/s $\color{#d91a1a}-0.90\%$
test_add_td 0.1042ms 54.3009μs 18.4159 KOps/s 18.4342 KOps/s $\color{#d91a1a}-0.10\%$
test_compile_add_one_nested[tensordict-compile] 0.3395ms 0.1805ms 5.5399 KOps/s 5.5891 KOps/s $\color{#d91a1a}-0.88\%$
test_compile_add_one_nested[tensordict-eager] 0.3225ms 0.1959ms 5.1048 KOps/s 5.1786 KOps/s $\color{#d91a1a}-1.43\%$
test_compile_add_one_nested[pytree-compile] 0.2087ms 0.1551ms 6.4495 KOps/s 6.3920 KOps/s $\color{#35bf28}+0.90\%$
test_compile_add_one_nested[pytree-eager] 0.4354ms 0.1958ms 5.1084 KOps/s 5.4382 KOps/s $\textbf{\color{#d91a1a}-6.06\%}$
test_compile_copy_nested[tensordict-compile] 63.8610μs 27.9040μs 35.8371 KOps/s 37.0910 KOps/s $\color{#d91a1a}-3.38\%$
test_compile_copy_nested[tensordict-eager] 80.3110μs 53.1116μs 18.8283 KOps/s 19.0037 KOps/s $\color{#d91a1a}-0.92\%$
test_compile_copy_nested[pytree-compile] 0.1128ms 14.4374μs 69.2645 KOps/s 71.1605 KOps/s $\color{#d91a1a}-2.66\%$
test_compile_copy_nested[pytree-eager] 0.3766ms 75.0621μs 13.3223 KOps/s 13.2067 KOps/s $\color{#35bf28}+0.88\%$
test_compile_add_one_flat[tensordict-compile] 0.2905ms 0.2152ms 4.6475 KOps/s 4.7024 KOps/s $\color{#d91a1a}-1.17\%$
test_compile_add_one_flat[tensordict-eager] 0.3127ms 0.2613ms 3.8264 KOps/s 3.8028 KOps/s $\color{#35bf28}+0.62\%$
test_compile_add_one_flat[tensorclass-compile] 0.2317ms 0.1544ms 6.4766 KOps/s 6.3919 KOps/s $\color{#35bf28}+1.32\%$
test_compile_add_one_flat[tensorclass-eager] 1.0724ms 74.8379μs 13.3622 KOps/s 13.9625 KOps/s $\color{#d91a1a}-4.30\%$
test_compile_add_one_flat[pytree-compile] 0.2482ms 0.2102ms 4.7581 KOps/s 4.6762 KOps/s $\color{#35bf28}+1.75\%$
test_compile_add_one_flat[pytree-eager] 0.9179ms 0.5396ms 1.8533 KOps/s 1.8555 KOps/s $\color{#d91a1a}-0.12\%$
test_compile_add_self_flat[tensordict-eager] 0.3864ms 0.3134ms 3.1907 KOps/s 3.1157 KOps/s $\color{#35bf28}+2.40\%$
test_compile_add_self_flat[tensordict-compile] 0.3056ms 0.2133ms 4.6880 KOps/s 4.5406 KOps/s $\color{#35bf28}+3.25\%$
test_compile_add_self_flat[tensorclass-eager] 0.1295ms 90.2572μs 11.0794 KOps/s 11.1289 KOps/s $\color{#d91a1a}-0.44\%$
test_compile_add_self_flat[tensorclass-compile] 0.2408ms 0.1621ms 6.1699 KOps/s 6.1652 KOps/s $\color{#35bf28}+0.08\%$
test_compile_add_self_flat[pytree-eager] 0.6676ms 0.4507ms 2.2187 KOps/s 2.2205 KOps/s $\color{#d91a1a}-0.08\%$
test_compile_add_self_flat[pytree-compile] 0.2712ms 0.2116ms 4.7257 KOps/s 4.6406 KOps/s $\color{#35bf28}+1.83\%$
test_compile_copy_flat[tensordict-compile] 0.5349ms 23.4970μs 42.5586 KOps/s 38.7307 KOps/s $\textbf{\color{#35bf28}+9.88\%}$
test_compile_copy_flat[tensordict-eager] 79.6310μs 40.9355μs 24.4287 KOps/s 23.9212 KOps/s $\color{#35bf28}+2.12\%$
test_compile_copy_flat[pytree-compile] 0.1460ms 19.8898μs 50.2771 KOps/s 46.7952 KOps/s $\textbf{\color{#35bf28}+7.44\%}$
test_compile_copy_flat[pytree-eager] 0.3652ms 69.3729μs 14.4149 KOps/s 14.3791 KOps/s $\color{#35bf28}+0.25\%$
test_compile_assign_and_add[tensordict-compile] 2.0621ms 0.2130ms 4.6948 KOps/s 4.4980 KOps/s $\color{#35bf28}+4.38\%$
test_compile_assign_and_add[tensordict-eager] 3.4495ms 3.3030ms 302.7592 Ops/s 305.6707 Ops/s $\color{#d91a1a}-0.95\%$
test_compile_assign_and_add[pytree-compile] 2.0511ms 0.2097ms 4.7691 KOps/s 4.7143 KOps/s $\color{#35bf28}+1.16\%$
test_compile_assign_and_add[pytree-eager] 3.0029ms 2.8784ms 347.4170 Ops/s 343.4369 Ops/s $\color{#35bf28}+1.16\%$
test_compile_indexing[tensor-tensordict-compile] 0.2140ms 0.1444ms 6.9274 KOps/s 6.8178 KOps/s $\color{#35bf28}+1.61\%$
test_compile_indexing[tensor-tensordict-eager] 0.2966ms 66.0391μs 15.1425 KOps/s 14.8699 KOps/s $\color{#35bf28}+1.83\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2209ms 0.1371ms 7.2934 KOps/s 7.2514 KOps/s $\color{#35bf28}+0.58\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2503ms 46.9413μs 21.3032 KOps/s 21.3738 KOps/s $\color{#d91a1a}-0.33\%$
test_compile_indexing[tensor-pytree-compile] 0.1949ms 0.1389ms 7.1969 KOps/s 7.2388 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_indexing[tensor-pytree-eager] 0.2711ms 46.9057μs 21.3194 KOps/s 21.3746 KOps/s $\color{#d91a1a}-0.26\%$
test_compile_indexing[slice-tensordict-compile] 0.2973ms 86.6936μs 11.5349 KOps/s 11.3026 KOps/s $\color{#35bf28}+2.06\%$
test_compile_indexing[slice-tensordict-eager] 0.2063ms 27.4915μs 36.3748 KOps/s 36.6633 KOps/s $\color{#d91a1a}-0.79\%$
test_compile_indexing[slice-tensorclass-compile] 0.1832ms 82.3967μs 12.1364 KOps/s 11.9874 KOps/s $\color{#35bf28}+1.24\%$
test_compile_indexing[slice-tensorclass-eager] 0.2347ms 24.1099μs 41.4768 KOps/s 41.6892 KOps/s $\color{#d91a1a}-0.51\%$
test_compile_indexing[slice-pytree-compile] 0.1405ms 82.6922μs 12.0930 KOps/s 11.9348 KOps/s $\color{#35bf28}+1.33\%$
test_compile_indexing[slice-pytree-eager] 0.2539ms 24.1936μs 41.3332 KOps/s 41.8610 KOps/s $\color{#d91a1a}-1.26\%$
test_compile_indexing[int-tensordict-compile] 0.1325ms 88.5837μs 11.2888 KOps/s 11.1444 KOps/s $\color{#35bf28}+1.30\%$
test_compile_indexing[int-tensordict-eager] 0.2488ms 26.9799μs 37.0646 KOps/s 37.2711 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_indexing[int-tensorclass-compile] 0.1378ms 83.3444μs 11.9984 KOps/s 11.9643 KOps/s $\color{#35bf28}+0.29\%$
test_compile_indexing[int-tensorclass-eager] 0.2377ms 24.1386μs 41.4274 KOps/s 41.5074 KOps/s $\color{#d91a1a}-0.19\%$
test_compile_indexing[int-pytree-compile] 0.1436ms 82.8489μs 12.0702 KOps/s 11.9031 KOps/s $\color{#35bf28}+1.40\%$
test_compile_indexing[int-pytree-eager] 0.2543ms 23.9170μs 41.8112 KOps/s 41.4471 KOps/s $\color{#35bf28}+0.88\%$
test_mod_add[eager] 91.3920μs 51.2318μs 19.5191 KOps/s 19.2537 KOps/s $\color{#35bf28}+1.38\%$
test_mod_add[compile] 0.2212ms 0.1541ms 6.4876 KOps/s 6.3458 KOps/s $\color{#35bf28}+2.23\%$
test_mod_add[compile-overhead] 0.2922ms 0.1984ms 5.0407 KOps/s 4.9056 KOps/s $\color{#35bf28}+2.75\%$
test_mod_wrap[eager] 0.4080ms 0.3070ms 3.2570 KOps/s 3.1693 KOps/s $\color{#35bf28}+2.77\%$
test_mod_wrap[compile] 0.5022ms 0.4021ms 2.4867 KOps/s 2.4843 KOps/s $\color{#35bf28}+0.10\%$
test_mod_wrap[compile-overhead] 6.9397ms 3.8701ms 258.3918 Ops/s 254.8519 Ops/s $\color{#35bf28}+1.39\%$
test_mod_wrap_and_backward[eager] 1.6966ms 1.5704ms 636.7642 Ops/s 640.9932 Ops/s $\color{#d91a1a}-0.66\%$
test_mod_wrap_and_backward[compile] 1.7246ms 1.6130ms 619.9536 Ops/s 618.8731 Ops/s $\color{#35bf28}+0.17\%$
test_mod_wrap_and_backward[compile-overhead] 1.3446ms 0.9872ms 1.0130 KOps/s 1.0136 KOps/s $\color{#d91a1a}-0.06\%$
test_seq_add[eager] 0.2884ms 0.1560ms 6.4116 KOps/s 5.9589 KOps/s $\textbf{\color{#35bf28}+7.60\%}$
test_seq_add[compile] 0.2451ms 0.1622ms 6.1640 KOps/s 5.9147 KOps/s $\color{#35bf28}+4.21\%$
test_seq_add[compile-overhead] 0.2570ms 0.2038ms 4.9072 KOps/s 4.8783 KOps/s $\color{#35bf28}+0.59\%$
test_seq_wrap[eager] 0.6402ms 0.5442ms 1.8375 KOps/s 1.8251 KOps/s $\color{#35bf28}+0.68\%$
test_seq_wrap[compile] 0.5259ms 0.4169ms 2.3984 KOps/s 2.3896 KOps/s $\color{#35bf28}+0.37\%$
test_seq_wrap[compile-overhead] 0.4073ms 0.3160ms 3.1647 KOps/s 3.1590 KOps/s $\color{#35bf28}+0.18\%$
test_func_call_runtime[False-eager] 0.9393ms 0.8788ms 1.1379 KOps/s 1.1245 KOps/s $\color{#35bf28}+1.19\%$
test_func_call_runtime[False-compile] 1.0277ms 0.9347ms 1.0699 KOps/s 1.0668 KOps/s $\color{#35bf28}+0.29\%$
test_func_call_runtime[False-compile-overhead] 0.5709ms 0.4967ms 2.0132 KOps/s 2.0002 KOps/s $\color{#35bf28}+0.65\%$
test_func_call_runtime[True-eager] 1.1995ms 1.1163ms 895.8184 Ops/s 883.1647 Ops/s $\color{#35bf28}+1.43\%$
test_func_call_runtime[True-compile] 1.0533ms 0.9601ms 1.0415 KOps/s 1.0300 KOps/s $\color{#35bf28}+1.12\%$
test_func_call_runtime[True-compile-overhead] 0.5852ms 0.5161ms 1.9374 KOps/s 1.9070 KOps/s $\color{#35bf28}+1.60\%$
test_func_call_cm_runtime[False-eager] 0.9341ms 0.8691ms 1.1506 KOps/s 1.0677 KOps/s $\textbf{\color{#35bf28}+7.76\%}$
test_func_call_cm_runtime[False-compile] 1.0174ms 0.9388ms 1.0652 KOps/s 1.0660 KOps/s $\color{#d91a1a}-0.07\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5786ms 0.4986ms 2.0056 KOps/s 2.0009 KOps/s $\color{#35bf28}+0.24\%$
test_func_call_cm_runtime[True-eager] 1.4021ms 1.2531ms 797.9971 Ops/s 786.1553 Ops/s $\color{#35bf28}+1.51\%$
test_func_call_cm_runtime[True-compile] 1.0822ms 0.9990ms 1.0010 KOps/s 1.0041 KOps/s $\color{#d91a1a}-0.32\%$
test_func_call_cm_runtime[True-compile-overhead] 0.6074ms 0.5532ms 1.8076 KOps/s 1.7982 KOps/s $\color{#35bf28}+0.52\%$
test_vmap_func_call_cm_runtime[eager] 2.8321ms 2.3694ms 422.0446 Ops/s 419.2800 Ops/s $\color{#35bf28}+0.66\%$
test_vmap_func_call_cm_runtime[compile] 1.0724ms 1.0070ms 993.0251 Ops/s 997.4713 Ops/s $\color{#d91a1a}-0.45\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.6182ms 0.5480ms 1.8250 KOps/s 1.8015 KOps/s $\color{#35bf28}+1.30\%$
test_distributed 0.4983ms 0.1522ms 6.5683 KOps/s 6.4642 KOps/s $\color{#35bf28}+1.61\%$
test_tdmodule 47.8000μs 26.9191μs 37.1483 KOps/s 35.5166 KOps/s $\color{#35bf28}+4.59\%$
test_tdmodule_dispatch 76.8010μs 46.9793μs 21.2860 KOps/s 20.7674 KOps/s $\color{#35bf28}+2.50\%$
test_tdseq 36.6900μs 26.0005μs 38.4609 KOps/s 37.5347 KOps/s $\color{#35bf28}+2.47\%$
test_tdseq_dispatch 0.1264ms 49.6282μs 20.1498 KOps/s 19.7914 KOps/s $\color{#35bf28}+1.81\%$
test_instantiation_functorch 2.1737ms 2.0649ms 484.2797 Ops/s 481.6044 Ops/s $\color{#35bf28}+0.56\%$
test_exec_functorch 0.2262ms 0.1847ms 5.4133 KOps/s 5.4197 KOps/s $\color{#d91a1a}-0.12\%$
test_exec_functional_call 0.2333ms 0.1635ms 6.1166 KOps/s 6.0339 KOps/s $\color{#35bf28}+1.37\%$
test_exec_td_decorator 0.4497ms 0.2404ms 4.1589 KOps/s 4.1172 KOps/s $\color{#35bf28}+1.01\%$
test_vmap_mlp_speed_decorator[True-True] 0.9841ms 0.8113ms 1.2327 KOps/s 1.2319 KOps/s $\color{#35bf28}+0.07\%$
test_vmap_mlp_speed_decorator[True-False] 0.9370ms 0.8100ms 1.2346 KOps/s 1.2360 KOps/s $\color{#d91a1a}-0.11\%$
test_vmap_mlp_speed_decorator[False-True] 0.8509ms 0.6975ms 1.4337 KOps/s 1.4100 KOps/s $\color{#35bf28}+1.68\%$
test_vmap_mlp_speed_decorator[False-False] 0.8637ms 0.6970ms 1.4347 KOps/s 1.3978 KOps/s $\color{#35bf28}+2.64\%$
test_vmap_transformer_speed_decorator[True-True] 21.7384ms 20.7148ms 48.2746 Ops/s 47.3580 Ops/s $\color{#35bf28}+1.94\%$
test_vmap_transformer_speed_decorator[True-False] 20.9443ms 20.7229ms 48.2557 Ops/s 48.2386 Ops/s $\color{#35bf28}+0.04\%$
test_vmap_transformer_speed_decorator[False-True] 20.6337ms 20.4818ms 48.8237 Ops/s 48.6907 Ops/s $\color{#35bf28}+0.27\%$
test_vmap_transformer_speed_decorator[False-False] 20.6679ms 20.5050ms 48.7686 Ops/s 48.6449 Ops/s $\color{#35bf28}+0.25\%$
test_to_module_speed[True] 1.5746ms 1.4704ms 680.0927 Ops/s 677.6017 Ops/s $\color{#35bf28}+0.37\%$
test_to_module_speed[False] 1.5557ms 1.4454ms 691.8609 Ops/s 690.2575 Ops/s $\color{#35bf28}+0.23\%$
test_tc_init 0.1001ms 51.7300μs 19.3311 KOps/s 19.4626 KOps/s $\color{#d91a1a}-0.68\%$
test_tc_init_tensor_only 78.7220μs 14.7183μs 67.9426 KOps/s 66.5380 KOps/s $\color{#35bf28}+2.11\%$
test_tc_init_nested 0.1518ms 0.1019ms 9.8122 KOps/s 9.6940 KOps/s $\color{#35bf28}+1.22\%$
test_tc_first_layer_tensor 35.5510μs 1.7831μs 560.8207 KOps/s 550.3371 KOps/s $\color{#35bf28}+1.90\%$
test_tc_first_layer_tensor_only 3.3071μs 0.6927μs 1.4437 MOps/s 1.4701 MOps/s $\color{#d91a1a}-1.79\%$
test_tc_first_layer_tensor_set 29.7100μs 4.1875μs 238.8034 KOps/s 242.9403 KOps/s $\color{#d91a1a}-1.70\%$
test_tc_first_layer_tensor_only_set 17.7400μs 3.0220μs 330.9083 KOps/s 336.5747 KOps/s $\color{#d91a1a}-1.68\%$
test_tc_first_layer_nontensor 36.4510μs 5.9825μs 167.1549 KOps/s 169.7460 KOps/s $\color{#d91a1a}-1.53\%$
test_tc_second_layer_tensor 28.0700μs 4.3105μs 231.9905 KOps/s 231.0941 KOps/s $\color{#35bf28}+0.39\%$
test_tc_second_layer_nontensor 32.6510μs 8.4691μs 118.0768 KOps/s 119.1702 KOps/s $\color{#d91a1a}-0.92\%$
test_unbind 0.2575s 17.1948ms 58.1572 Ops/s 55.9122 Ops/s $\color{#35bf28}+4.02\%$
test_full_like 11.3845ms 10.6704ms 93.7168 Ops/s 246.6493 Ops/s $\textbf{\color{#d91a1a}-62.00\%}$
test_zeros_like 10.7684ms 10.6316ms 94.0592 Ops/s 227.0128 Ops/s $\textbf{\color{#d91a1a}-58.57\%}$
test_ones_like 10.9819ms 10.5822ms 94.4985 Ops/s 225.6618 Ops/s $\textbf{\color{#d91a1a}-58.12\%}$
test_clone 12.7400ms 12.2897ms 81.3690 Ops/s 149.9172 Ops/s $\textbf{\color{#d91a1a}-45.72\%}$
test_squeeze 0.1396ms 14.3146μs 69.8588 KOps/s 70.1116 KOps/s $\color{#d91a1a}-0.36\%$
test_unsqueeze 0.1572ms 0.1059ms 9.4403 KOps/s 9.4941 KOps/s $\color{#d91a1a}-0.57\%$
test_split 0.2316ms 0.1819ms 5.4979 KOps/s 5.4856 KOps/s $\color{#35bf28}+0.22\%$
test_permute 0.2562ms 0.2026ms 4.9362 KOps/s 4.8919 KOps/s $\color{#35bf28}+0.91\%$
test_stack 52.3303ms 52.0307ms 19.2194 Ops/s 19.1522 Ops/s $\color{#35bf28}+0.35\%$
test_cat 52.4489ms 51.7871ms 19.3098 Ops/s 19.1329 Ops/s $\color{#35bf28}+0.92\%$

@vmoens vmoens added the Formatting Code formatting label Oct 25, 2025
@vmoens vmoens merged commit f7b5e05 into main Oct 25, 2025
83 of 90 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Formatting Code formatting

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants