- 
                Notifications
    
You must be signed in to change notification settings  - Fork 103
 
[Feature] Add expln map to NormalParamExtractor #1204
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
          
     Merged
      
      
    
                
     Merged
            
            
          Conversation
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
    
          
 | 
    
| Name | Max | Mean | Ops | Ops on Repo HEAD
 | 
Change | 
|---|---|---|---|---|---|
| test_plain_set_nested | 52.0970μs | 20.4415μs | 48.9200 KOps/s | 48.0218 KOps/s | |
| test_plain_set_stack_nested | 58.1480μs | 20.6880μs | 48.3371 KOps/s | 47.8329 KOps/s | |
| test_plain_set_nested_inplace | 75.5710μs | 22.5180μs | 44.4089 KOps/s | 43.1632 KOps/s | |
| test_plain_set_stack_nested_inplace | 56.6850μs | 22.3730μs | 44.6968 KOps/s | 43.2862 KOps/s | |
| test_items | 27.2510μs | 4.1882μs | 238.7667 KOps/s | 237.0213 KOps/s | |
| test_items_nested | 0.7319ms | 0.4107ms | 2.4350 KOps/s | 2.4702 KOps/s | |
| test_items_nested_locked | 0.5021ms | 0.4082ms | 2.4497 KOps/s | 2.4488 KOps/s | |
| test_items_nested_leaf | 0.1318ms | 77.0039μs | 12.9864 KOps/s | 12.8166 KOps/s | |
| test_items_stack_nested | 0.7206ms | 0.4142ms | 2.4142 KOps/s | 2.4653 KOps/s | |
| test_items_stack_nested_leaf | 0.1420ms | 79.7586μs | 12.5378 KOps/s | 12.5823 KOps/s | |
| test_items_stack_nested_locked | 0.5522ms | 0.4105ms | 2.4361 KOps/s | 2.4276 KOps/s | |
| test_keys | 28.4230μs | 3.4591μs | 289.0959 KOps/s | 244.9198 KOps/s | |
| test_keys_nested | 0.2289ms | 0.1646ms | 6.0765 KOps/s | 6.0039 KOps/s | |
| test_keys_nested_locked | 1.8461ms | 0.1725ms | 5.7981 KOps/s | 5.8216 KOps/s | |
| test_keys_nested_leaf | 0.2779ms | 0.1451ms | 6.8913 KOps/s | 6.9528 KOps/s | |
| test_keys_stack_nested | 0.2531ms | 0.1642ms | 6.0919 KOps/s | 6.1246 KOps/s | |
| test_keys_stack_nested_leaf | 0.2666ms | 0.1417ms | 7.0554 KOps/s | 7.1466 KOps/s | |
| test_keys_stack_nested_locked | 0.2282ms | 0.1707ms | 5.8598 KOps/s | 5.9597 KOps/s | |
| test_values | 8.7684μs | 1.0291μs | 971.7333 KOps/s | 1.0105 MOps/s | |
| test_values_nested | 94.0150μs | 62.4023μs | 16.0250 KOps/s | 15.6637 KOps/s | |
| test_values_nested_locked | 0.1045ms | 62.3534μs | 16.0376 KOps/s | 15.5095 KOps/s | |
| test_values_nested_leaf | 0.1156ms | 71.5548μs | 13.9753 KOps/s | 13.5632 KOps/s | |
| test_values_stack_nested | 0.1174ms | 64.1085μs | 15.5986 KOps/s | 15.0256 KOps/s | |
| test_values_stack_nested_leaf | 0.1204ms | 70.9244μs | 14.0995 KOps/s | 14.2233 KOps/s | |
| test_values_stack_nested_locked | 0.1223ms | 63.9546μs | 15.6361 KOps/s | 15.4815 KOps/s | |
| test_membership | 14.3370μs | 0.8976μs | 1.1141 MOps/s | 1.3988 MOps/s | |
| test_membership_nested | 42.8600μs | 2.9161μs | 342.9205 KOps/s | 347.1683 KOps/s | |
| test_membership_nested_leaf | 32.3510μs | 2.9443μs | 339.6442 KOps/s | 346.6478 KOps/s | |
| test_membership_stacked_nested | 30.1670μs | 2.8815μs | 347.0384 KOps/s | 350.9656 KOps/s | |
| test_membership_stacked_nested_leaf | 29.8750μs | 2.9176μs | 342.7418 KOps/s | 343.7231 KOps/s | |
| test_membership_nested_last | 43.9120μs | 4.3961μs | 227.4734 KOps/s | 231.1087 KOps/s | |
| test_membership_nested_leaf_last | 28.1530μs | 4.3682μs | 228.9259 KOps/s | 227.2968 KOps/s | |
| test_membership_stacked_nested_last | 21.4500μs | 4.3575μs | 229.4911 KOps/s | 73.8947 KOps/s | |
| test_membership_stacked_nested_leaf_last | 29.4350μs | 4.3363μs | 230.6114 KOps/s | 74.2295 KOps/s | |
| test_nested_getleaf | 42.8990μs | 10.7752μs | 92.8061 KOps/s | 93.8122 KOps/s | |
| test_nested_get | 37.0390μs | 10.2586μs | 97.4794 KOps/s | 99.1441 KOps/s | |
| test_stacked_getleaf | 36.9390μs | 10.6407μs | 93.9784 KOps/s | 95.4771 KOps/s | |
| test_stacked_get | 52.1270μs | 10.1767μs | 98.2634 KOps/s | 99.1616 KOps/s | |
| test_nested_getitemleaf | 35.1460μs | 11.4585μs | 87.2711 KOps/s | 88.5132 KOps/s | |
| test_nested_getitem | 35.8870μs | 11.1134μs | 89.9815 KOps/s | 93.8409 KOps/s | |
| test_stacked_getitemleaf | 36.3680μs | 11.4113μs | 87.6326 KOps/s | 89.8498 KOps/s | |
| test_stacked_getitem | 51.7460μs | 10.9259μs | 91.5258 KOps/s | 92.9548 KOps/s | |
| test_lock_nested | 0.5452ms | 0.4079ms | 2.4515 KOps/s | 2.4559 KOps/s | |
| test_lock_stack_nested | 0.6649ms | 0.4172ms | 2.3968 KOps/s | 2.4262 KOps/s | |
| test_unlock_nested | 0.4439ms | 0.3357ms | 2.9790 KOps/s | 2.9874 KOps/s | |
| test_unlock_stack_nested | 0.4217ms | 0.3367ms | 2.9698 KOps/s | 3.0332 KOps/s | |
| test_flatten_speed | 0.1777ms | 98.7451μs | 10.1271 KOps/s | 9.9575 KOps/s | |
| test_unflatten_speed | 1.2829ms | 0.5347ms | 1.8701 KOps/s | 1.9015 KOps/s | |
| test_common_ops | 1.0405ms | 0.7997ms | 1.2505 KOps/s | 1.2434 KOps/s | |
| test_creation | 53.0990μs | 2.5093μs | 398.5174 KOps/s | 405.2768 KOps/s | |
| test_creation_empty | 33.5520μs | 11.5125μs | 86.8624 KOps/s | 80.2320 KOps/s | |
| test_creation_nested_1 | 42.2890μs | 14.6654μs | 68.1879 KOps/s | 65.7949 KOps/s | |
| test_creation_nested_2 | 56.8860μs | 18.9702μs | 52.7142 KOps/s | 49.8028 KOps/s | |
| test_clone | 91.7420μs | 13.5384μs | 73.8642 KOps/s | 74.1110 KOps/s | |
| test_getitem[int] | 0.8287ms | 12.6715μs | 78.9173 KOps/s | 79.0716 KOps/s | |
| test_getitem[slice_int] | 0.1373ms | 24.3975μs | 40.9879 KOps/s | 41.9792 KOps/s | |
| test_getitem[range] | 0.2196ms | 50.1497μs | 19.9403 KOps/s | 20.0456 KOps/s | |
| test_getitem[tuple] | 0.2914ms | 20.2340μs | 49.4218 KOps/s | 50.2621 KOps/s | |
| test_getitem[list] | 0.1866ms | 45.2656μs | 22.0918 KOps/s | 22.4857 KOps/s | |
| test_setitem_dim[int] | 63.4680μs | 25.8157μs | 38.7361 KOps/s | 39.5267 KOps/s | |
| test_setitem_dim[slice_int] | 0.1030ms | 51.4709μs | 19.4285 KOps/s | 20.1137 KOps/s | |
| test_setitem_dim[range] | 0.1346ms | 78.0192μs | 12.8174 KOps/s | 13.0890 KOps/s | |
| test_setitem_dim[tuple] | 81.2210μs | 40.2355μs | 24.8537 KOps/s | 25.3783 KOps/s | |
| test_setitem | 68.2070μs | 20.1013μs | 49.7479 KOps/s | 49.0752 KOps/s | |
| test_set | 97.0000μs | 19.6228μs | 50.9612 KOps/s | 49.8050 KOps/s | |
| test_set_shared | 3.4206ms | 0.1888ms | 5.2961 KOps/s | 5.4602 KOps/s | |
| test_update | 0.1590ms | 22.3209μs | 44.8010 KOps/s | 42.2234 KOps/s | |
| test_update_nested | 0.5126ms | 33.0830μs | 30.2270 KOps/s | 29.2405 KOps/s | |
| test_update__nested | 0.4134ms | 34.3463μs | 29.1152 KOps/s | 29.3531 KOps/s | |
| test_set_nested | 83.3650μs | 21.7141μs | 46.0530 KOps/s | 43.8238 KOps/s | |
| test_set_nested_new | 0.1028ms | 26.0400μs | 38.4024 KOps/s | 37.0305 KOps/s | |
| test_select | 0.1278ms | 42.8103μs | 23.3589 KOps/s | 23.2688 KOps/s | |
| test_select_nested | 0.1204ms | 63.8194μs | 15.6692 KOps/s | 15.8558 KOps/s | |
| test_exclude_nested | 0.2213ms | 81.6532μs | 12.2469 KOps/s | 12.2966 KOps/s | |
| test_empty[True] | 0.5841ms | 0.4129ms | 2.4221 KOps/s | 2.4384 KOps/s | |
| test_empty[False] | 9.6605μs | 1.3953μs | 716.6782 KOps/s | 725.5497 KOps/s | |
| test_unbind_speed | 0.3680ms | 0.2693ms | 3.7137 KOps/s | 3.7172 KOps/s | |
| test_unbind_speed_stack0 | 0.8305ms | 0.2680ms | 3.7316 KOps/s | 3.8223 KOps/s | |
| test_unbind_speed_stack1 | 0.1137s | 0.7254ms | 1.3785 KOps/s | 1.3498 KOps/s | |
| test_split | 0.1058s | 1.7359ms | 576.0551 Ops/s | 505.7913 Ops/s | |
| test_chunk | 0.1146s | 1.7622ms | 567.4599 Ops/s | 628.3052 Ops/s | |
| test_consolidate_njt[False-None] | 8.7315ms | 8.1857ms | 122.1643 Ops/s | 105.4272 Ops/s | |
| test_creation[device0] | 0.2342ms | 91.0046μs | 10.9885 KOps/s | 10.8180 KOps/s | |
| test_creation_from_tensor | 4.0505ms | 95.3838μs | 10.4840 KOps/s | 10.3249 KOps/s | |
| test_add_one[memmap_tensor0] | 0.1183ms | 4.9385μs | 202.4893 KOps/s | 205.8133 KOps/s | |
| test_contiguous[memmap_tensor0] | 11.4610μs | 0.5023μs | 1.9910 MOps/s | 1.9160 MOps/s | |
| test_stack[memmap_tensor0] | 18.1340μs | 3.4084μs | 293.3965 KOps/s | 281.4054 KOps/s | |
| test_memmaptd_index | 1.2718ms | 0.2250ms | 4.4446 KOps/s | 4.4607 KOps/s | |
| test_memmaptd_index_astensor | 0.5158ms | 0.3119ms | 3.2065 KOps/s | 3.2080 KOps/s | |
| test_memmaptd_index_op | 1.0605ms | 0.5732ms | 1.7447 KOps/s | 1.7190 KOps/s | |
| test_serialize_model | 0.1285s | 0.1169s | 8.5553 Ops/s | 8.5650 Ops/s | |
| test_serialize_model_pickle | 0.4439s | 0.3885s | 2.5737 Ops/s | 2.5634 Ops/s | |
| test_serialize_weights | 0.1226s | 0.1136s | 8.8059 Ops/s | 8.7177 Ops/s | |
| test_serialize_weights_returnearly | 0.1677s | 0.1588s | 6.2987 Ops/s | 5.5784 Ops/s | |
| test_serialize_weights_pickle | 0.9459s | 0.7071s | 1.4142 Ops/s | 2.3100 Ops/s | |
| test_serialize_weights_filesystem | 0.1534s | 0.1422s | 7.0325 Ops/s | 6.9548 Ops/s | |
| test_serialize_model_filesystem | 0.2585s | 0.1597s | 6.2624 Ops/s | 6.6389 Ops/s | |
| test_reshape_pytree | 0.1228ms | 26.5261μs | 37.6988 KOps/s | 37.1895 KOps/s | |
| test_reshape_td | 79.8890μs | 32.1105μs | 31.1424 KOps/s | 30.5628 KOps/s | |
| test_view_pytree | 83.2350μs | 26.4619μs | 37.7902 KOps/s | 37.7546 KOps/s | |
| test_view_td | 74.8800μs | 38.5195μs | 25.9609 KOps/s | 26.6512 KOps/s | |
| test_unbind_pytree | 97.0800μs | 29.1994μs | 34.2472 KOps/s | 32.9636 KOps/s | |
| test_unbind_td | 0.3239ms | 39.3529μs | 25.4111 KOps/s | 25.1280 KOps/s | |
| test_split_pytree | 78.5960μs | 29.3483μs | 34.0735 KOps/s | 34.0217 KOps/s | |
| test_split_td | 0.5563ms | 45.3651μs | 22.0434 KOps/s | 22.1471 KOps/s | |
| test_add_pytree | 76.0610μs | 35.4501μs | 28.2087 KOps/s | 28.3070 KOps/s | |
| test_add_td | 0.1318ms | 56.0117μs | 17.8534 KOps/s | 18.0491 KOps/s | |
| test_compile_add_one_nested[tensordict-compile] | 0.1405ms | 67.6806μs | 14.7753 KOps/s | 15.0313 KOps/s | |
| test_compile_add_one_nested[tensordict-eager] | 1.3046ms | 0.1720ms | 5.8152 KOps/s | 5.7433 KOps/s | |
| test_compile_add_one_nested[pytree-compile] | 0.1976ms | 48.1806μs | 20.7552 KOps/s | 21.7477 KOps/s | |
| test_compile_add_one_nested[pytree-eager] | 0.2235ms | 0.1178ms | 8.4906 KOps/s | 8.4570 KOps/s | |
| test_compile_copy_nested[tensordict-compile] | 0.1233ms | 28.5158μs | 35.0683 KOps/s | 35.7323 KOps/s | |
| test_compile_copy_nested[tensordict-eager] | 0.1182ms | 59.3647μs | 16.8450 KOps/s | 16.8854 KOps/s | |
| test_compile_copy_nested[pytree-compile] | 0.1602ms | 79.6633μs | 12.5528 KOps/s | 12.3485 KOps/s | |
| test_compile_copy_nested[pytree-eager] | 0.2709ms | 67.2702μs | 14.8654 KOps/s | 14.7855 KOps/s | |
| test_compile_add_one_flat[tensordict-compile] | 0.1862ms | 0.1063ms | 9.4114 KOps/s | 9.1669 KOps/s | |
| test_compile_add_one_flat[tensordict-eager] | 0.3968ms | 0.2146ms | 4.6607 KOps/s | 4.6478 KOps/s | |
| test_compile_add_one_flat[tensorclass-compile] | 0.1409ms | 47.9761μs | 20.8437 KOps/s | 20.7456 KOps/s | |
| test_compile_add_one_flat[tensorclass-eager] | 0.1636ms | 67.2870μs | 14.8617 KOps/s | 15.0583 KOps/s | |
| test_compile_add_one_flat[pytree-compile] | 0.2343ms | 0.1013ms | 9.8703 KOps/s | 9.5354 KOps/s | |
| test_compile_add_one_flat[pytree-eager] | 0.3514ms | 0.2047ms | 4.8842 KOps/s | 4.9359 KOps/s | |
| test_compile_add_self_flat[tensordict-eager] | 0.4968ms | 0.2325ms | 4.3010 KOps/s | 4.3039 KOps/s | |
| test_compile_add_self_flat[tensordict-compile] | 0.2646ms | 0.1126ms | 8.8836 KOps/s | 9.1512 KOps/s | |
| test_compile_add_self_flat[tensorclass-eager] | 0.2713ms | 63.3122μs | 15.7947 KOps/s | 15.7254 KOps/s | |
| test_compile_add_self_flat[tensorclass-compile] | 0.1498ms | 50.2487μs | 19.9010 KOps/s | 19.4344 KOps/s | |
| test_compile_add_self_flat[pytree-eager] | 0.2582ms | 0.1591ms | 6.2857 KOps/s | 6.2862 KOps/s | |
| test_compile_add_self_flat[pytree-compile] | 0.1759ms | 0.1028ms | 9.7245 KOps/s | 9.8335 KOps/s | |
| test_compile_copy_flat[tensordict-compile] | 80.4300μs | 21.9198μs | 45.6208 KOps/s | 45.3359 KOps/s | |
| test_compile_copy_flat[tensordict-eager] | 0.1586ms | 66.9330μs | 14.9403 KOps/s | 14.6197 KOps/s | |
| test_compile_copy_flat[pytree-compile] | 0.1682ms | 84.2771μs | 11.8656 KOps/s | 11.7235 KOps/s | |
| test_compile_copy_flat[pytree-eager] | 0.1469ms | 70.4169μs | 14.2011 KOps/s | 14.1523 KOps/s | |
| test_compile_assign_and_add[tensordict-compile] | 0.4369ms | 0.2171ms | 4.6058 KOps/s | 4.5995 KOps/s | |
| test_compile_assign_and_add[tensordict-eager] | 2.2558ms | 1.3735ms | 728.0494 Ops/s | 710.1649 Ops/s | |
| test_compile_assign_and_add[pytree-compile] | 0.4324ms | 0.2079ms | 4.8100 KOps/s | 4.7430 KOps/s | |
| test_compile_assign_and_add[pytree-eager] | 1.4425ms | 0.8316ms | 1.2026 KOps/s | 1.2016 KOps/s | |
| test_compile_assign_and_add_stack[compile] | 0.8567ms | 0.4598ms | 2.1749 KOps/s | 2.1733 KOps/s | |
| test_compile_assign_and_add_stack[eager] | 2.8492ms | 2.6254ms | 380.8935 Ops/s | 364.4853 Ops/s | |
| test_compile_indexing[tensor-tensordict-compile] | 0.1184ms | 40.8326μs | 24.4902 KOps/s | 25.6517 KOps/s | |
| test_compile_indexing[tensor-tensordict-eager] | 0.7257ms | 32.5015μs | 30.7678 KOps/s | 30.9190 KOps/s | |
| test_compile_indexing[tensor-tensorclass-compile] | 85.6600μs | 32.3621μs | 30.9003 KOps/s | 32.2002 KOps/s | |
| test_compile_indexing[tensor-tensorclass-eager] | 78.9070μs | 23.3807μs | 42.7703 KOps/s | 43.8037 KOps/s | |
| test_compile_indexing[tensor-pytree-compile] | 0.1016ms | 32.8233μs | 30.4661 KOps/s | 31.2910 KOps/s | |
| test_compile_indexing[tensor-pytree-eager] | 61.5750μs | 22.9216μs | 43.6270 KOps/s | 43.6843 KOps/s | |
| test_compile_indexing[slice-tensordict-compile] | 0.1976ms | 54.2678μs | 18.4271 KOps/s | 18.6921 KOps/s | |
| test_compile_indexing[slice-tensordict-eager] | 0.3798ms | 19.6972μs | 50.7688 KOps/s | 50.3265 KOps/s | |
| test_compile_indexing[slice-tensorclass-compile] | 0.1114ms | 46.3394μs | 21.5799 KOps/s | 21.7074 KOps/s | |
| test_compile_indexing[slice-tensorclass-eager] | 58.5890μs | 18.9330μs | 52.8179 KOps/s | 53.0075 KOps/s | |
| test_compile_indexing[slice-pytree-compile] | 0.1367ms | 47.5069μs | 21.0496 KOps/s | 21.0089 KOps/s | |
| test_compile_indexing[slice-pytree-eager] | 89.8990μs | 18.7265μs | 53.4003 KOps/s | 51.7819 KOps/s | |
| test_compile_indexing[int-tensordict-compile] | 0.1038ms | 56.0366μs | 17.8455 KOps/s | 18.2868 KOps/s | |
| test_compile_indexing[int-tensordict-eager] | 0.9091ms | 19.6837μs | 50.8035 KOps/s | 50.6153 KOps/s | |
| test_compile_indexing[int-tensorclass-compile] | 0.1254ms | 47.5322μs | 21.0384 KOps/s | 21.1208 KOps/s | |
| test_compile_indexing[int-tensorclass-eager] | 56.7450μs | 18.7089μs | 53.4506 KOps/s | 53.2714 KOps/s | |
| test_compile_indexing[int-pytree-compile] | 0.1063ms | 46.9733μs | 21.2887 KOps/s | 21.2230 KOps/s | |
| test_compile_indexing[int-pytree-eager] | 59.6410μs | 19.0520μs | 52.4878 KOps/s | 53.7296 KOps/s | |
| test_mod_add[eager] | 86.9020μs | 33.5756μs | 29.7835 KOps/s | 28.5175 KOps/s | |
| test_mod_add[compile] | 0.1545ms | 67.5742μs | 14.7986 KOps/s | 15.0928 KOps/s | |
| test_mod_add[compile-overhead] | 0.1483ms | 66.4127μs | 15.0574 KOps/s | 15.3763 KOps/s | |
| test_mod_wrap[eager] | 0.4591ms | 0.2211ms | 4.5224 KOps/s | 4.4680 KOps/s | |
| test_mod_wrap[compile] | 1.9948ms | 0.2268ms | 4.4091 KOps/s | 4.3245 KOps/s | |
| test_mod_wrap[compile-overhead] | 0.4270ms | 0.2267ms | 4.4114 KOps/s | 4.3704 KOps/s | |
| test_mod_wrap_and_backward[eager] | 20.0920ms | 13.5330ms | 73.8932 Ops/s | 89.1349 Ops/s | |
| test_mod_wrap_and_backward[compile] | 14.5626ms | 11.6513ms | 85.8275 Ops/s | 86.3512 Ops/s | |
| test_mod_wrap_and_backward[compile-overhead] | 21.8084ms | 11.6698ms | 85.6914 Ops/s | 86.9054 Ops/s | |
| test_seq_add[eager] | 0.1956ms | 0.1161ms | 8.6136 KOps/s | 8.4270 KOps/s | |
| test_seq_add[compile] | 0.1655ms | 77.5502μs | 12.8949 KOps/s | 12.9554 KOps/s | |
| test_seq_add[compile-overhead] | 0.1560ms | 74.8267μs | 13.3642 KOps/s | 13.2109 KOps/s | |
| test_seq_wrap[eager] | 0.7398ms | 0.4373ms | 2.2865 KOps/s | 2.2407 KOps/s | |
| test_seq_wrap[compile] | 0.4128ms | 0.2435ms | 4.1065 KOps/s | 4.0953 KOps/s | |
| test_seq_wrap[compile-overhead] | 0.3686ms | 0.2421ms | 4.1297 KOps/s | 4.1011 KOps/s | |
| test_func_call_runtime[False-eager] | 0.7503ms | 0.5265ms | 1.8994 KOps/s | 1.8371 KOps/s | |
| test_func_call_runtime[False-compile] | 0.9499ms | 0.4510ms | 2.2174 KOps/s | 2.2482 KOps/s | |
| test_func_call_runtime[False-compile-overhead] | 0.7019ms | 0.4463ms | 2.2405 KOps/s | 2.2491 KOps/s | |
| test_func_call_runtime[True-eager] | 1.4235ms | 0.7437ms | 1.3447 KOps/s | 1.3232 KOps/s | |
| test_func_call_runtime[True-compile] | 0.8427ms | 0.4683ms | 2.1353 KOps/s | 2.1612 KOps/s | |
| test_func_call_runtime[True-compile-overhead] | 0.6572ms | 0.4677ms | 2.1379 KOps/s | 2.1506 KOps/s | |
| test_func_call_cm_runtime[False-eager] | 0.6882ms | 0.5256ms | 1.9025 KOps/s | 1.8667 KOps/s | |
| test_func_call_cm_runtime[False-compile] | 0.5639ms | 0.4470ms | 2.2373 KOps/s | 2.2672 KOps/s | |
| test_func_call_cm_runtime[False-compile-overhead] | 0.5627ms | 0.4502ms | 2.2212 KOps/s | 2.2563 KOps/s | |
| test_func_call_cm_runtime[True-eager] | 1.6010ms | 0.9095ms | 1.0996 KOps/s | 1.1013 KOps/s | |
| test_func_call_cm_runtime[True-compile] | 1.0217ms | 0.7944ms | 1.2587 KOps/s | 1.2254 KOps/s | |
| test_func_call_cm_runtime[True-compile-overhead] | 1.3213ms | 0.8142ms | 1.2282 KOps/s | 1.2230 KOps/s | |
| test_vmap_func_call_cm_runtime[eager] | 3.1399ms | 1.9056ms | 524.7801 Ops/s | 518.7514 Ops/s | |
| test_vmap_func_call_cm_runtime[compile] | 0.6659ms | 0.5390ms | 1.8552 KOps/s | 1.8089 KOps/s | |
| test_vmap_func_call_cm_runtime[compile-overhead] | 1.0843ms | 0.5429ms | 1.8419 KOps/s | 1.8511 KOps/s | |
| test_distributed | 1.4882ms | 0.1290ms | 7.7511 KOps/s | 7.9098 KOps/s | |
| test_tdmodule | 0.1717ms | 26.8778μs | 37.2054 KOps/s | 38.1387 KOps/s | |
| test_tdmodule_dispatch | 85.0780μs | 48.9138μs | 20.4441 KOps/s | 20.5502 KOps/s | |
| test_tdseq | 53.5690μs | 28.9487μs | 34.5438 KOps/s | 34.0491 KOps/s | |
| test_tdseq_dispatch | 0.1097ms | 54.5744μs | 18.3236 KOps/s | 17.9562 KOps/s | |
| test_instantiation_functorch | 2.3626ms | 1.5518ms | 644.4061 Ops/s | 659.7041 Ops/s | |
| test_exec_functorch | 0.2740ms | 0.1825ms | 5.4800 KOps/s | 5.6111 KOps/s | |
| test_exec_functional_call | 0.2816ms | 0.1738ms | 5.7531 KOps/s | 5.8169 KOps/s | |
| test_exec_td_decorator | 0.5528ms | 0.2361ms | 4.2352 KOps/s | 4.2554 KOps/s | |
| test_vmap_mlp_speed_decorator[True-True] | 0.8616ms | 0.6520ms | 1.5338 KOps/s | 1.5040 KOps/s | |
| test_vmap_mlp_speed_decorator[True-False] | 0.9013ms | 0.6501ms | 1.5383 KOps/s | 1.5252 KOps/s | |
| test_vmap_mlp_speed_decorator[False-True] | 0.7680ms | 0.5247ms | 1.9060 KOps/s | 1.8762 KOps/s | |
| test_vmap_mlp_speed_decorator[False-False] | 0.7691ms | 0.5258ms | 1.9018 KOps/s | 1.8933 KOps/s | |
| test_to_module_speed[True] | 1.9948ms | 1.3341ms | 749.5609 Ops/s | 753.3954 Ops/s | |
| test_to_module_speed[False] | 2.1333ms | 1.3207ms | 757.1635 Ops/s | 768.6816 Ops/s | |
| test_tc_init | 0.1196ms | 48.5945μs | 20.5785 KOps/s | 21.2140 KOps/s | |
| test_tc_init_nested | 0.1700ms | 96.0543μs | 10.4108 KOps/s | 10.3494 KOps/s | |
| test_tc_first_layer_tensor | 21.6800μs | 1.6178μs | 618.1390 KOps/s | 645.6131 KOps/s | |
| test_tc_first_layer_nontensor | 24.3660μs | 4.9043μs | 203.9042 KOps/s | 216.6265 KOps/s | |
| test_tc_second_layer_tensor | 22.7220μs | 2.9387μs | 340.2827 KOps/s | 354.2967 KOps/s | |
| test_tc_second_layer_nontensor | 35.5060μs | 6.2679μs | 159.5421 KOps/s | 169.9330 KOps/s | |
| test_unbind | 0.2469s | 13.9410ms | 71.7310 Ops/s | 63.7388 Ops/s | |
| test_full_like | 10.5915ms | 9.7512ms | 102.5512 Ops/s | 122.6091 Ops/s | |
| test_zeros_like | 6.8490ms | 5.0063ms | 199.7487 Ops/s | 309.0673 Ops/s | |
| test_ones_like | 9.9888ms | 5.6070ms | 178.3493 Ops/s | 267.7041 Ops/s | |
| test_clone | 7.3431ms | 5.8211ms | 171.7878 Ops/s | 172.2809 Ops/s | |
| test_squeeze | 63.0170μs | 11.9940μs | 83.3749 KOps/s | 83.4761 KOps/s | |
| test_unsqueeze | 0.2909ms | 92.0072μs | 10.8687 KOps/s | 11.0203 KOps/s | |
| test_split | 0.3996ms | 0.1958ms | 5.1074 KOps/s | 5.0550 KOps/s | |
| test_permute | 0.3971ms | 0.1987ms | 5.0316 KOps/s | 5.0239 KOps/s | |
| test_stack | 35.0441ms | 26.5972ms | 37.5979 Ops/s | 36.8495 Ops/s | |
| test_cat | 35.2442ms | 26.3516ms | 37.9484 Ops/s | 37.3504 Ops/s | 
          
 | 
    
| Name | Max | Mean | Ops | Ops on Repo HEAD
 | 
Change | 
|---|---|---|---|---|---|
| test_plain_set_nested | 67.9940μs | 13.3231μs | 75.0573 KOps/s | 76.0414 KOps/s | |
| test_plain_set_stack_nested | 43.9720μs | 13.4276μs | 74.4733 KOps/s | 74.7924 KOps/s | |
| test_plain_set_nested_inplace | 45.3330μs | 14.2939μs | 69.9597 KOps/s | 70.7995 KOps/s | |
| test_plain_set_stack_nested_inplace | 45.0430μs | 14.2916μs | 69.9711 KOps/s | 69.5472 KOps/s | |
| test_items | 23.1710μs | 3.0622μs | 326.5587 KOps/s | 340.0385 KOps/s | |
| test_items_nested | 0.3991ms | 0.3609ms | 2.7706 KOps/s | 2.6899 KOps/s | |
| test_items_nested_locked | 0.4745ms | 0.3672ms | 2.7231 KOps/s | 2.6907 KOps/s | |
| test_items_nested_leaf | 0.1311ms | 57.7453μs | 17.3174 KOps/s | 16.8103 KOps/s | |
| test_items_stack_nested | 0.5695ms | 0.3647ms | 2.7421 KOps/s | 2.6940 KOps/s | |
| test_items_stack_nested_leaf | 0.2513ms | 60.4847μs | 16.5331 KOps/s | 16.5860 KOps/s | |
| test_items_stack_nested_locked | 0.4277ms | 0.3676ms | 2.7200 KOps/s | 2.6882 KOps/s | |
| test_keys | 25.2710μs | 3.4776μs | 287.5560 KOps/s | 287.6707 KOps/s | |
| test_keys_nested | 0.1464ms | 88.3899μs | 11.3135 KOps/s | 11.2814 KOps/s | |
| test_keys_nested_locked | 0.7476ms | 94.3644μs | 10.5972 KOps/s | 10.4764 KOps/s | |
| test_keys_nested_leaf | 0.1079ms | 78.3402μs | 12.7648 KOps/s | 12.4914 KOps/s | |
| test_keys_stack_nested | 0.1687ms | 88.7424μs | 11.2686 KOps/s | 11.0433 KOps/s | |
| test_keys_stack_nested_leaf | 0.1115ms | 80.7591μs | 12.3825 KOps/s | 12.2877 KOps/s | |
| test_keys_stack_nested_locked | 0.1361ms | 94.2837μs | 10.6063 KOps/s | 10.3466 KOps/s | |
| test_values | 4.4385μs | 0.8506μs | 1.1757 MOps/s | 1.1687 MOps/s | |
| test_values_nested | 64.0340μs | 37.8066μs | 26.4504 KOps/s | 26.2316 KOps/s | |
| test_values_nested_locked | 0.1505ms | 39.8187μs | 25.1139 KOps/s | 24.8516 KOps/s | |
| test_values_nested_leaf | 78.2850μs | 41.8463μs | 23.8970 KOps/s | 23.3033 KOps/s | |
| test_values_stack_nested | 70.4050μs | 38.4358μs | 26.0174 KOps/s | 25.5904 KOps/s | |
| test_values_stack_nested_leaf | 0.1202ms | 42.7433μs | 23.3955 KOps/s | 23.1286 KOps/s | |
| test_values_stack_nested_locked | 67.7740μs | 40.3583μs | 24.7781 KOps/s | 24.3446 KOps/s | |
| test_membership | 1.5776μs | 0.5129μs | 1.9498 MOps/s | 1.9707 MOps/s | |
| test_membership_nested | 0.1695ms | 2.0916μs | 478.0978 KOps/s | 481.7119 KOps/s | |
| test_membership_nested_leaf | 14.8560μs | 2.0282μs | 493.0368 KOps/s | 474.0895 KOps/s | |
| test_membership_stacked_nested | 27.2120μs | 2.0842μs | 479.7962 KOps/s | 467.3081 KOps/s | |
| test_membership_stacked_nested_leaf | 31.1220μs | 2.1207μs | 471.5323 KOps/s | 472.0077 KOps/s | |
| test_membership_nested_last | 25.1020μs | 3.1250μs | 319.9986 KOps/s | 314.1280 KOps/s | |
| test_membership_nested_leaf_last | 0.1773ms | 3.1484μs | 317.6199 KOps/s | 316.2164 KOps/s | |
| test_membership_stacked_nested_last | 0.2133ms | 8.3816μs | 119.3083 KOps/s | 315.3178 KOps/s | |
| test_membership_stacked_nested_leaf_last | 67.1340μs | 8.1259μs | 123.0636 KOps/s | 316.2777 KOps/s | |
| test_nested_getleaf | 31.3120μs | 6.1790μs | 161.8383 KOps/s | 158.3714 KOps/s | |
| test_nested_get | 33.2520μs | 5.8427μs | 171.1539 KOps/s | 168.7750 KOps/s | |
| test_stacked_getleaf | 32.9520μs | 6.1865μs | 161.6430 KOps/s | 161.6635 KOps/s | |
| test_stacked_get | 28.4320μs | 5.8511μs | 170.9084 KOps/s | 171.3038 KOps/s | |
| test_nested_getitemleaf | 37.1820μs | 6.5175μs | 153.4335 KOps/s | 152.5942 KOps/s | |
| test_nested_getitem | 35.9720μs | 6.1321μs | 163.0774 KOps/s | 161.7576 KOps/s | |
| test_stacked_getitemleaf | 38.6320μs | 6.4368μs | 155.3576 KOps/s | 157.4079 KOps/s | |
| test_stacked_getitem | 33.3320μs | 6.1831μs | 161.7310 KOps/s | 162.9081 KOps/s | |
| test_lock_nested | 9.3431ms | 0.3445ms | 2.9029 KOps/s | 2.8653 KOps/s | |
| test_lock_stack_nested | 0.3809ms | 0.3334ms | 2.9993 KOps/s | 2.8938 KOps/s | |
| test_unlock_nested | 0.3886ms | 0.2774ms | 3.6050 KOps/s | 3.5565 KOps/s | |
| test_unlock_stack_nested | 0.3150ms | 0.2723ms | 3.6731 KOps/s | 3.5554 KOps/s | |
| test_flatten_speed | 0.1113ms | 74.3569μs | 13.4487 KOps/s | 13.0827 KOps/s | |
| test_unflatten_speed | 0.3689ms | 0.3187ms | 3.1382 KOps/s | 3.0733 KOps/s | |
| test_common_ops | 0.9074ms | 0.6612ms | 1.5125 KOps/s | 1.5337 KOps/s | |
| test_creation | 78.5050μs | 1.7633μs | 567.1285 KOps/s | 560.9024 KOps/s | |
| test_creation_empty | 36.1820μs | 10.3720μs | 96.4134 KOps/s | 98.5997 KOps/s | |
| test_creation_nested_1 | 45.8030μs | 12.1478μs | 82.3191 KOps/s | 83.8740 KOps/s | |
| test_creation_nested_2 | 43.2030μs | 14.7836μs | 67.6424 KOps/s | 68.2697 KOps/s | |
| test_clone | 43.6220μs | 10.6989μs | 93.4677 KOps/s | 94.1895 KOps/s | |
| test_getitem[int] | 1.1515ms | 10.7644μs | 92.8986 KOps/s | 92.9319 KOps/s | |
| test_getitem[slice_int] | 0.1586ms | 20.8887μs | 47.8728 KOps/s | 48.5234 KOps/s | |
| test_getitem[range] | 0.1764ms | 38.5640μs | 25.9309 KOps/s | 26.7178 KOps/s | |
| test_getitem[tuple] | 0.1307ms | 18.0908μs | 55.2768 KOps/s | 54.6872 KOps/s | |
| test_getitem[list] | 0.1495ms | 33.8402μs | 29.5507 KOps/s | 27.9504 KOps/s | |
| test_setitem_dim[int] | 39.2920μs | 19.5387μs | 51.1805 KOps/s | 49.0119 KOps/s | |
| test_setitem_dim[slice_int] | 0.1407ms | 38.2986μs | 26.1106 KOps/s | 23.8054 KOps/s | |
| test_setitem_dim[range] | 75.9540μs | 53.0097μs | 18.8645 KOps/s | 16.9790 KOps/s | |
| test_setitem_dim[tuple] | 53.3130μs | 32.1116μs | 31.1414 KOps/s | 28.4926 KOps/s | |
| test_setitem | 65.0640μs | 16.4150μs | 60.9198 KOps/s | 62.8923 KOps/s | |
| test_set | 0.1435ms | 16.1040μs | 62.0963 KOps/s | 64.2172 KOps/s | |
| test_set_shared | 0.5078ms | 0.1615ms | 6.1928 KOps/s | 6.2072 KOps/s | |
| test_update | 0.2432ms | 20.1842μs | 49.5437 KOps/s | 51.5696 KOps/s | |
| test_update_nested | 0.1114ms | 25.7430μs | 38.8455 KOps/s | 40.1706 KOps/s | |
| test_update__nested | 0.4646ms | 26.2406μs | 38.1089 KOps/s | 38.7071 KOps/s | |
| test_set_nested | 0.1804ms | 17.2893μs | 57.8394 KOps/s | 58.9665 KOps/s | |
| test_set_nested_new | 0.1640ms | 19.9823μs | 50.0442 KOps/s | 51.5776 KOps/s | |
| test_select | 0.1139ms | 30.6069μs | 32.6723 KOps/s | 31.2512 KOps/s | |
| test_select_nested | 0.2250ms | 44.3123μs | 22.5671 KOps/s | 22.3403 KOps/s | |
| test_exclude_nested | 0.1824ms | 63.1212μs | 15.8425 KOps/s | 15.5960 KOps/s | |
| test_empty[True] | 0.4118ms | 0.2944ms | 3.3969 KOps/s | 3.3279 KOps/s | |
| test_empty[False] | 19.9101μs | 0.8288μs | 1.2066 MOps/s | 1.1909 MOps/s | |
| test_to | 87.2650μs | 56.8095μs | 17.6027 KOps/s | 18.0548 KOps/s | |
| test_to_nonblocking | 0.2319ms | 50.2770μs | 19.8898 KOps/s | 19.5792 KOps/s | |
| test_unbind_speed | 0.2744ms | 0.2399ms | 4.1676 KOps/s | 4.1333 KOps/s | |
| test_unbind_speed_stack0 | 0.2896ms | 0.2343ms | 4.2677 KOps/s | 4.1118 KOps/s | |
| test_unbind_speed_stack1 | 95.3980ms | 0.7299ms | 1.3700 KOps/s | 1.3585 KOps/s | |
| test_split | 96.0433ms | 1.5996ms | 625.1581 Ops/s | 626.7431 Ops/s | |
| test_chunk | 99.8153ms | 1.6149ms | 619.2259 Ops/s | 625.2888 Ops/s | |
| test_consolidate[False-None] | 2.8944ms | 2.7214ms | 367.4629 Ops/s | 369.6318 Ops/s | |
| test_consolidate[default-None] | 2.1599ms | 1.7756ms | 563.1922 Ops/s | 586.1469 Ops/s | |
| test_consolidate[reduce-overhead-None] | 2.2250ms | 1.8125ms | 551.7237 Ops/s | 573.1627 Ops/s | |
| test_consolidate_njt[False-None] | 7.2009ms | 6.7689ms | 147.7352 Ops/s | 109.3147 Ops/s | |
| test_to[False-False-None] | 2.1255ms | 1.7393ms | 574.9358 Ops/s | 570.6478 Ops/s | |
| test_to[True-False-None] | 1.8113ms | 1.4243ms | 702.1081 Ops/s | 731.0703 Ops/s | |
| test_to[within-False-None] | 4.7218ms | 4.2775ms | 233.7803 Ops/s | 239.2023 Ops/s | |
| test_to[True-default-None] | 5.6819ms | 5.3452ms | 187.0839 Ops/s | 184.1184 Ops/s | |
| test_to_njt[False-False-None] | 7.6224ms | 7.1516ms | 139.8290 Ops/s | 139.7210 Ops/s | |
| test_to_njt[True-False-None] | 6.1413ms | 5.7126ms | 175.0525 Ops/s | 174.0499 Ops/s | |
| test_to_njt[within-False-None] | 12.8776ms | 12.5819ms | 79.4793 Ops/s | 78.2099 Ops/s | |
| test_creation[device0] | 0.4840ms | 83.5052μs | 11.9753 KOps/s | 12.0928 KOps/s | |
| test_creation_from_tensor | 0.5213ms | 85.1911μs | 11.7383 KOps/s | 11.3746 KOps/s | |
| test_add_one[memmap_tensor0] | 0.4519ms | 6.8517μs | 145.9485 KOps/s | 145.9421 KOps/s | |
| test_contiguous[memmap_tensor0] | 1.7956μs | 0.4192μs | 2.3852 MOps/s | 2.3708 MOps/s | |
| test_stack[memmap_tensor0] | 36.7620μs | 4.3568μs | 229.5257 KOps/s | 229.8589 KOps/s | |
| test_memmaptd_index | 1.5311ms | 0.2400ms | 4.1658 KOps/s | 4.0895 KOps/s | |
| test_memmaptd_index_astensor | 0.4467ms | 0.3037ms | 3.2930 KOps/s | 3.2326 KOps/s | |
| test_memmaptd_index_op | 0.7636ms | 0.6133ms | 1.6305 KOps/s | 1.6193 KOps/s | |
| test_serialize_model | 0.1315s | 0.1305s | 7.6649 Ops/s | 7.6554 Ops/s | |
| test_serialize_model_pickle | 1.3510s | 1.2121s | 0.8250 Ops/s | 0.8214 Ops/s | |
| test_serialize_weights | 0.1309s | 0.1298s | 7.7014 Ops/s | 7.7198 Ops/s | |
| test_serialize_weights_returnearly | 0.5331s | 74.5449ms | 13.4147 Ops/s | 11.6212 Ops/s | |
| test_serialize_weights_pickle | 1.3690s | 1.2174s | 0.8214 Ops/s | 0.8225 Ops/s | |
| test_reshape_pytree | 0.1734ms | 22.3354μs | 44.7719 KOps/s | 44.9524 KOps/s | |
| test_reshape_td | 0.1561ms | 26.9638μs | 37.0867 KOps/s | 36.1036 KOps/s | |
| test_view_pytree | 0.1725ms | 22.3511μs | 44.7405 KOps/s | 45.7195 KOps/s | |
| test_view_td | 0.1379ms | 32.4946μs | 30.7743 KOps/s | 29.5627 KOps/s | |
| test_unbind_pytree | 71.0540μs | 27.7094μs | 36.0889 KOps/s | 35.9030 KOps/s | |
| test_unbind_td | 0.7671ms | 37.2014μs | 26.8807 KOps/s | 27.0623 KOps/s | |
| test_split_pytree | 0.1558ms | 30.1415μs | 33.1768 KOps/s | 32.7774 KOps/s | |
| test_split_td | 0.7029ms | 39.8959μs | 25.0652 KOps/s | 25.0908 KOps/s | |
| test_add_pytree | 0.1983ms | 35.2479μs | 28.3705 KOps/s | 28.4441 KOps/s | |
| test_add_td | 0.1937ms | 53.0233μs | 18.8596 KOps/s | 18.5217 KOps/s | |
| test_compile_add_one_nested[tensordict-compile] | 0.2800ms | 0.1241ms | 8.0606 KOps/s | 7.4928 KOps/s | |
| test_compile_add_one_nested[tensordict-eager] | 0.2767ms | 0.1332ms | 7.5054 KOps/s | 7.4658 KOps/s | |
| test_compile_add_one_nested[pytree-compile] | 0.2418ms | 96.1904μs | 10.3961 KOps/s | 10.2241 KOps/s | |
| test_compile_add_one_nested[pytree-eager] | 1.3553ms | 0.1497ms | 6.6806 KOps/s | 6.7193 KOps/s | |
| test_compile_copy_nested[tensordict-compile] | 0.1517ms | 23.8887μs | 41.8608 KOps/s | 41.6726 KOps/s | |
| test_compile_copy_nested[tensordict-eager] | 0.1359ms | 29.4574μs | 33.9473 KOps/s | 33.5574 KOps/s | |
| test_compile_copy_nested[pytree-compile] | 0.1437ms | 65.6878μs | 15.2235 KOps/s | 15.1114 KOps/s | |
| test_compile_copy_nested[pytree-eager] | 0.1582ms | 49.2020μs | 20.3244 KOps/s | 20.0759 KOps/s | |
| test_compile_add_one_flat[tensordict-compile] | 0.2884ms | 0.1430ms | 6.9908 KOps/s | 7.0614 KOps/s | |
| test_compile_add_one_flat[tensordict-eager] | 0.3660ms | 0.2203ms | 4.5393 KOps/s | 4.5417 KOps/s | |
| test_compile_add_one_flat[tensorclass-compile] | 0.2467ms | 98.4480μs | 10.1577 KOps/s | 10.1736 KOps/s | |
| test_compile_add_one_flat[tensorclass-eager] | 0.2057ms | 56.0175μs | 17.8516 KOps/s | 17.9026 KOps/s | |
| test_compile_add_one_flat[pytree-compile] | 0.2771ms | 0.1362ms | 7.3446 KOps/s | 7.2917 KOps/s | |
| test_compile_add_one_flat[pytree-eager] | 0.6547ms | 0.4914ms | 2.0349 KOps/s | 2.0936 KOps/s | |
| test_compile_add_self_flat[tensordict-eager] | 0.4128ms | 0.2642ms | 3.7851 KOps/s | 3.7639 KOps/s | |
| test_compile_add_self_flat[tensordict-compile] | 0.3045ms | 0.1452ms | 6.8894 KOps/s | 6.9946 KOps/s | |
| test_compile_add_self_flat[tensorclass-eager] | 0.2757ms | 69.0591μs | 14.4804 KOps/s | 14.5583 KOps/s | |
| test_compile_add_self_flat[tensorclass-compile] | 0.2828ms | 0.1027ms | 9.7409 KOps/s | 10.0496 KOps/s | |
| test_compile_add_self_flat[pytree-eager] | 0.5627ms | 0.4029ms | 2.4819 KOps/s | 2.4637 KOps/s | |
| test_compile_add_self_flat[pytree-compile] | 0.2590ms | 0.1359ms | 7.3566 KOps/s | 7.3699 KOps/s | |
| test_compile_copy_flat[tensordict-compile] | 0.1625ms | 19.1663μs | 52.1749 KOps/s | 54.0735 KOps/s | |
| test_compile_copy_flat[tensordict-eager] | 58.0440μs | 31.1345μs | 32.1187 KOps/s | 32.2866 KOps/s | |
| test_compile_copy_flat[pytree-compile] | 0.2219ms | 71.4514μs | 13.9955 KOps/s | 14.0235 KOps/s | |
| test_compile_copy_flat[pytree-eager] | 0.1227ms | 51.5313μs | 19.4057 KOps/s | 18.9400 KOps/s | |
| test_compile_assign_and_add[tensordict-compile] | 1.6763ms | 0.3989ms | 2.5067 KOps/s | 2.1660 KOps/s | |
| test_compile_assign_and_add[tensordict-eager] | 2.8626ms | 2.6490ms | 377.5003 Ops/s | 370.8943 Ops/s | |
| test_compile_assign_and_add[pytree-compile] | 1.6095ms | 0.4328ms | 2.3105 KOps/s | 2.2407 KOps/s | |
| test_compile_assign_and_add[pytree-eager] | 2.9074ms | 2.6457ms | 377.9676 Ops/s | 374.6843 Ops/s | |
| test_compile_indexing[tensor-tensordict-compile] | 0.2954ms | 0.1152ms | 8.6803 KOps/s | 8.6083 KOps/s | |
| test_compile_indexing[tensor-tensordict-eager] | 0.5604ms | 80.2777μs | 12.4568 KOps/s | 12.2158 KOps/s | |
| test_compile_indexing[tensor-tensorclass-compile] | 0.2876ms | 0.1078ms | 9.2749 KOps/s | 9.2145 KOps/s | |
| test_compile_indexing[tensor-tensorclass-eager] | 0.2499ms | 68.6118μs | 14.5747 KOps/s | 14.4026 KOps/s | |
| test_compile_indexing[tensor-pytree-compile] | 0.2826ms | 0.1088ms | 9.1921 KOps/s | 9.1715 KOps/s | |
| test_compile_indexing[tensor-pytree-eager] | 0.2713ms | 69.7197μs | 14.3431 KOps/s | 14.4571 KOps/s | |
| test_compile_indexing[slice-tensordict-compile] | 0.2667ms | 0.1057ms | 9.4603 KOps/s | 9.1928 KOps/s | |
| test_compile_indexing[slice-tensordict-eager] | 0.1557ms | 18.1550μs | 55.0812 KOps/s | 49.0639 KOps/s | |
| test_compile_indexing[slice-tensorclass-compile] | 0.2562ms | 97.0568μs | 10.3032 KOps/s | 10.1274 KOps/s | |
| test_compile_indexing[slice-tensorclass-eager] | 0.2092ms | 16.0701μs | 62.2272 KOps/s | 62.2344 KOps/s | |
| test_compile_indexing[slice-pytree-compile] | 0.2614ms | 0.1016ms | 9.8382 KOps/s | 9.7946 KOps/s | |
| test_compile_indexing[slice-pytree-eager] | 0.1496ms | 16.1708μs | 61.8400 KOps/s | 63.0452 KOps/s | |
| test_compile_indexing[int-tensordict-compile] | 0.2722ms | 0.1024ms | 9.7647 KOps/s | 9.6485 KOps/s | |
| test_compile_indexing[int-tensordict-eager] | 0.5440ms | 17.7248μs | 56.4181 KOps/s | 58.1643 KOps/s | |
| test_compile_indexing[int-tensorclass-compile] | 0.2955ms | 97.8316μs | 10.2216 KOps/s | 10.1577 KOps/s | |
| test_compile_indexing[int-tensorclass-eager] | 0.1470ms | 16.1652μs | 61.8613 KOps/s | 63.7161 KOps/s | |
| test_compile_indexing[int-pytree-compile] | 0.2828ms | 0.1008ms | 9.9224 KOps/s | 10.1742 KOps/s | |
| test_compile_indexing[int-pytree-eager] | 0.1756ms | 21.5483μs | 46.4073 KOps/s | 61.7000 KOps/s | |
| test_mod_add[eager] | 0.2193ms | 42.4751μs | 23.5432 KOps/s | 25.2986 KOps/s | |
| test_mod_add[compile] | 0.2251ms | 79.9203μs | 12.5125 KOps/s | 10.4529 KOps/s | |
| test_mod_add[compile-overhead] | 0.3288ms | 0.1681ms | 5.9475 KOps/s | 5.4686 KOps/s | |
| test_mod_wrap[eager] | 0.4008ms | 0.2554ms | 3.9161 KOps/s | 3.8466 KOps/s | |
| test_mod_wrap[compile] | 0.4483ms | 0.2881ms | 3.4707 KOps/s | 3.4366 KOps/s | |
| test_mod_wrap[compile-overhead] | 8.1842ms | 3.8498ms | 259.7540 Ops/s | 274.0820 Ops/s | |
| test_mod_wrap_and_backward[eager] | 1.6311ms | 1.4050ms | 711.7234 Ops/s | 664.8407 Ops/s | |
| test_mod_wrap_and_backward[compile] | 1.4647ms | 1.2869ms | 777.0772 Ops/s | 721.5036 Ops/s | |
| test_mod_wrap_and_backward[compile-overhead] | 1.4076ms | 0.9444ms | 1.0588 KOps/s | 956.5817 Ops/s | |
| test_seq_add[eager] | 0.2823ms | 0.1229ms | 8.1391 KOps/s | 8.3564 KOps/s | |
| test_seq_add[compile] | 0.2449ms | 89.9058μs | 11.1228 KOps/s | 11.1093 KOps/s | |
| test_seq_add[compile-overhead] | 0.2857ms | 0.1301ms | 7.6860 KOps/s | 7.6224 KOps/s | |
| test_seq_wrap[eager] | 0.6264ms | 0.4359ms | 2.2942 KOps/s | 2.2798 KOps/s | |
| test_seq_wrap[compile] | 0.4595ms | 0.3034ms | 3.2957 KOps/s | 3.2668 KOps/s | |
| test_seq_wrap[compile-overhead] | 0.4007ms | 0.2289ms | 4.3687 KOps/s | 4.3734 KOps/s | |
| test_func_call_runtime[False-eager] | 0.9099ms | 0.7470ms | 1.3387 KOps/s | 1.3320 KOps/s | |
| test_func_call_runtime[False-compile] | 0.9210ms | 0.7468ms | 1.3390 KOps/s | 1.3270 KOps/s | |
| test_func_call_runtime[False-compile-overhead] | 0.5265ms | 0.3691ms | 2.7095 KOps/s | 2.7117 KOps/s | |
| test_func_call_runtime[True-eager] | 1.0668ms | 0.9112ms | 1.0975 KOps/s | 1.0857 KOps/s | |
| test_func_call_runtime[True-compile] | 0.9236ms | 0.7662ms | 1.3051 KOps/s | 1.2962 KOps/s | |
| test_func_call_runtime[True-compile-overhead] | 0.5199ms | 0.3918ms | 2.5525 KOps/s | 2.5693 KOps/s | |
| test_func_call_cm_runtime[False-eager] | 0.9442ms | 0.7454ms | 1.3415 KOps/s | 1.3408 KOps/s | |
| test_func_call_cm_runtime[False-compile] | 0.9183ms | 0.7528ms | 1.3285 KOps/s | 1.3209 KOps/s | |
| test_func_call_cm_runtime[False-compile-overhead] | 0.5137ms | 0.3729ms | 2.6818 KOps/s | 2.6809 KOps/s | |
| test_func_call_cm_runtime[True-eager] | 1.1746ms | 1.0213ms | 979.1656 Ops/s | 973.5874 Ops/s | |
| test_func_call_cm_runtime[True-compile] | 1.2371ms | 0.9940ms | 1.0060 KOps/s | 989.2261 Ops/s | |
| test_func_call_cm_runtime[True-compile-overhead] | 1.2589ms | 1.0551ms | 947.7582 Ops/s | 947.2030 Ops/s | |
| test_vmap_func_call_cm_runtime[eager] | 2.8156ms | 2.1741ms | 459.9607 Ops/s | 462.1416 Ops/s | |
| test_vmap_func_call_cm_runtime[compile] | 1.0297ms | 0.8227ms | 1.2155 KOps/s | 1.2139 KOps/s | |
| test_vmap_func_call_cm_runtime[compile-overhead] | 0.5825ms | 0.4250ms | 2.3528 KOps/s | 2.3593 KOps/s | |
| test_distributed | 2.3227ms | 0.1780ms | 5.6179 KOps/s | 8.4751 KOps/s | |
| test_tdmodule | 59.1730μs | 21.8048μs | 45.8614 KOps/s | 47.6194 KOps/s | |
| test_tdmodule_dispatch | 0.1736ms | 39.9300μs | 25.0438 KOps/s | 26.2630 KOps/s | |
| test_tdseq | 0.1150ms | 22.6934μs | 44.0657 KOps/s | 44.9462 KOps/s | |
| test_tdseq_dispatch | 0.1866ms | 42.1553μs | 23.7218 KOps/s | 24.0994 KOps/s | |
| test_instantiation_functorch | 1.9977ms | 1.5985ms | 625.5748 Ops/s | 620.7042 Ops/s | |
| test_exec_functorch | 0.5540ms | 0.1473ms | 6.7876 KOps/s | 6.7744 KOps/s | |
| test_exec_functional_call | 0.2598ms | 0.1402ms | 7.1334 KOps/s | 7.0407 KOps/s | |
| test_exec_td_decorator | 0.7467ms | 0.2016ms | 4.9595 KOps/s | 5.1691 KOps/s | |
| test_vmap_mlp_speed_decorator[True-True] | 1.1540ms | 0.7130ms | 1.4025 KOps/s | 1.4255 KOps/s | |
| test_vmap_mlp_speed_decorator[True-False] | 1.1539ms | 0.7104ms | 1.4077 KOps/s | 1.3896 KOps/s | |
| test_vmap_mlp_speed_decorator[False-True] | 1.0350ms | 0.6186ms | 1.6166 KOps/s | 1.6415 KOps/s | |
| test_vmap_mlp_speed_decorator[False-False] | 1.0355ms | 0.6290ms | 1.5898 KOps/s | 1.6384 KOps/s | |
| test_vmap_transformer_speed_decorator[True-True] | 20.0768ms | 19.4775ms | 51.3412 Ops/s | 51.1934 Ops/s | |
| test_vmap_transformer_speed_decorator[True-False] | 20.1998ms | 19.4775ms | 51.3414 Ops/s | 51.2787 Ops/s | |
| test_vmap_transformer_speed_decorator[False-True] | 20.2398ms | 19.3401ms | 51.7060 Ops/s | 51.5652 Ops/s | |
| test_vmap_transformer_speed_decorator[False-False] | 20.2286ms | 19.7054ms | 50.7476 Ops/s | 51.6299 Ops/s | |
| test_to_module_speed[True] | 1.4602ms | 0.9754ms | 1.0252 KOps/s | 1.0134 KOps/s | |
| test_to_module_speed[False] | 1.3532ms | 0.9554ms | 1.0467 KOps/s | 1.0329 KOps/s | |
| test_tc_init | 0.1401ms | 39.0204μs | 25.6276 KOps/s | 26.4599 KOps/s | |
| test_tc_init_nested | 0.4670ms | 78.1673μs | 12.7931 KOps/s | 12.8162 KOps/s | |
| test_tc_first_layer_tensor | 29.0910μs | 0.9017μs | 1.1090 MOps/s | 1.4362 MOps/s | |
| test_tc_first_layer_nontensor | 31.4920μs | 2.2857μs | 437.4946 KOps/s | 443.4784 KOps/s | |
| test_tc_second_layer_tensor | 0.3960ms | 1.7308μs | 577.7580 KOps/s | 699.4467 KOps/s | |
| test_tc_second_layer_nontensor | 31.7920μs | 3.0220μs | 330.9084 KOps/s | 329.4095 KOps/s | |
| test_unbind | 0.2203s | 12.2300ms | 81.7664 Ops/s | 142.9326 Ops/s | |
| test_full_like | 10.1628ms | 9.6178ms | 103.9738 Ops/s | 103.1455 Ops/s | |
| test_zeros_like | 9.5923ms | 7.3401ms | 136.2373 Ops/s | 227.3392 Ops/s | |
| test_ones_like | 5.5581ms | 4.4332ms | 225.5693 Ops/s | 226.6261 Ops/s | |
| test_clone | 7.4273ms | 6.8232ms | 146.5588 Ops/s | 147.1202 Ops/s | |
| test_squeeze | 60.4740μs | 9.7512μs | 102.5515 KOps/s | 103.6642 KOps/s | |
| test_unsqueeze | 0.1884ms | 73.6454μs | 13.5786 KOps/s | 13.1917 KOps/s | |
| test_split | 0.4143ms | 0.1618ms | 6.1816 KOps/s | 6.2745 KOps/s | |
| test_permute | 0.3209ms | 0.1869ms | 5.3501 KOps/s | 5.5854 KOps/s | |
| test_stack | 53.8224ms | 53.3824ms | 18.7328 Ops/s | 19.3985 Ops/s | |
| test_cat | 53.7418ms | 53.1374ms | 18.8191 Ops/s | 19.5631 Ops/s | 
  
    Sign up for free
    to join this conversation on GitHub.
    Already have an account?
    Sign in to comment
  
      Labels
      
    CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
  Add this suggestion to a batch that can be applied as a single commit.
  This suggestion is invalid because no changes were made to the code.
  Suggestions cannot be applied while the pull request is closed.
  Suggestions cannot be applied while viewing a subset of changes.
  Only one suggestion per line can be applied in a batch.
  Add this suggestion to a batch that can be applied as a single commit.
  Applying suggestions on deleted lines is not supported.
  You must change the existing code in this line in order to create a valid suggestion.
  Outdated suggestions cannot be applied.
  This suggestion has been applied or marked resolved.
  Suggestions cannot be applied from pending reviews.
  Suggestions cannot be applied on multi-line comments.
  Suggestions cannot be applied while the pull request is queued to merge.
  Suggestion cannot be applied right now. Please check back later.
  
    
  
    
Stack from ghstack (oldest at bottom):