🚀 The feature, motivation and pitch
TF32 support, MI300 & MI325 spec list TF32 performance numbers, but it seems there is no way of enabling it in pytorch
as per the torch compat document.
TF32 yields significant performance uplift across the board.
Alternatives
No response
Additional context
No response