### 🚀 The feature, motivation and pitch See https://github.com/pytorch/pytorch/issues/143913 . This is blocking improved CPU performance for bfloat16 decoding on Mac; setting up an issue on torchchat side to track. ### Alternatives robust setup for decoding using accelerators on Mac ### Additional context _No response_ ### RFC (Optional) _No response_