MS-AMP is an automatic mixed precision package for deep learning developed by Microsoft.
📢 v0.4.0 has been released!
Check aka.ms/msamp/doc for more details.
FP8-LM: Training FP8 Large Language Models
@misc{fp8lm,
  title={FP8-LM: Training FP8 Large Language Models},
  author={Houwen Peng and Kan Wu and Yixuan Wei and Guoshuai Zhao and Yuxiang Yang and Ze Liu and Yifan Xiong and Ziyue Yang and Bolin Ni and Jingcheng Hu and Ruihang Li and Miaosen Zhang and Chen Li and Jia Ning and Ruizhe Wang and Zheng Zhang and Shuguang Liu and Joe Chau and Han Hu and Peng Cheng},
  year={2023},
  eprint={2310.18313},
  archivePrefix={arXiv},
  primaryClass={cs.LG}
}
This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos is subject to those third parties' policies.