Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
-
Updated
May 15, 2023 - Python
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight)
Add a description, image, and links to the hardware-aware topic page so that developers can more easily learn about it.
To associate your repository with the hardware-aware topic, visit your repo's landing page and select "manage topics."