This repository was archived by the owner on Sep 3, 2025. It is now read-only.
Release v0.7.0π
- Further optimization for running FP8, and INT8 quantization.
- Support searching automatic calibration dataset batch size for running FMO.
- Support [AWQ(Activation-aware Weight Quantization)].