A clean C implementation for quantizing a llama2 model and running the quantized model
quantization
google-colab
quantization-algorithms
quantization-efficient-network
large-language-models
Updated Sep 8, 2023 - C