A curated list for Efficient Large Language Models
Updated Jul 16, 2024
[ICML 2023] Official implementation of the ICML 2023 paper "BiBench: Benchmarking and Analyzing Network Binarization."
[NeurIPS 2023 Spotlight] Official implementation of the NeurIPS 2023 spotlight paper "QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution."
Chat with LLaMA 2 that grounds its responses in reference documents retrieved from a vector database. Runs locally using GPTQ 4-bit quantization.
Official implementation of the ICML 2023 paper OFQ-ViT.
A tutorial on model quantization using TensorFlow.
Unofficial implementation of NCNet using Flax and JAX.
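Several of the projects above revolve around low-bit (e.g. 4-bit) weight quantization. As an illustrative sketch only — not taken from any of the listed repositories — the core idea of uniform affine quantization can be written in a few lines of plain Python; function names here are hypothetical:

```python
def quantize_uniform(weights, num_bits=4):
    """Uniform affine quantization of a list of floats to num_bits integer codes.

    Illustrative sketch only: real low-bit schemes such as GPTQ add
    per-channel scaling and error compensation on top of this basic idea.
    """
    qmin, qmax = 0, 2 ** num_bits - 1
    w_min, w_max = min(weights), max(weights)
    # One scale/zero-point pair for the whole tensor (per-tensor quantization).
    scale = (w_max - w_min) / (qmax - qmin) if w_max > w_min else 1.0
    zero_point = round(qmin - w_min / scale)
    # Round to the nearest integer code and clamp into the representable range.
    q = [min(qmax, max(qmin, round(w / scale + zero_point))) for w in weights]
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Map integer codes back to approximate float values."""
    return [scale * (v - zero_point) for v in q]

weights = [-1.0, -0.5, 0.0, 0.5, 1.0]
q, scale, zp = quantize_uniform(weights, num_bits=4)
recovered = dequantize(q, scale, zp)
```

With 4 bits there are only 16 representable codes, so `recovered` approximates `weights` to within roughly one quantization step (`scale`); the tutorials and papers listed above study how to shrink that error for real networks.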