ptq

OmidGhadami95 / EfficientNetV2_Quantization_CK

EfficientNetV2 (EfficientNetV2-B2) with int8 and fp32 quantization (QAT and PTQ) on the CK+ dataset: fine-tuning, augmentation, handling the imbalanced dataset, etc.

python tensorflow keras quantization emotion-recognition qat ckplus facial-emotion-recognition scale-down googlecolab efficientnet imbalanced-dataset quantization-aware-training post-training-quantization efficientnetv2 ptq real-time-emotion-classification real-time-emotion-detection efficientnetv2-b2

Updated May 4, 2024
Jupyter Notebook

Bobo-y / flexible-yolov5

A more readable and flexible YOLOv5 with more backbones (GCN, ResNet, ShuffleNet, MobileNet, EfficientNet, HRNet, Swin Transformer, etc.), additional modules (CBAM, DCN, and so on), and TensorRT support

sparsity backbone pytorch resnet object-detection gcn tensorrt neck qat shufflenet yolov3 cbam hrnet dcnv2 yolov5 moblienet swin-transformer triton-server ptq

Updated May 8, 2024
Python

MAGICS-LAB / OutEffHop

[ICML 2024] Outlier-Efficient Hopfield Layers for Large Transformer-Based Models

transformer outliers attention attention-mechanism outlier-removal outlier hopfield-neural-network ptq outlier-treatment modern-hopfield-networks modern-hopfield-model icml-2024 softmax-1 quantized-friendly no-op-outlier

Updated Jun 15, 2024
Python

ambideXtrous9 / Quantization-of-Models-PTQ-and-QAT

Quantization of models: Post-Training Quantization (PTQ) and Quantization-Aware Training (QAT)

keras pytorch quantization qat tflite pytorch-implementation tflite-models quantization-aware-training ptq

Updated Jul 16, 2024
Jupyter Notebook

lix19937 / tensorrt-insight

Deep insight into TensorRT

asp tensorrt qat ptq

Updated Jul 18, 2024
C++

Xilinx / brevitas

Brevitas: neural network quantization in PyTorch

fpga deep-learning pytorch neural-networks xilinx quantization hardware-acceleration qat brevitas ptq

Updated Jul 18, 2024
Python

sony / model_optimization

Model Compression Toolkit (MCT) is an open-source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers with advanced quantization and compression tools for deploying state-of-the-art neural networks.

machine-learning deep-neural-networks deep-learning neural-network tensorflow optimizer pytorch quantization qat network-quantization network-compression edge-ai ptq

Updated Jul 18, 2024
Python

ModelTC / llmc

This is the official PyTorch implementation of "LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models"

Updated Jul 20, 2024
Python

Here are 14 public repositories matching this topic...

yester31 / TensorRT_API

yester31 / TensorRT_ONNX

yester31 / Quantization_EX

BlindOver / blindover_AI

yester31 / TensorRT_Sparse

smpanaro / norm-tweaking

OmidGhadami95 / EfficientNetV2_Quantization_CK

Bobo-y / flexible-yolov5

MAGICS-LAB / OutEffHop

ambideXtrous9 / Quantization-of-Models-PTQ-and-QAT

lix19937 / tensorrt-insight

Xilinx / brevitas

sony / model_optimization

ModelTC / llmc
