#

pruning

Here are 254 public repositories matching this topic...

deepsparse

neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs

nlp performance computer-vision inference machinelearning pruning object-detection pretrained-models quantization cpus onnx sparsification llm-inference deepsparse

Updated Jul 19, 2024
Python

VainF / Torch-Pruning

[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs

pruning model-compression channel-pruning network-pruning structured-pruning efficient-deep-learning depgraph structural-pruning cvpr2023

Updated Jul 21, 2024
Python

666DZY666 / micronet

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、reg…

Updated Oct 6, 2021
Python

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

sparsity pruning quantization knowledge-distillation auto-tuning int8 low-precision quantization-aware-training post-training-quantization awq int4 large-language-models gptq smoothquant sparsegpt fp4 mxformat

Updated Jul 22, 2024
Python

quic / aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

open-source machine-learning opensource deep-neural-networks compression deep-learning pruning quantization auto-ml network-quantization network-compression

Updated Jul 21, 2024
Python

sparseml

neuralmagic / sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

nlp sparsity tensorflow keras pytorch deep-learning-algorithms image-classification deep-learning-library pruning object-detection transfer-learning automl computer-vision-algorithms onnx deep-learning-models sparsification pruning-algorithms smaller-models sparsification-recipes

Updated Jul 17, 2024
Python

PaddlePaddle / PaddleSlim

PaddleSlim is an open-source library for deep model compression and architecture search.

sparsity compression detection transformer segmentation pruning quantization nas bert tensorrt distillation ernie yolov5 yolov6 yolov7

Updated Jun 25, 2024
Python

tensorflow / model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

machine-learning sparsity compression deep-learning tensorflow optimization keras ml pruning quantization model-compression quantized-training quantized-neural-networks quantized-networks

Updated Jul 8, 2024
Python

open-mmlab / mmrazor

OpenMMLab Model Compression Toolbox and Benchmark.

detection pytorch classification segmentation pruning darts quantization nas knowledge-distillation spos autoslim

Updated Jun 11, 2024
Python

jacobgil / pytorch-pruning

PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference

deep-learning pytorch pruning

Updated Jul 12, 2019
Python

openvinotoolkit / nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

nlp sparsity compression deep-learning tensorflow transformers pytorch classification pruning object-detection quantization semantic-segmentation bert hawq onnx openvino mmdetection mixed-precision-training quantization-aware-training

Updated Jul 20, 2024
Python

horseee / LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.

bloom compression pruning llama language-model vicuna baichuan pruning-algorithms llm chatglm neurips-2023 llama-2

Updated Jul 19, 2024
Python

alibaba / TinyNeuralNetwork

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

deep-neural-networks deep-learning pytorch pruning model-compression model-converter quantization-aware-training post-training-quantization

Updated Jul 16, 2024
Python

SforAiDl / KD_Lib

A Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.

benchmarking data-science machine-learning pytorch deep-learning-library pruning quantization algorithm-implementations knowledge-distillation model-compression

Updated Mar 1, 2023
Python

he-y / filter-pruning-geometric-median

Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration (CVPR 2019 Oral)

pytorch pruning model-compression

Updated Aug 31, 2023
Python

princeton-nlp / LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

nlp efficiency pruning llama pre-training llm llama2

Updated Mar 4, 2024
Python

SpursLipu / YOLOv3v4-ModelCompression-MultidatasetTraining-Multibackbone

YOLO ModelCompression MultidatasetTraining

yolo pruning object-detection modelcompression mobilenetv3 quantization-aware-training multidataset

Updated Jun 21, 2022
Python

BenWhetton / keras-surgeon

Pruning and other network surgery for trained Keras models.

deep-learning keras pruning network-surgery

Updated Dec 5, 2023
Python

he-y / soft-filter-pruning

Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks

pytorch pruning model-compression

Updated Oct 2, 2019
Python

sparsezoo

neuralmagic / sparsezoo

Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

nlp computer-vision deep-learning-algorithms yolo resnet pruning transfer-learning pretrained-models quantization mobilenet deep-learning-models object-detection-model sparsification-recipe smaller-models sparse-quantized-models models-optimized

Updated Jul 19, 2024
Python

Improve this page

Add a description, image, and links to the pruning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pruning topic, visit your repo's landing page and select "manage topics."