SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Neural Network Compression Framework for enhanced OpenVINO™ inference
LLMC is an elegant tool for LLM compression.
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
[ECCV 2024] 3D Small Object Detection with Dynamic Spatial Pruning
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
[CVPR'24] Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
Sparsity-aware deep learning inference runtime for CPUs
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
PaddleSlim is an open-source library for deep model compression and architecture search.
Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]
Characterization study repository for pruning, a popular way to compress a DL model. This repo also investigates optimal sparse tensor layouts for pruned networks.
Code for the paper "FOCIL: Finetune-and-Freeze for Online Class-Incremental Learning by Training Randomly Pruned Sparse Experts"
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
Harness for training/finding lottery tickets in PyTorch, with support for multiple pruning techniques, distributed training, FFCV, and AMP.
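Many of the projects above build on unstructured magnitude pruning: zeroing out the weights with the smallest absolute values until a target sparsity is reached. A minimal NumPy sketch of that idea (the function name and threshold logic are illustrative, not taken from any of the listed libraries):

```python
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude entries until roughly
    `sparsity` fraction of the tensor is zero."""
    k = int(weights.size * sparsity)  # number of entries to prune
    if k == 0:
        return weights.copy()
    # k-th smallest absolute value becomes the pruning threshold
    flat = np.abs(weights).ravel()
    threshold = np.partition(flat, k - 1)[k - 1]
    # keep only weights strictly above the threshold
    # (ties at the threshold are also pruned in this simple sketch)
    mask = np.abs(weights) > threshold
    return weights * mask

w = np.array([[0.5, -0.1], [0.02, -0.9]])
pruned = magnitude_prune(w, 0.5)  # zeros the two smallest-magnitude weights
```

Real toolkits (e.g. the sparsification and structural-pruning libraries listed above) go further: structured pruning removes whole channels or attention heads so dense kernels still apply, and sparsity-aware runtimes exploit the resulting zero patterns for speedup.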