[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
On-device LLM Inference Powered by X-Bit Quantization
Official implementation of "AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising"
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
"LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
[ICLR 2022] Code for Graph-less Neural Networks: Teaching Old MLPs New Tricks via Distillation (GLNN)
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
[NeurIPS'23] Speculative Decoding with Big Little Decoder
[ICLR 2024] AutoVP: An Automated Visual Prompting Framework and Benchmark
Explorations into some recent techniques surrounding speculative decoding
[ICML 2023] Linkless Link Prediction via Relational Distillation
Official PyTorch training code of Accelerating Deep Neural Networks via Semi-Structured Activation Sparsity (ICCV2023-RCV)
EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPS 2022]
[BMVC 2022] Wide Feature Projection with Fast and Memory-Economic Attention for Efficient Image Super-Resolution
Code for Learning to Zoom and Unzoom (CVPR 2023)
Compute-efficient reinforcement learning with binary neural networks and evolution strategies.