Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
-
Updated
May 8, 2024 - Python
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
Code for paper " AdderNet: Do We Really Need Multiplications in Deep Learning?"
EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
"LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
[CVPR 2021] Exploring Sparsity in Image Super-Resolution for Efficient Inference
[ICLR 2022] Code for Graph-less Neural Networks: Teaching Old MLPs New Tricks via Distillation (GLNN)
(CVPR 2021, Oral) Dynamic Slimmable Network
[ECCV2022] Efficient Long-Range Attention Network for Image Super-resolution
[ECCV 2022] Official implementation of the paper "DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation"
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Explorations into some recent techniques surrounding speculative decoding
Soft Threshold Weight Reparameterization for Learnable Sparsity
Code for WF-IoT paper 'TinyML Benchmark: Executing Fully Connected Neural Networks on Commodity Microcontrollers'
Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.
[NeurIPS'23] Speculative Decoding with Big Little Decoder
[ECCV 2020] Code release for "Resolution Switchable Networks for Runtime Efficient Image Recognition"
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
Add a description, image, and links to the efficient-inference topic page so that developers can more easily learn about it.
To associate your repository with the efficient-inference topic, visit your repo's landing page and select "manage topics."