Efficient Inference for LLMs & MLLMs
An open-source research project from Alibaba Cloud dedicated to efficient inference for large language models (LLMs) and multimodal LLMs (MLLMs).
- ✨ Key Features
- 🔥 Latest Updates
- 📦 Installation
- ⚡ Quick Start
- 🧪 Benchmarks
- 📚 Publications
- 🤝 Contributing
- 📄 License
- ✉️ Contact
EfficientAI focuses on inference-time optimizations for LLMs and MLLMs:
| Feature | Description | Status |
|---|---|---|
| 🔹 Activation Sparsity | Dynamic sparsity methods for faster inference | ✅ LaRoSa (ICML 2025) |
| 🔹 Quantization | Post-training & quantization-aware techniques for MLLMs | ✅ MASQuant (CVPR 2026) |
| 🔹 Agentic Reasoning | Efficient tool-use and reasoning frameworks | ✅ D-CORE |
| 🔹 Reproducible Benchmarks | Standardized eval pipelines for research & production | 🔄 In Progress |
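To make the activation-sparsity row above concrete, here is a generic top-k magnitude sketch of dynamic activation sparsity: per token, only the largest activations are kept and the rest are zeroed. This is an illustrative assumption, not the LaRoSa algorithm; `sparsify_activations` and the keep ratio are made-up names for the example.

```python
import numpy as np

def sparsify_activations(x, keep_ratio=0.25):
    """Zero all but the top-`keep_ratio` fraction of activations
    (by magnitude) in each row. Generic illustration of dynamic
    activation sparsity; not the LaRoSa method itself."""
    k = max(1, int(x.shape[-1] * keep_ratio))
    # indices of the k largest-magnitude entries per row
    idx = np.argpartition(np.abs(x), -k, axis=-1)[..., -k:]
    mask = np.zeros(x.shape, dtype=bool)
    np.put_along_axis(mask, idx, True, axis=-1)
    return np.where(mask, x, 0.0)

x = np.random.randn(2, 8)
sparse = sparsify_activations(x, keep_ratio=0.25)  # 2 of 8 values kept per row
```

In practice the speedup comes from skipping the matmul columns whose activations are zero; the mask itself is only the selection step.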
<details>
<summary>📰 Changelog (Click to expand)</summary>

- **[2026-03]** 🎉 **MASQuant accepted to CVPR 2026**
  → Multimodal LLM PTQ algorithm with a SOTA accuracy-efficiency tradeoff
  📄 Paper | 💻 Code
- **[2026-02]** 🚀 **D-CORE open-sourced**
  → Efficient tool-use reasoning via dynamic computation routing
  📄 Paper | 💻 Code | 🎮 Demo
- **[2026-01]** 🏆 **LaRoSa accepted to ICML 2025**
  → Training-free activation sparsity for LLM acceleration
  📄 Paper | 💻 Code

</details>
## 📦 Installation

```bash
# Clone the repository
git clone https://github.com/alibaba/EfficientAI.git
cd EfficientAI

# Install dependencies (recommended: use conda)
pip install -r requirements.txt

# Optional: install with specific module support
# pip install -e ".[larosa]"    # for LaRoSa
# pip install -e ".[masquant]"  # for MASQuant
```