Skip to content
@LeanModels

LeanModels

LeanModels — Making Foundation Models Leaner and Meaner

Welcome to LeanModels, an organization founded by Tianyi Zhang dedicated to making foundation models, such as LLMs and diffusion models, more memory- and compute-efficient through practical compression and inference optimization techniques.

Explore our key projects:

  • DFloat11: A lossless LLM compression framework enabling efficient GPU inference
  • Bagel-DFloat11: DFloat11-compressed version of Bagel, a unified multimodal model
  • LeanQuant: Scalable, loss-error-aware quantization for LLMs

We welcome contributors, collaborators, and feedback! If you're working on model compression or efficient inference, feel free to reach out.

Pinned Loading

  1. DFloat11 Public

    DFloat11: Lossless LLM Compression for Efficient GPU Inference

    Python 453 28

  2. Bagel-DFloat11 Public

    Forked from ByteDance-Seed/Bagel

    Python 90 7

  3. LeanQuant Public

    Code repository for ICLR 2025 paper "LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid"

    Python 17 1

Repositories

Showing 6 of 6 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…