ModelTC
Model Infra
Repositories
Showing 10 of 52 repositories
- LightCompress Public
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
- flash-attn-3-build Public
- LightKernel Public
- HarmoniCa Public
[ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration".