Starred repositories
Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, and of generating speech in real time.
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
A Datacenter Scale Distributed Inference Serving Framework
Fast inference from large language models via speculative decoding
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
Official Repo for "Compressing Large Language Models with Automated Sub-Network Search"
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
[ASPLOS 2019] PUMA-simulator provides a detailed simulation model of a dataflow architecture built with NVM (non-volatile memory), and runs ML models compiled using the puma compiler.
[ASPLOS 2024] CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators
Fully open reproduction of DeepSeek-R1
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
UPMEM LLM Framework allows profiling PyTorch layers and functions and simulating those layers/functions with a given hardware profile.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.
Efficient and easy multi-instance LLM serving
[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…