zineos

🎯

Focusing

neos zineos

🎯

Focusing

42 followers · 800 following

Achievements

Organizations

Starred repositories

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 1,061 47 Updated Mar 27, 2025

HazyResearch / lolcats

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Python 226 26 Updated Jan 31, 2025

shufangxun / LLaVA-MoD

[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation

Python 119 7 Updated Jan 22, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,301 222 Updated Mar 27, 2025

xiaomi-research / r1-aqa

🤗 R1-AQA Model: mispeech/r1-aqa

Python 207 17 Updated Mar 27, 2025

feifeibear / LLMSpeculativeSampling

Fast inference from large lauguage models via speculative decoding

Python 695 68 Updated Aug 22, 2024

MambaQuant / MambaQuant

Python 20 5 Updated Feb 21, 2025

bytedance / UI-TARS

3,553 231 Updated Feb 17, 2025

bytedance / UI-TARS-desktop

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 9,092 658 Updated Mar 27, 2025

automl / automated-sub-network-selection

Official Repo for "Compressing Large Language Models with Automated Sub-Network Search"

Python 4 2 Updated Feb 6, 2025

IntelLabs / Hardware-Aware-Automated-Machine-Learning

Python 46 9 Updated Mar 17, 2025

OpenSparseLLMs / Linear-MoE

Python 72 5 Updated Mar 25, 2025

chenweiphd / DeepSeek-MoE-ResourceMap

124 9 Updated Feb 17, 2025

MiniMax-AI / MiniMax-01

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,416 178 Updated Mar 18, 2025

Aayush-Ankit / puma-simulator

[ASPLOS 2019] PUMA-simulator provides a detailed simulation model of a dataflow architecture built with NVM (non-volatile memory), and runs ML models compiled using the puma compiler.

Python 62 45 Updated Apr 17, 2023

Zhaoshixin-sky / CIM-MLC

[ASPLOS 2024] CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators

Python 29 4 Updated May 25, 2024

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 23,398 2,127 Updated Mar 27, 2025

deepseek-ai / DeepSeek-R1

87,632 11,316 Updated Feb 24, 2025

VIA-Research / uPIMulator

C 131 18 Updated Feb 1, 2025

VITA-MLLM / VITA

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,184 164 Updated Feb 13, 2025

upmem / upmem_llm_framework

UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.

Python 25 3 Updated Feb 11, 2025

tukl-msd / DRAMSys

DRAMSys a SystemC TLM-2.0 based DRAM simulator.

C++ 252 61 Updated Mar 18, 2025

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 86,710 12,838 Updated Mar 27, 2025

bytedance / Valley

Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.

Python 224 12 Updated Feb 27, 2025

AlibabaPAI / llumnix

Efficient and easy multi-instance LLM serving

Python 348 27 Updated Mar 27, 2025

deepseek-ai / DeepSeek-V3

Python 94,272 15,249 Updated Mar 16, 2025

FMInference / DejaVu

Python 311 41 Updated Apr 2, 2024

mit-han-lab / vila-u

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 249 7 Updated Jan 22, 2025

tud-ccc / Cinnamon

C++ 23 7 Updated Mar 18, 2025

CMU-SAFARI / ramulator2

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 307 67 Updated Dec 11, 2024

neos zineos

Organizations

Starred repositories

video-quality-assessment