Skip to content
View zineos's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@TeamWiseFlow

Block or report zineos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 1,061 47 Updated Mar 27, 2025

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Python 226 26 Updated Jan 31, 2025

[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation

Python 119 7 Updated Jan 22, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,301 222 Updated Mar 27, 2025

🤗 R1-AQA Model: mispeech/r1-aqa

Python 207 17 Updated Mar 27, 2025

Fast inference from large lauguage models via speculative decoding

Python 695 68 Updated Aug 22, 2024
Python 20 5 Updated Feb 21, 2025

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 9,092 658 Updated Mar 27, 2025

Official Repo for "Compressing Large Language Models with Automated Sub-Network Search"

Python 4 2 Updated Feb 6, 2025
Python 72 5 Updated Mar 25, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,416 178 Updated Mar 18, 2025

[ASPLOS 2019] PUMA-simulator provides a detailed simulation model of a dataflow architecture built with NVM (non-volatile memory), and runs ML models compiled using the puma compiler.

Python 62 45 Updated Apr 17, 2023

[ASPLOS 2024] CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators

Python 29 4 Updated May 25, 2024

Fully open reproduction of DeepSeek-R1

Python 23,398 2,127 Updated Mar 27, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,184 164 Updated Feb 13, 2025

UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.

Python 25 3 Updated Feb 11, 2025

DRAMSys a SystemC TLM-2.0 based DRAM simulator.

C++ 252 61 Updated Mar 18, 2025

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 86,710 12,838 Updated Mar 27, 2025

Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.

Python 224 12 Updated Feb 27, 2025

Efficient and easy multi-instance LLM serving

Python 348 27 Updated Mar 27, 2025
Python 311 41 Updated Apr 2, 2024

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 249 7 Updated Jan 22, 2025
C++ 23 7 Updated Mar 18, 2025

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 307 67 Updated Dec 11, 2024
Next
Showing results