ZZfive

Follow

🎯

Focusing

叶夜靥 ZZfive

🎯

Focusing

Follow

8 followers · 12 following

Earth

Achievements

Achievements

Lists (21)

Sort

3d

20 repositories

agent

25 repositories

audio

59 repositories

comfyui

cv

55 repositories

cv_work

209 repositories

datasets

diffusion language model

digital-human

10 repositories

flow mathcing

inference optimization

infrastructure

28 repositories

Languge Diffusion Models

LLM reason model

15 repositories

LLMs

139 repositories

multi-modal

42 repositories

Personal skill improvement

111 repositories

RAG

RL

tool

video

68 repositories

Stars

liuff19 / Video-T1

Official Implementation of Video-T1: Test-Time Scaling for Video Generation

Python 148 6 Updated Mar 27, 2025

canopyai / Orpheus-TTS

TTS Towards Human-Sounding Speech

Python 3,082 219 Updated Mar 27, 2025

aigc3d / LHM

Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds

Python 1,173 78 Updated Mar 27, 2025

bytedance / InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Python 1,232 94 Updated Mar 25, 2025

Tencent / FlashVDM

Unleashing Vecset Diffusion Model for Fast Shape Generation within 1 Second.

Python 170 2 Updated Mar 21, 2025

nikhilsab / LLMFE

This is the official repo for the paper "LLM-FE"

6 Updated Mar 24, 2025

joel-simon / lluminate

Python 63 5 Updated Mar 18, 2025

jacklishufan / Reflect-DiT

Python 10 Updated Mar 20, 2025

gnobitab / RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 1,175 64 Updated Jul 20, 2024

fenghora / personalize-anything

Jupyter Notebook 265 7 Updated Mar 20, 2025

Fr0zenCrane / Cockatiel

Python 14 Updated Mar 26, 2025

ThisisBillhe / NAR

The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"

Python 30 Updated Mar 19, 2025

KwaiVGI / ReCamMaster

[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

745 17 Updated Mar 27, 2025

hp-l33 / ARPG

Autoregressive Image Generation with Randomized Parallel Decoding

Python 32 Updated Mar 27, 2025

rohitgandikota / distillation

Distilling Diversity and Control in Diffusion Models

Jupyter Notebook 32 1 Updated Mar 26, 2025

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,803 229 Updated Mar 27, 2025

xie-lab-ml / CoRe2

The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster".

Python 21 Updated Mar 19, 2025

prs-eth / thera

Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields

Python 707 46 Updated Mar 26, 2025

qixucen / atom

Python 523 46 Updated Mar 27, 2025

SesameAILabs / csm

A Conversational Speech Generation Model

Python 11,850 989 Updated Mar 27, 2025

thu-pacman / chitu

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,046 69 Updated Mar 27, 2025

kuleshov-group / bd3lms

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python 431 28 Updated Mar 25, 2025

XianfengWu01 / LightGen

An Efficient Text-to-Image Generation Pretrain Pipeline

Python 90 3 Updated Mar 19, 2025

TideDra / lmm-r1

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 659 38 Updated Mar 27, 2025

modelcontextprotocol / servers

Model Context Protocol Servers

JavaScript 26,450 2,724 Updated Mar 28, 2025

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

Python 2,090 232 Updated Mar 24, 2025

THU-MIG / yoloe

YOLOE: Real-Time Seeing Anything

Python 928 76 Updated Mar 24, 2025

Osilly / Vision-R1

This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…

Python 397 9 Updated Mar 24, 2025

RUCAIBox / Slow_Thinking_with_LLMs

A series of technical report on Slow Thinking with LLM

Python 596 33 Updated Mar 28, 2025

turningpoint-ai / VisualThinker-R1-Zero

Explore the Multimodal “Aha Moment” on 2B Model

Python 535 17 Updated Mar 18, 2025