[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,198 455 Updated Mar 22, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 24,588 2,149 Updated Mar 29, 2025

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,460 793 Updated Mar 12, 2025

baofff / U-ViT

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Jupyter Notebook 987 69 Updated Mar 25, 2023

Lightricks / LTX-Video

Official repository for LTX-Video

Python 3,221 282 Updated Mar 5, 2025

jmtomczak / intro_dgm

"Deep Generative Modeling": Introductory Examples

Jupyter Notebook 1,150 184 Updated Sep 22, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 21,080 1,490 Updated Feb 6, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

32,989 1,804 Updated Aug 1, 2024

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 8,151 729 Updated Mar 26, 2025

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,169 90 Updated Feb 16, 2025

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,710 1,235 Updated May 23, 2024

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,092 1,374 Updated Mar 3, 2025

Tencent / HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,026 334 Updated Jan 13, 2025

HVision-NKU / StoryDiffusion

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,247 623 Updated Sep 26, 2024

mini-sora / minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Python 1,262 151 Updated Feb 18, 2025

ZHO-ZHO-ZHO / ComfyUI-Workflows-ZHO

我的 ComfyUI 工作流合集 | My ComfyUI workflows collection

6,144 573 Updated Dec 20, 2024

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.

Go 135,241 11,223 Updated Mar 29, 2025

PixArt-alpha / PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,783 87 Updated Oct 31, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 25,889 2,493 Updated Mar 27, 2025

xai-org / grok-1

Grok open release

Python 50,249 8,359 Updated Aug 30, 2024

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,936 1,055 Updated Mar 29, 2025

google / gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Python 5,402 533 Updated Mar 21, 2025

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,535 906 Updated Jul 1, 2024

painebenjamin / app.enfugue.ai

ENFUGUE is an open-source web app for making studio-grade images and video using generative AI.

Python 719 67 Updated Oct 29, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 31,530 3,192 Updated Jan 7, 2025

Zhuoning Yuan yzhuoning

Highlights

Lists (2)

ChatGPT

🔮 Future ideas

Starred repositories

Tensorflow