[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,196 454 Updated Mar 22, 2025

RenShuhuai-Andy / TimeChat

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Python 353 32 Updated Nov 19, 2024

interactive-3d / interactive3d

[CVPR'24] Interactive3D: Create What You Want by Interactive 3D Generation

Python 180 7 Updated Sep 9, 2024

antgroup / echomimic_v2

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,412 396 Updated Feb 27, 2025

dqqcasia / awesome-speech-translation

Forked from ucaslyc/speech_translation-papers

177 1 Updated Nov 10, 2021

pixeli99 / SVD_Xtend

Stable Video Diffusion Training Code and Extensions.

Python 676 66 Updated Jul 25, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 1,949 129 Updated Mar 9, 2025

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 11,063 1,054 Updated Mar 25, 2025

mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,324 111 Updated Aug 27, 2024

anliyuan / Ultralight-Digital-Human

An ultra-lightweight digital human model that can run in real time on mobile devices

Python 1,749 255 Updated Mar 5, 2025

facebookresearch / sapiens

High-resolution models for human tasks.

Python 4,911 291 Updated Nov 18, 2024

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,824 1,496 Updated Mar 28, 2025

Huanshere / VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI video subtitle team

Python 12,073 1,184 Updated Mar 22, 2025

fudan-generative-vision / hallo2

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,520 505 Updated Feb 27, 2025

Fa-Ting Hong harlanhong

Lists (1)

🔮 Future ideas

Stars

deepseek-ai / FlashMLA

AILab-CVC / CV-VAE

MiZhenxing / ThinkDiff

ArthurBrussee / brush

ali-vilab / MangaNinjia

ashawkey / stable-dreamfusion

Stability-AI / stable-point-aware-3d

khan9048 / Facial_depth_estimation

xg-chu / GAGAvatar_track

xg-chu / GPAvatar

yfeng95 / face3d

zhuhao-nju / facescape

xg-chu / GAGAvatar

Genesis-Embodied-AI / Genesis

chuangchuangtan / NPR-DeepfakeDetection

ShaelynZ / synergize-motion-appearance

FoundationVision / VAR