An open-source & self-hostable Heroku / Netlify / Vercel alternative.

PHP 40,165 2,393 Updated Apr 16, 2025

Industry leading face manipulation platform

Python 22,439 3,417 Updated Apr 16, 2025

State-of-the-art 2D and 3D Face Analysis Project

Python 24,905 5,548 Updated Mar 25, 2025

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

415 26 Updated Apr 11, 2025

The official code of our CVPR2025 paper: "Segment Any-Quality Images with Generative Latent Space Enhancement".

5 Updated Mar 12, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 2,470 209 Updated Mar 14, 2025

Lightweight and Efficient, 🎧Ultra High-Quality Voice Cloning, Chinese and English.

Python 56 4 Updated Apr 6, 2025
Python 4,617 313 Updated Apr 12, 2025

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.

Python 11,739 2,471 Updated Feb 10, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 934 80 Updated Apr 17, 2025

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Python 4,798 460 Updated Feb 12, 2025

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Python 2,961 299 Updated Aug 10, 2024

Depth-Aware Video Frame Interpolation (CVPR 2019)

Python 8,297 841 Updated Feb 13, 2023

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Python 3,941 496 Updated Apr 13, 2025

LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 1,816 127 Updated Apr 17, 2025

ComfyUI node for F5-Text To Speech

Python 166 21 Updated Apr 5, 2025

LLM Agent Framework in ComfyUI includes MCP server, Omost, GPT-sovits, ChatTTS, GOT-OCR2.0, and FLUX prompt nodes, access to Feishu, Discord, and adapts to all LLMs with similar openai / aisuite interfac…

Python 1,615 138 Updated Apr 9, 2025

Taming Stable Diffusion for Lip Sync!

Python 3,664 542 Updated Apr 16, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 11,236 1,565 Updated Apr 14, 2025

📄 A curated list of awesome .cursorrules files

22,152 1,671 Updated Mar 20, 2025

Spark-TTS Inference Code

Python 8,519 876 Updated Apr 9, 2025

Explore RPF files just like OpenIV or CodeWalker just with the big difference that everything is implemented right into your Windows File Explorer!

C++ 9 2 Updated Apr 21, 2022
C# 559 227 Updated Apr 11, 2025

Bring portraits to life!

Python 1 Updated Feb 28, 2025

[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 3,790 420 Updated Dec 10, 2024

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 13,570 951 Updated Apr 17, 2025

OpenIV.asi but actually open source

C++ 44 7 Updated Feb 18, 2022

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.

Python 365 59 Updated Apr 14, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,394 689 Updated Mar 5, 2025