Skip to content
View 425776024's full-sized avatar
🤒
Out sick
🤒
Out sick
  • Shenzhen China

Block or report 425776024

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pure Rust implementation of the DeepPhonemizer G2P model.

Rust 9 2 Updated May 7, 2024

TTS Towards Human-Sounding Speech

Python 3,083 219 Updated Mar 27, 2025

SynCity: Training-Free Generation of 3D Worlds

413 27 Updated Mar 21, 2025

A Pure Rust based LLM (Any LLM based MLLM such as Spark-TTS) Inference Engine, powering by Candle framework.

Rust 84 4 Updated Mar 26, 2025

Bananas🍌, Cross-Platform screen 🖥️ sharing 📡 made simple ⚡.

Svelte 4,394 119 Updated Mar 17, 2025

一方云剪是一款不依赖服务器服务的视频剪辑站点,通过整合@hughfenghen的WebAV、opfs-tools,添加一些必要的剪辑功能,希望能给相关开发者更多的帮助和启发。

Vue 22 5 Updated Feb 24, 2025

Motion-Controllable Video Diffusion via Warped Noise

Python 820 42 Updated Mar 25, 2025

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

Python 487 18 Updated Jul 2, 2024

基于 Rust 构建的现代化高性能后台管理系统脚手架。采用 Axum 作为 Web 框架,SeaORM 处理数据库操作,Casbin 实现 RBAC 权限控制。特点是类型安全、模块化架构,并实现了核心的后台管理功能。

Rust 59 7 Updated Mar 24, 2025

🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.

Rust 458 40 Updated Mar 26, 2025

Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.

Python 712 25 Updated Jan 23, 2025

Video readers, writers, muxers, encoders and decoders for Rust based on ffmpeg libraries.

Rust 304 36 Updated Mar 3, 2025

Differentiable Rendering Toolkit

Cuda 86 6 Updated Mar 19, 2025

Web-based 3D visualization + Python

Python 1,058 72 Updated Mar 28, 2025

A Fish Speech implementation in Rust, with Candle.rs

Rust 75 3 Updated Feb 24, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,938 651 Updated Mar 27, 2025
Python 913 104 Updated Jan 23, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,245 278 Updated Nov 5, 2024

[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 3,733 412 Updated Dec 10, 2024

Gaussian Shell Maps for Efficient 3D Human Generation (CVPR 2024)

Jupyter Notebook 216 10 Updated Jan 3, 2024

Sample codes for my CUDA programming book

Cuda 1,677 341 Updated Feb 15, 2025

[CVPR 2024] Official implementation of Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation

Python 91 10 Updated Jun 15, 2024

Source code for: Expressive Speech-driven Facial Animation with controllable emotions

Python 37 6 Updated Jan 4, 2024

💎A high level python lib for face landmarks detection: training, eval, export, inference(Python/C++) and 100+ data augmentations.

Python 255 24 Updated Feb 7, 2025

Official Pytorch Implementation of SPECTRE: Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos

Python 271 23 Updated Mar 24, 2025

Official Pytorch Implementation of SMIRK: 3D Facial Expressions through Analysis-by-Neural-Synthesis (CVPR 2024)

Python 259 31 Updated May 31, 2024

Official Implementation of 'ReliableSwap: Boosting General Face Swapping Via Reliable Supervision'

Python 205 18 Updated Sep 28, 2023
Python 188 13 Updated Apr 11, 2024

Summary of publicly available ressources such as code, datasets, and scientific papers for the FLAME 3D head model

491 23 Updated Mar 10, 2025
Next
Showing results