Stars
Pure Rust implementation of the DeepPhonemizer G2P model.
A pure-Rust LLM inference engine (for any LLM-based MLLM, such as Spark-TTS), powered by the Candle framework.
Bananas🍌, Cross-Platform screen 🖥️ sharing 📡 made simple ⚡.
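一方云剪 is a video editing site that does not depend on server-side services. It integrates @hughfenghen's WebAV and opfs-tools and adds some essential editing features, in the hope of offering more help and inspiration to developers working in this area.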
Motion-Controllable Video Diffusion via Warped Noise
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
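A modern, high-performance admin system scaffold built with Rust. It uses Axum as the web framework, SeaORM for database operations, and Casbin for RBAC access control. It features type safety and a modular architecture, and implements the core admin management functions.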
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, real-time TTS with the best quality you have ever had.
Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.
Video readers, writers, muxers, encoders and decoders for Rust based on ffmpeg libraries.
A Fish Speech implementation in Rust, with Candle.rs
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Gaussian Shell Maps for Efficient 3D Human Generation (CVPR 2024)
Sample code for my CUDA programming book
[CVPR 2024] Official implementation of Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation
Source code for: Expressive Speech-driven Facial Animation with controllable emotions
💎 A high-level Python library for face landmark detection: training, evaluation, export, inference (Python/C++), and 100+ data augmentations.
Official Pytorch Implementation of SPECTRE: Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos
Official Pytorch Implementation of SMIRK: 3D Facial Expressions through Analysis-by-Neural-Synthesis (CVPR 2024)
Official Implementation of 'ReliableSwap: Boosting General Face Swapping Via Reliable Supervision'
Summary of publicly available resources such as code, datasets, and scientific papers for the FLAME 3D head model