Stars
Release for the SIGGRAPH Asia 2023 SKEL paper "From Skin to Skeleton: Towards Biomechanically Accurate 3D Digital Humans".
An invisible desktop application to help you pass your technical interviews.
[CVPR 2025] UniK3D: Universal Camera Monocular 3D Estimation
Drivable 3D Gaussian Avatars - A 3D controllable model for human bodies rendered with Gaussian primitives embedded in tetrahedral cages.
[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
One minute of voice data is enough to train a good TTS model (few-shot voice cloning).
Code release for the paper "Reconstructing People, Places, and Cameras", in CVPR 2025
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
An open source voice assistant toolkit for many human languages
Offline private voice assistant for many human languages
Solving SMPL/MANO parameters from keypoint coordinates.
Official code for "SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation"
Prepare Raspberry Pi 3, 4 & 5 configurations using a virtual machine.
Make websites accessible for AI agents
Open-Sora: Democratizing Efficient Video Production for All
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
A Python framework for high performance GPU simulation and graphics
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
A list of Free Software network services and web applications which can be hosted on your own servers
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.