- Shanghai China
- http://satomic.in
Stars
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2E, F5-TTS, CosyVoice), with Whisper audio processing, RVC voice changer, YouTube downlo…
リアルタイムボイスチェンジャー Realtime Voice Changer
✨ A real-time voice changer application using WebSockets and ONNX/TensorFlow/PyTorch
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
A generative world for general-purpose robotics & embodied AI learning.
cmliu / edgetunnel
Forked from zizifn/edgetunnel在原版的基础上修改了显示 VLESS 配置信息转换为订阅内容。使用该脚本,你可以方便地将 VLESS 配置信息使用在线配置转换到 Clash 或 Singbox 等工具中。
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
A natural language interface for computers
A generative speech model for daily dialogue.
朋友圈转发截图生成工具(
Easy and blazing-fast book searcher, create and search your private library.
Official Code for DragGAN (SIGGRAPH 2023)
Yet another voice assistant, but alive.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.