jianchang512

🏠

Working from home

okmyworld jianchang512

🏠

Working from home

1.7k followers · 84 following

https://pyvideotrans.com

Achievements

Stars

ai

64 repositories

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,751 1,167 Updated Nov 14, 2024

deezer / spleeter

Deezer source separation library including pretrained models.

Python 28,046 3,065 Updated Apr 2, 2025

wladradchenko / wunjo.wladradchenko.ru

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.

JavaScript 1,123 117 Updated Feb 3, 2026

yerfor / GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,659 300 Updated Oct 18, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,577 5,969 Updated Aug 16, 2024

babysor / MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,875 5,245 Updated Jan 7, 2026

KevinWang676 / ChatGLM2-Voice-Cloning

Chat with any character you like: ChatGLM2+SadTalker+Voice Cloning | 和喜欢的角色沉浸式对话吧：ChatGLM2+声音克隆+视频对话

Python 614 93 Updated Aug 11, 2023

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 34,492 4,902 Updated Nov 24, 2024

Anjok07 / ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 23,679 1,768 Updated Mar 13, 2025

OpenTalker / video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 7,215 1,068 Updated Aug 5, 2024

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

Python 27,989 5,081 Updated Nov 11, 2023

Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 5,019 737 Updated Jan 21, 2025

Const-me / Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

C++ 10,150 910 Updated Aug 3, 2024

KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 9,475 815 Updated Jul 11, 2025

Purfview / whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

2,869 155 Updated Nov 7, 2025

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 21,052 1,728 Updated Nov 19, 2025

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 8,445 745 Updated Aug 13, 2024

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 8,692 1,261 Updated Feb 16, 2026

KoljaB / RealtimeTTS

Converts text to speech in realtime

Python 3,767 366 Updated Jan 11, 2026

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,533 1,954 Updated Feb 11, 2026

labring / FastGPT

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 27,139 6,927 Updated Feb 21, 2026

songquanpeng / one-api

LLM API 管理 & 分发系统，支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型，统一 API 适配，可用于 key 管理与二次分发。单可执行文件，提供 Docker 镜像，一键部署，开箱即用。LLM API management & k…

JavaScript 29,733 5,720 Updated Jan 9, 2026

streamproc / MediaStreamRecorder

Cross browser audio/video/screen recording. It supports Chrome, Firefox, Opera and Microsoft Edge. It even works on Android browsers. It follows latest MediaRecorder API standards and provides simi…

JavaScript 2,675 558 Updated Jul 4, 2018

ai-ng / 2txt

Image to text, fast.

TypeScript 556 74 Updated Dec 3, 2025

nomadkaraoke / python-audio-separator

Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)

Python 1,034 166 Updated Jan 24, 2026

stlukey / whispercpp.py

Python bindings for whisper.cpp

Python 249 104 Updated Jun 1, 2024

EvalsOne / UnionLLM

通过与OpenAI兼容的统一方式调用国内外各种大语言模型和Agent编排工具API的轻量级开源Python工具包。

Python 111 9 Updated Jan 30, 2026

tsurumeso / vocal-remover

Vocal Remover using Deep Neural Networks

Python 1,743 254 Updated Jul 23, 2024

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 8,347 1,908 Updated Sep 6, 2025

OpenBMB / MiniCPM-o

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 23,858 1,838 Updated Feb 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

okmyworld jianchang512

Achievements

Achievements

Block or report jianchang512

ai

facebookresearch / seamless_communication

deezer / spleeter

wladradchenko / wunjo.wladradchenko.ru

yerfor / GeneFace

coqui-ai / TTS

babysor / MockingBird

KevinWang676 / ChatGLM2-Voice-Cloning

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Anjok07 / ultimatevocalremovergui

OpenTalker / video-retalking

svc-develop-team / so-vits-svc

Plachtaa / VITS-fast-fine-tuning

Const-me / Whisper

KoljaB / RealtimeSTT

Purfview / whisper-standalone-win

SYSTRAN / faster-whisper

netease-youdao / EmotiVoice

fishaudio / Bert-VITS2

KoljaB / RealtimeTTS

PaddlePaddle / PaddleSpeech

labring / FastGPT

songquanpeng / one-api

streamproc / MediaStreamRecorder

ai-ng / 2txt

nomadkaraoke / python-audio-separator

stlukey / whispercpp.py

EvalsOne / UnionLLM

tsurumeso / vocal-remover

nl8590687 / ASRT_SpeechRecognition

OpenBMB / MiniCPM-o