jianchang512

Stars

ai

64 repositories

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,751 1,167 Updated Nov 14, 2024

Deezer source separation library including pretrained models.

Python 28,046 3,065 Updated Apr 2, 2025

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.

JavaScript 1,123 117 Updated Feb 3, 2026

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,659 300 Updated Oct 18, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,577 5,969 Updated Aug 16, 2024

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,875 5,245 Updated Jan 7, 2026

Chat with any character you like: ChatGLM2+SadTalker+Voice Cloning | Immersive conversations with your favorite characters: ChatGLM2 + voice cloning + video dialogue

Python 614 93 Updated Aug 11, 2023

Easily train a good VC model with voice data <= 10 mins!

Python 34,492 4,902 Updated Nov 24, 2024

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 23,679 1,768 Updated Mar 13, 2025

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 7,215 1,068 Updated Aug 5, 2024

SoftVC VITS Singing Voice Conversion

Python 27,989 5,081 Updated Nov 11, 2023

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 5,019 737 Updated Jan 21, 2025

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

C++ 10,150 910 Updated Aug 3, 2024

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 9,475 815 Updated Jul 11, 2025

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

2,869 155 Updated Nov 7, 2025

Faster Whisper transcription with CTranslate2

Python 21,052 1,728 Updated Nov 19, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 8,445 745 Updated Aug 13, 2024

vits2 backbone with multilingual-bert

Python 8,692 1,261 Updated Feb 16, 2026

Converts text to speech in realtime

Python 3,767 366 Updated Jan 11, 2026

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,533 1,954 Updated Feb 11, 2026

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 27,139 6,927 Updated Feb 21, 2026

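LLM API management and distribution system supporting mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot, iFlytek Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It provides unified API adaptation and can be used for key management and redistribution. Ships as a single executable with a Docker image, for one-click deployment that works out of the box. LLM API management & k…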

JavaScript 29,733 5,720 Updated Jan 9, 2026

Cross browser audio/video/screen recording. It supports Chrome, Firefox, Opera and Microsoft Edge. It even works on Android browsers. It follows latest MediaRecorder API standards and provides simi…

JavaScript 2,675 558 Updated Jul 4, 2018

Image to text, fast.

TypeScript 556 74 Updated Dec 3, 2025

Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)

Python 1,034 166 Updated Jan 24, 2026

Python bindings for whisper.cpp

Python 249 104 Updated Jun 1, 2024

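A lightweight open-source Python toolkit for calling the APIs of various Chinese and international large language models and agent orchestration tools in a unified, OpenAI-compatible way.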

Python 111 9 Updated Jan 30, 2026

Vocal Remover using Deep Neural Networks

Python 1,743 254 Updated Jul 23, 2024

A Deep-Learning-Based Chinese Speech Recognition System

Python 8,347 1,908 Updated Sep 6, 2025

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 23,858 1,838 Updated Feb 15, 2026