
Organizations

@coqui-ai

Starred repositories

TTS with kokoro and onnx runtime

Python 1,719 157 Updated Mar 1, 2025

A Conversational Speech Generation Model

4,860 141 Updated Feb 26, 2025

Video editing with Python

Python 13,119 1,692 Updated Feb 6, 2025

A community-maintained Python framework for creating mathematical animations.

Python 30,411 2,128 Updated Mar 3, 2025

Animation engine for explanatory math videos

Python 75,815 6,590 Updated Feb 26, 2025

An open collection of annotated voices in the Japanese language

Python 49 1 Updated Mar 7, 2025

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits for the end of the source utterance to start translating, H…

Rust 847 62 Updated Feb 9, 2025

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

Python 7,814 775 Updated Feb 11, 2024

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,987 417 Updated May 10, 2023

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Python 2,098 321 Updated Nov 14, 2023

Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".

Python 886 57 Updated Oct 28, 2024

This repository contains the Hugging Face Agents Course.

Jupyter Notebook 13,906 842 Updated Mar 7, 2025

Lightweight, Python-based chat UI

Python 287 42 Updated Mar 6, 2025

On-device voice activity detection (VAD) powered by deep learning

Python 200 13 Updated Mar 5, 2025

A curated list of awesome voice activity detection

40 2 Updated Nov 22, 2024

A repository of Open-WebUI tools to use with your favourite LLMs

Python 149 15 Updated Mar 7, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 81,672 9,820 Updated Mar 7, 2025

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,615 320 Updated Jan 4, 2024

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,174 93 Updated Mar 4, 2025

Audio Dataset for training CLAP and other models

Python 668 56 Updated Feb 5, 2024

A set of scripts to grab public datasets from resources related to arXiv

Python 429 70 Updated May 20, 2024

SpeechGPT Series: Speech Large Language Models

Python 1,350 90 Updated Jul 22, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 37,141 4,384 Updated Aug 19, 2024

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,622 116 Updated Jul 5, 2024

A curated list of awesome voice conversion projects and communities.

223 13 Updated Jan 13, 2025

Easily train a good VC model with voice data <= 10 mins!

Python 27,627 3,919 Updated Nov 24, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,314 126 Updated Jul 11, 2024

The open source code for LLM-Codec

Python 130 7 Updated Aug 18, 2024

Audio Large Language Models

Python 421 26 Updated Feb 27, 2025

GPT-4o-level, real-time spoken dialogue system.

Python 283 19 Updated Jan 27, 2025