Stars
A quick way to automatically send Google Form responses to a Discord channel
Remove large amounts of unwanted applications quickly.
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
Foundational model for human-like, expressive TTS
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Webui for using XTTS and for finetuning it
Deezer source separation library including pretrained models.
Phython Script to bend GCode
Exploring the benefits and limitations of multi-axis 3D printing for improved part quality and reduced waste
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
TTS pipeline that uses RVC to enhance audio quality and cloning
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
File list with huggingface repo links for hubert quantizer models.
Variational Recurrent Autoencoder for timeseries clustering in pytorch
Suno AI's Bark model in C/C++ for fast text-to-speech generation
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Temporal Pattern Attention for Multivariate Time Series Forecasting
music generation with masked transformers!
Loop generation with MusicGen
BrowserFS is an in-browser filesystem that emulates the Node JS filesystem API and supports storing and retrieving files from various backends.
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
A language learning app to improve speaking and listening skills.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.