I’m a CS student who likes to build things.

Unified automatic quality assessment for speech, music, and sound.

Python 406 25 Updated Mar 7, 2025

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

Python 7,819 775 Updated Feb 11, 2024

vits2 backbone with multilingual-bert

Python 8,302 1,171 Updated Mar 3, 2025

A nearly-live implementation of OpenAI's Whisper.

Python 2,544 330 Updated Feb 26, 2025

Bat To Exe in Portableapps.com Format. Official repo here https://github.com/99fk/Bat-To-Exe-Converter-Downloader

HTML 35 5 Updated Nov 29, 2019

A Calibre plugin to translate ebooks into a specified language.

Python 1,883 129 Updated Mar 6, 2025

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Python 787 65 Updated Mar 9, 2025

Runs the new self-hosted GitHub Actions runners with Docker-in-Docker

Shell 1,816 406 Updated Feb 24, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,378 1,119 Updated Nov 14, 2024

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 7,605 725 Updated Feb 27, 2025

A collection of inspiring lists, repos, datasets, models, tools and more for Persian-language speech-to-text (STT) and text-to-speech (TTS).

55 5 Updated Dec 9, 2024

Persian/Farsi text-to-speech (TTS) training using Coqui TTS

Jupyter Notebook 137 18 Updated Feb 15, 2025

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Jupyter Notebook 541 55 Updated Sep 11, 2023

A collection of Docker templates and plugins for Unraid

Shell 6 17 Updated Mar 5, 2025

A Conversational Speech Generation Model

5,630 183 Updated Feb 26, 2025

High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.

Python 5,727 763 Updated Dec 24, 2024

PyTorch Implementation of GoEmotions 😍😢😱

Python 158 49 Updated Jun 12, 2023

S3PRL-VC: A Voice Conversion Toolkit based on S3PRL

Python 99 12 Updated Jun 26, 2024

Emotional Speech Conversion using Nonparallel Data

Jupyter Notebook 16 5 Updated Apr 10, 2019

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Python 192 27 Updated Nov 9, 2022

Toolkit for linearizing PDFs for LLM datasets/training

Python 9,208 596 Updated Mar 7, 2025

Official Repo for Open-Reasoner-Zero

Python 1,558 73 Updated Mar 5, 2025

A Live Feed Facial Emotion Detection Web Application.

Jupyter Notebook 11 4 Updated Oct 5, 2020

Dynamic and static models for real-time facial emotion recognition

Jupyter Notebook 119 22 Updated Aug 2, 2024
Python 125 11 Updated Feb 28, 2025

The Runner for GitHub Actions 🚀

C# 5,158 1,025 Updated Mar 10, 2025
Python 3,881 308 Updated Mar 6, 2025

https://hf.co/hexgrad/Kokoro-82M

JavaScript 1,526 153 Updated Mar 1, 2025

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Python 464 28 Updated Feb 21, 2025

Run macOS on QEMU/KVM. With OpenCore + Monterey + Ventura + Sonoma support now! Only commercial (paid) support is available now to avoid spammy issues. No Mac system is required.

Python 21,264 1,910 Updated Oct 10, 2024