Python 432 93 Updated Nov 25, 2024

Workflow orchestration UI and node editor for your own Python codebase

TypeScript 37 1 Updated Oct 30, 2024
C# 279 27 Updated Sep 9, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,121 853 Updated Mar 27, 2025

C++ library for converting text to phonemes for Piper

C++ 112 89 Updated Mar 13, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 14,701 1,598 Updated Mar 25, 2025

I've been trying quite hard to use the IVONA Amy voice on Linux natively, from trying to reverse engineer the APKs and DLL files, to hacking Waydroid to be compatible, and port forwarding/ssh via …

Shell 3 Updated Jun 30, 2023

All Algorithms implemented in Python

Python 198,837 46,486 Updated Mar 24, 2025

A Jupyter widgets-based interactive notebook for Google Colab to generate images using Stable Diffusion.

Jupyter Notebook 17 10 Updated Dec 13, 2023

fMRI-to-image reconstruction on the NSD dataset.

Jupyter Notebook 320 46 Updated May 22, 2024
Python 55 15 Updated Mar 13, 2024

A Colab notebook repo for using the Diffusers library (not a web UI)

Jupyter Notebook 20 Updated Oct 2, 2023

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,384 144 Updated Feb 10, 2025

Godot Engine – Multi-platform 2D and 3D game engine

C++ 95,599 22,007 Updated Mar 26, 2025

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram

Python 244 32 Updated Jul 25, 2024

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs

Python 11,625 2,446 Updated Feb 10, 2025

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python 8,748 2,690 Updated Aug 13, 2024

A curated list of open source projects used in nuclear science and engineering

369 72 Updated Aug 20, 2024

Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project

Python 984 102 Updated Aug 29, 2023

An MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

TypeScript 8,284 807 Updated Feb 13, 2025

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,517 2,334 Updated Jun 26, 2024

A Chess Bot powered by OpenAI's ChatGPT

Python 21 7 Updated Mar 7, 2024

A multi document reader and chatbot using LangChain and ChatGPT

Python 140 55 Updated Feb 7, 2024

Template for duplicating and executing Hugging Face Spaces on SM Studio Lab, Google Colab, or locally.

Jupyter Notebook 11 2 Updated Jan 9, 2023

📚 A collection of sketch based application papers.

618 62 Updated Mar 23, 2025

Panel: The powerful data exploration & web app framework for Python

Python 5,122 540 Updated Mar 27, 2025

A list of awesome beginner-friendly projects.

72,727 7,214 Updated Mar 21, 2025

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)

TypeScript 2,080 217 Updated Mar 21, 2025

A timeline of the latest AI models for audio generation, starting in 2023!

1,898 71 Updated Jan 4, 2024

Finetuning VITS Efficiently

Python 32 6 Updated Nov 6, 2023