Skip to content
View pluja's full-sized avatar
💭
Staying calm
💭
Staying calm

Organizations

@ytorg

Block or report pluja

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI

Artificial Intelligence related
28 repositories
Python 790 52 Updated Sep 22, 2022

Robust Speech Recognition via Large-Scale Weak Supervision

Python 94,843 11,779 Updated Dec 15, 2025

A Stable Diffusion desktop frontend with inpainting, img2img and more!

Jupyter Notebook 1,274 89 Updated Mar 21, 2023

Stable Diffusion web UI

Python 160,698 29,974 Updated Dec 18, 2025

A simple notebook demonstrating prompt-based music generation via Mubert API

Jupyter Notebook 2,736 232 Updated May 4, 2023

Rembg is a tool to remove images background

Python 21,912 2,228 Updated Feb 20, 2026

Port of OpenAI's Whisper model in C/C++

C++ 46,844 5,222 Updated Feb 19, 2026

OpenAI Whisper ASR Webservice API

Python 3,166 556 Updated Nov 23, 2025

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Python 17,902 1,313 Updated Feb 19, 2026

Your personal, fully customizable, Linux Voice Control Assistant.

Python 188 14 Updated Feb 10, 2024

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 26,777 2,774 Updated Feb 20, 2026

An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, a…

JavaScript 10,253 851 Updated Feb 13, 2026

Stable diffusion for real-time music generation (web app)

TypeScript 2,680 213 Updated Jul 22, 2024

Stable Diffusion built-in to Blender

Python 8,115 436 Updated Aug 26, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,808 2,052 Updated Nov 19, 2024

Real-time face swap for PC streaming or video calls

Python 30,531 1,120 Updated Nov 8, 2024

The no-code platform for building custom LLM Agents

2,945 423 Updated Jun 17, 2024

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,789 552 Updated Jul 11, 2024

one-click face swap

Python 30,520 6,904 Updated Aug 19, 2024

Handwriting Synthesis with RNNs ✏️

Python 4,717 668 Updated Jan 11, 2024

Segment Anything in High Quality [NeurIPS 2023]

Jupyter Notebook 4,177 266 Updated Sep 12, 2025

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 23,000 2,574 Updated Mar 13, 2025

Self-hosted AI coding assistant

Rust 32,899 1,682 Updated Feb 14, 2026

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,750 1,168 Updated Nov 14, 2024

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,830 347 Updated Jan 21, 2025

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 4,041 350 Updated Jan 8, 2025

Inference and training library for high-quality TTS models.

Python 5,534 582 Updated Dec 10, 2024