- Ankara, Türkiye
-
18:43
(UTC +09:00) - https://esozbek.me
- @Trojaner_
ai
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
🔊 Text-Prompted Generative Audio Model
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A latent text-to-image diffusion model
ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
The definitive Web UI for local AI, with powerful features and easy setup.
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)
Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes
A language for constraint-guided and efficient LLM programming.
Advanced fine tuning tools for vision models
Python library to download bulk of images from Bing.com
Windows compile of bitsandbytes for use in text-generation-webui.
An open-source tool for making really cool QR codes with AI
Easily train a good VC model with voice data <= 10 mins!
web UI for GPU-accelerated ONNX pipelines like Stable Diffusion, even on Windows and AMD
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Chat language model that can use tools and interpret the results
Official JavaScript / TypeScript library for the OpenAI API
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …





