Skip to content
View tomasklaen's full-sized avatar

Organizations

@drovp

Block or report tomasklaen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI

25 repositories

A reference containing Styles and Keywords that you can use with MidJourney AI. There are also pages showing resolution comparison, image weights, and much more!

12,285 1,665 Updated Apr 11, 2025

WebUI extension for ControlNet

Python 17,874 2,029 Updated Aug 12, 2024

roop extension for StableDiffusion web-ui

Python 3,514 609 Updated Apr 1, 2024

Multi-Platform Package Manager for Stable Diffusion

C# 7,554 521 Updated Feb 15, 2026

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 23,003 2,576 Updated Mar 13, 2025

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Python 3,003 359 Updated Apr 22, 2025

[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation

Python 1,018 65 Updated Feb 21, 2026

A ComfyUI workflows and models management extension to organize and manage all your workflows, models in one place. Seamlessly switch between workflows, as well as import, export workflows, reuse s…

TypeScript 1,414 74 Updated Apr 16, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,909 879 Updated Jul 18, 2024

Comflowyspace is an intuitive, user-friendly, open-source AI tool for generating images and videos, democratizing access to AI technology.

TypeScript 2,350 131 Updated Aug 30, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,980 4,020 Updated Apr 19, 2025

A simple, high-quality voice conversion tool focused on ease of use and performance.

Python 2,991 492 Updated Feb 18, 2026

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,103 2,081 Updated Feb 16, 2026

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 11,870 1,105 Updated Nov 5, 2025

[NeurIPS 2024] Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation

Python 420 22 Updated May 22, 2025

This workflow for ComfyUI will allow you to transfering subjects into new pictures while retaining their original features.

80 1 Updated Apr 2, 2025

AI Browser

JavaScript 6,793 660 Updated Jan 27, 2026

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 7,162 814 Updated Mar 5, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 19,120 1,666 Updated Nov 19, 2025

StableDelight: Revealing Hidden Textures by Removing Specular Reflections

Python 393 17 Updated Sep 18, 2025

[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Python 750 36 Updated Aug 2, 2025

SoTA open-source TTS

Python 22,773 2,996 Updated Feb 3, 2026
Python 6,065 469 Updated Aug 29, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 79,553 2,582 Updated Feb 20, 2026