Stars
A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search…
整合图片识别 API,用于以图搜源 / Aggregator for Reverse Image Search API
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, arXiv2024]
Witness the aha moment of VLM with less than $3.
Community maintained fork of pdfminer - we fathom PDF
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Various AI scripts. Mostly Stable Diffusion stuff.
Training-free Regional Prompting for Diffusion Transformers 🔥
Official repository of In-Context LoRA for Diffusion Transformers
A minimal and universal controller for FLUX.1.
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Let your Claude able to think
[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)
[ECCV 2024] ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Medical Image
Official inference repo for FLUX.1 models
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Finetuning CLIP on a small image/text dataset using huggingface libs
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gra…