Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,913 1,455 Updated Sep 5, 2024

facebookresearch / ov-seg

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Jupyter Notebook 717 63 Updated Oct 17, 2023

baaivision / Painter

Painter & SegGPT Series: Vision Foundation Models from BAAI

Python 2,557 178 Updated Dec 6, 2024

Saiyan-World / grounded-segment-any-parts

Grounded Segment Anything: From Objects to Parts

Jupyter Notebook 400 20 Updated May 19, 2023

segments-ai / panoptic-segment-anything

Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation

Jupyter Notebook 402 26 Updated May 3, 2024

johannakarras / DreamPose

Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"

Python 995 77 Updated Nov 2, 2023

facebookresearch / AnimatedDrawings

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Python 12,328 1,058 Updated Aug 9, 2024

gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,651 487 Updated May 31, 2024

andrewsonga / Total-Recon

[ICCV 2023] Total-Recon: Deformable Scene Reconstruction for Embodied View Synthesis

Python 216 6 Updated Aug 13, 2024

Nutlope / roomGPT

Upload a photo of your room to generate your dream room with AI.

TypeScript 10,260 1,419 Updated Apr 20, 2024

Nutlope / restorePhotos

Restoring old and blurry face photos with AI.

TypeScript 4,042 642 Updated Jun 20, 2024

phamquiluan / jdeskew

ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation

Jupyter Notebook 133 11 Updated Jan 11, 2025

deepdoctection / deepdoctection

A Repo For Document AI

Python 2,746 150 Updated Mar 11, 2025

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,403 480 Updated Mar 10, 2025

chenfei-wu / TaskMatrix

Python 34,522 3,301 Updated Jan 6, 2024

zj-dong / AG3D

Official code release for ICCV2023 paper AG3D: Learning to Generate 3D Avatars from 2D Image Collections

Python 261 23 Updated Sep 25, 2023

cleanlab / cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

Python 1,056 72 Updated Apr 23, 2024

bmaltais / kohya_ss

Python 10,275 1,322 Updated Mar 6, 2025

trzy / ChatARKit

Using ChatGPT to create AR experiences with natural language.

C 432 36 Updated Mar 31, 2023

synthesiaresearch / humanrf

Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"

Python 466 28 Updated Sep 17, 2024

XingangPan / DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,883 3,450 Updated May 18, 2024

OpenGVLab / DragGAN

Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" （DragGAN 全功能实现，在线Demo，本地部署试用，代码、模型已全部开源，支持Windows, macOS, Linux）

Python 4,985 489 Updated Jul 17, 2023

thu-ml / prolificdreamer

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)

Python 1,524 45 Updated Nov 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fabio Dias Rollo fabiodr

Block or report fabiodr

Computer Vision

yfeng95 / SCARF

CMU-Perceptual-Computing-Lab / openpose

yfeng95 / DECA

yfeng95 / PIXIE

YuliangXiu / ICON

YuliangXiu / ECON

MVIG-SJTU / AlphaPose

IDEA-Research / Grounded-Segment-Anything