Computer Vision
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)
[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals
[CVPR'23, Highlight] ECON: Explicit Clothed humans Optimized via Normal integration
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
Painter & SegGPT Series: Vision Foundation Models from BAAI
Grounded Segment Anything: From Objects to Parts
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
[ICCV 2023] Total-Recon: Deformable Scene Reconstruction for Embodied View Synthesis
Upload a photo of your room to generate your dream room with AI.
Restoring old and blurry face photos with AI.
ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Official code release for ICCV2023 paper AG3D: Learning to Generate 3D Avatars from 2D Image Collections
Automatically find issues in image datasets and practice data-centric computer vision.
Using ChatGPT to create AR experiences with natural language.
Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"
Official Code for DragGAN (SIGGRAPH 2023)
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)