Computer Vision
OpenMMLab Pose Estimation Toolbox and Benchmark.
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
LAVIS - A One-stop Library for Language-Vision Intelligence
Azure AI Foundry (demos, documentation, accelerators).
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
A GPT-4/Gemini Voice/Video Exploration Tool
fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的agent框架。
pix2tex: Using a ViT to convert images of equations into LaTeX code.
We write your reusable computer vision tools. 💜
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
DeepFaceLab is the leading software for creating deepfakes.
State-of-the-art 2D and 3D Face Analysis Project
[AAAI 2025] Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
Strong and Open Vision Language Assistant for Mobile Devices
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Emu Series: Generative Multimodal Models from BAAI
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Industry leading face manipulation platform
Metric depth estimation from a single image
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Industry leading face manipulation platform
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
OCR, layout analysis, reading order, table recognition in 90+ languages
Segment Anything in Medical Images
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop




