- 1) Pubilc Datasets and Challenges
- 2) Pioneers and Experts
- 3) Blogs, Videos and Applications
- 4) Papers and Sources Codes
- ▶ Related Survey
- ▶ Single Person Pose Estimation
- ▶ Two-Stage [Top-Down] Multiple Person Pose Estimation
- ▶ Two-Stage [Bottom-Up] Multiple Person Pose Estimation
- ▶ Single-Stage Multiple Person Pose Estimation
- ▶ Simultaneous Multiple Person Pose Estimation and Instance Segmentation
- ▶ 3D Multiple Person Pose Estimation
- ▶ Special Multiple Person Pose Estimation
- ▶ Transfer Learning of Multiple Person Pose Estimation
- ▶ Keypoints Meet Large Language Model
- ▶ Keypoints for Human Motion Generation
- LIP(Look Into Person)
- Human3.6M (TPAMI2014) (3D single person)
- MPII Human Pose Dataset [Annotations(Matlab-->Python)]
- COCO - Common Objects in Context
- AI Challenger (arxiv2017 & ICME2019)[paper link]
- MHP - Multi-Human Parsing (ACMMM2018)
- DensePose-COCO Dataset (CVPR2018)
- PoseTrack: Dataset and Benchmark [challenges links][paper link][github link]
- ⭐OCHuman(Occluded Human) Dataset (CVPR2019) [github link]
- ⭐CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark (CVPR2019) [paper link]
- ⭐JTA(Joint Track Auto) - A synthetical dataset from GTA-V (ECCV2018)[paper link][github link][JTA-Extension]
- Mannequin RGB and IRS in-bed pose estimation dataset
- ⭐CMU Panoptic Studio Dataset (3D single and multiple real person pose in the lab) [github link]
- SURREAL dataset (CVPR2017) (3D single synthetic person pose in the indoor)[paper link]
- Drive&Act dataset (ICCV2019) (3D openpose single real person pose in the car with 5 views)[paper link]
- ⭐COCO-WholeBody (ECCV2020) (re-annotated based on keypoints in COCO dataset)[paper link][ZoomNAS(TPAMI2022)]
- Halpe-FullBody (CVPR2020) (full body human pose estimation and human-object interaction detection dataset)[paper link]
- IKEA ASSEMBLY DATASET (WACV2021) (3D single and multiple real person pose in the lab with 3 views)[paper link][google drive]
- Yoga-82: A New Dataset for Fine-grained Classification of Human Poses[kaggle]
- UAV-Human Dataset (CVPR2021) (not all appeared persons are annotated)[paper link][google drive]
- Mirrored-Human Dataset: Reconstructing 3D Human Pose by Watching Humans in the Mirror (CVPR2021 Oral)[paper link]
- ⭐AGORA: A synthetic human pose and shape dataset (CVPR2021) [paper link][github link][STAR (ECCV2020)][SMPL-X (CVPR2019)][FLAME (SIGGRAPH2017)][SMPL (SIGGRAPH2015)][rankers webpage]
- InfiniteForm: Open Source Dataset for Human Pose Estimation (NIPSW2021) [paper link][github link]
- Lower Body Rehabilitation Dataset and Model Optimization (ICME2021) [
The first human keypoints detection dataset for physical therapy, in particular lower body rehabilitation
] - ⭐UrbanPose: A new benchmark for VRU pose estimation in urban traffic scenes (IEEE Intelligent Vehicles Symposium (IV) 2021) [paper link]
- HMR-Benchmarks: Benchmarking 3D Pose and Shape Estimation Beyond Algorithms (NIPS2022) [paper link]
- SynPose: A Large-Scale and Densely Annotated Synthetic Dataset for Human Pose Estimation in Classroom (ICASSP2022) [paper link][
Based on GTA-V, CycleGAN, ST-GCN and DEKR
] - ⭐JRDB-Pose: A Large-scale Dataset for Multi-Person Pose Estimation and Tracking (ICCV2019 & CVPR2021 & ECCV2022 & CVPR2023) [paper link][arxiv link][dataset details]
- ⭐Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes (CVPR2023) [paper link][By IDEA-Research]
👍Alejandro Newell 👍Jia Deng 👍Zhe Cao 👍Tomas Simon 👍tensorboy 👍murdockhou 👍张兆翔
- (B站 video) 张锋-2D单人人体姿态估计及其应用
- (B站 video) 人工智能 | 基于人体骨架的行为识别
- (Website) 姿态估计交流网站ilovepose
- (CSDN blog) Paper List:CVPR 2018 人体姿态估计相关
- (blog) ECCV 2020 论文大盘点-姿态估计与动作捕捉篇
- (blog) ECCV 2020 论文大盘点-3D人体姿态估计篇
- (github) Awesome Human Pose Estimation (cbsudux)
- (github) Awesome Human Pose Estimation (wangzheallen)
- (real time pose in github) tf-pose-estimation
- (real time pose in github) 💃 Real-time single person pose estimation for Android and iOS
- (real time pose in github) Real-time 2D MPPE on CPU: Lightweight OpenPose
- (Application) FXMirror虚拟试衣解决方案
- (Application) 3D试衣间:人工智能虚拟试衣系统
- (blog) A Comprehensive Guide to Human Pose Estimation
- (blog) (MMPose) 2D BODY KEYPOINT DATASETS
- (github) (coco-annotator) Web-based image segmentation tool for object detection, localization, and keypoints
- ComputingSurveys 2022 Recent Advances of Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective [paper link][arxiv link][
JD
+HIT
]
-
Modeep(ACCV2014)(video based) MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation [arxiv link]
-
(NIPS2014)(heatmaps) Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation [arxiv link]
-
⭐PoseMachines(ECCV2014)(regression) Pose Machines: Articulated Pose Estimation via Inference Machines [paper link][project link]
-
⭐DeepPose(CVPR2014)(AlexNet based)(regression) DeepPose: Human Pose Estimation via Deep Neural Networks [arxiv link][Codes|OpenCV(unoffical)]
-
(ICCV2015)(video based) Flowing ConvNets for Human Pose Estimation in Videos [arxiv link]
-
(ECCV2016)(heatmaps) Human Pose Estimation using Deep Consensus Voting [arxiv link]
-
(CVPR2016)(structure information) End-To-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for Human Pose Estimation [paper link]
-
(CVPR2016)(structure information) Structured Feature Learning for Pose Estimation [paper link]
-
IEF(CVPR2016)(GoogleNet Based)(regression) Human Pose Estimation with Iterative Error Feedback [arxiv link]
-
⭐CPM(CVPR2016)(heatmaps) Convolutional Pose Machines [arxiv link][Codes|Caffe(offical)][Codes|Tensorflow(unoffical)]
-
⭐StackedHourglass(ECCV2016)(heatmaps) Stacked Hourglass Networks for Human Pose Estimation [arxiv link][Codes|Torch7(offical old)][Codes|PyTorch(offical new)][Codes|Tensorflow(unoffical)]
-
HourglassResidualUnits(HRUs)(CVPR2017)(heatmaps) Multi-context Attention for Human Pose Estimation [arciv link]
-
PyraNet(ICCV2017)(heatmaps) Learning Feature Pyramids for Human Pose Estimation [arxiv link][Codes|Torch(offical)]
-
(ICCV2017)(ResNet-50 Based)(regression) Compositional Human Pose Regression [arxiv link]
-
⭐Adversarial-PoseNet(ICCV2017)(GAN) Adversarial PoseNet: A Structure-aware Convolutional Network for Human Pose Estimation [arxiv link][Codes|PyTorch(unoffical)]
-
(ECCV2018)(structure information) Multi-Scale Structure-Aware Network for Human Pose Estimation [arxiv link]
-
(ECCV2018)(structure information) Deeply Learned Compositional Models for Human Pose Estimation [paper link]
-
(CVPR2018)(multi-task/video based)(regression) 2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning [arxiv link]
-
(CVPR2019)(structure information) Does Learning Specific Features for Related Parts Help Human Pose Estimation? [paper link]
-
(arxiv2020)(video based) Key Frame Proposal Network for Efficient Pose Estimation in Videos [arxiv link]
-
UniPose(CVPR2020)(video based) UniPose: Unified Human Pose Estimation in Single Images and Videos [arxiv link][Codes|PyTorch(offical)]
-
(ECCVW2016) Multi-Person Pose Estimation with Local Joint-to-Person Associations [arxiv link]
-
(CVPR2017) Towards Accurate Multi-person Pose Estimation in the Wild [arxiv link]
-
(ICCV2017) A Coarse-Fine Network for Keypoint Localization [paper link]
-
⭐AlphaPose/RMPE(ICCV2017) RMPE: Regional Multi-person Pose Estimation [arxiv link][project link][Codes|PyTorch(offical)]
-
⭐SimpleBaseline(ECCV2018) Simple Baselines for Human Pose Estimation and Tracking [arxiv link][Codes|PyTorch(offical)][Codes|PyTorch(flowtrack part)]
-
⭐CPN(CVPR2018) Cascaded Pyramid Network for Multi-Person Pose Estimation [arxiv link][Codes|Tensorflow(offical)][Codes|Tensorflow(offical megvii)][zhihu blogs]
-
⭐HRNet(CVPR2019) Deep High-Resolution Representation Learning for Human Pose Estimation [arxiv link][Codes|PyTorch(offical)][Codes|(Repositories using HRNet as backbone)][Codes|Tensorflow for fun][Codes|Tensorflow HRNet-V2(unoffical)]
-
⭐CrowdPose(CVPR2019) CrowdPose: Efficient Crowded Scenes Pose Estimation and a New Benchmark [paper link][codes|(SJTU) official PyTorch]
-
(CVPR2019) Multi-Person Pose Estimation with Enhanced Channel-wise and Spatial Information [arxiv link]
-
(CVPR2019) PoseFix: Model-Agnostic General Human Pose Refinement Network [paper link]
-
(arxiv2019) Rethinking on Multi-Stage Networks for Human Pose Estimation [arxiv link]
-
⭐DarkPose(CVPR2020) Distribution-Aware Coordinate Representation for Human Pose Estimation [arxiv link][project link][Codes|PyTorch(offical)]
-
⭐UDP-Pose(CVPR2020) The Devil Is in the Details: Delving Into Unbiased Data Processing for Human Pose Estimation [arxiv link][Codes|][
A model-agnostic approach
,Plug-and-Play
] -
Graph-PCNN(arxiv 2020) Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement [arxiv link]
-
RSN-PRM(arxiv2020) Learning Delicate Local Representations for Multi-Person Pose Estimation [arxiv link]
-
OPEC-Net(arxiv2020)(ECCV2020) Peeking into occluded joints: A novel framework for crowd pose estimation [arxiv link][for
Crowded Human Pose Estimation
] -
(arxiv2020)(video based) Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos [arxiv link]
-
⭐PoseNAS(ACMMM2020) Pose-native Network Architecture Search for Multi-person Human Pose Estimation [paper link][codes|official PyTorch][
Network Architecture Search (NAS) based two-stage MPPE
] -
CCM(IJCV2021) Towards High Performance Human Keypoint Detection [paper link][codes|official (not released)]
-
OmniPose(arxiv2021) OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation [arxiv link]
-
RLE(ICCV2021) Human Pose Regression With Residual Log-Likelihood Estimation [paper link][code|official]
-
MIPNet(ICCV2021) Multi-Instance Pose Networks: Rethinking Top-Down Pose Estimation [paper link][project link][codes|official demo]
-
⭐TransPose(ICCV2021) TransPose: Keypoint Localization via Transformer [paper link][codes|official PyTroch][
Transformer based two-stage MPPE (light-weight)
] -
⭐TokenPose(ICCV2021) TokenPose: Learning Keypoint Tokens for Human Pose Estimation [paper link][codes|official PyTroch][
Token representation based two-stage MPPE (light-weight)
] -
⭐Lite-HRNet(CVPR2021) Lite-HRNet: A Lightweight High-Resolution Network [paper link][codes|official PyTorch][
This work is done by the original group of HRNet
] -
HRFormer(NIPS2021) HRFormer: High-Resolution Transformer for Dense Prediction [paper link][code|official][
multi-task
,2D Human Pose Estimation
,Semantic Segmentation
] -
⭐LitePose(CVPR2022) Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation [paper link][project link][codes|official PyTorch][
Model quantization and compression on Qualcomm Snapdragon
] -
CID(CVPR2022) Contextual Instance Decoupling for Robust Multi-Person Pose Estimation [paper link][codes|official][(TPAMI2023) Contextual Instance Decoupling for Instance-Level Human Analysis][First Author: Dongkai Wang]
-
Poseur(ECCV2022) Poseur: Direct Human Pose Regression with Transformers [paper link][code|official][
RLE-based
,DETR-based top-down framework
] -
SCIO(ECCV2022) Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation [paper link][arxiv link][
Test-Time Adaptation
, the same author ofSCAI
] -
Swin-Pose(arxiv2022)(MIPR2022) Swin-Pose: Swin Transformer Based Human Pose Estimation [paper link][
Swin Transformer
] -
⭐ViTPose(NIPS2022) ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation [paper link][arxiv link][code|official][ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation (arxiv2022.12) (TPAMI2023)][
Tao Dacheng
,plain vision transformer
] -
PCT(CVPR2023) Human Pose as Compositional Tokens [arxiv link][code|official][
Transformer-based
] -
DistilPose(CVPR2023) DistilPose: Tokenized Pose Regression With Heatmap Distillation [paper link][arxiv link][code|offical][
Xia Men University
,Regression-based
, Transformer] -
BCIR(Bias Compensated Integral Regression)(TPAMI2023) Bias-Compensated Integral Regression for Human Pose Estimation [paper link][arxiv link][
A model-agnostic approach
,Plug-and-Play
] -
ICON(AAAI2023) Inter-image Contrastive Consistency for Multi-Person Pose Estimation [paper link][
Xixia Xu
,No code
, sever as a play-in-plug]
-
DeepCut(CVPR2016) DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation [arxiv link]
-
⭐DeeperCut(ECCV2016) DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model [arxiv link][project link][Codes|Tensorflow(offical)]
-
ArtTrack(CVPR2017) ArtTrack: Articulated Multi-Person Tracking in the Wild [paper link]
-
⭐OpenPose(CVPR2017) Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [arxiv link][Codes|Caffe&Matlab(offical)][Codes|Caffe(offical only for testing)]Codes|PyTorch(unoffical by tensorboy)]
-
⭐AssociativeEmbedding(NIPS2017) Associative Embedding: End-to-end Learning for Joint Detection and Grouping [arxiv link][Codes|PyTorch(offical)]
-
(ICCVW2017) Multi-Person Pose Estimation for PoseTrack with Enhanced Part Affinity Fields [paper link][CSDN blog]
-
PPN(ECCV2018) Pose Partition Networks for Multi-Person Pose Estimation [paper link][
To partition all keypoint detections using dense regressions from keypoint candidates to centroids of persons
,similar to SPM
] -
(CVPRW2018) Learning to Refine Human Pose Estimation [arxiv link]
-
⭐MultiPoseNet(ECCV2018)(multi-task) MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network [arxiv link][Codes|PyTorch(offical)]
-
OpenPoseTrain(ICCV2019) Single-Network Whole-Body Pose Estimation [paper link][codes|official][
simultaneous localization of body, face, hands, and feet keypoints
] -
⭐OpenPifPaf(CVPR2019) PifPaf: Composite Fields for Human Pose Estimation [paper link][Codes|PyTorch(offical)]
-
⭐HigherHRNet(CVPR2020) HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation [arxiv link][Codes|PyTorch(offical)]
-
⭐MDN3(CVPR2020) Mixture Dense Regression for Object Detection and Human Pose Estimation [arxiv link][Codes|PyTorch(offical)]
-
HGG(arxiv2020)(ECCV2020) Differentiable Hierarchical Graph Grouping for Multi-person Pose Estimation [paper link][arxiv link]
-
⭐EfficientHRNet(arxiv2020) EfficientHRNet: Efficient Scaling for Lightweight High-Resolution Multi-Person Pose Estimation [paper link]
-
SimplePose(AAAI2020) Simple pose: Rethinking and improving a bottom-up approach for multi-person pose estimation [paper link][codes|official PyTorch][
An improved OpenPose based on Stacked Hourglass and proposed Body Parts
] -
DGCN(AAAI2020) DGCN: Dynamic Graph Convolutional Network for Efficient Multi-Person Pose Estimation [paper link][
Graph based two-stage MPPE
] -
⭐CenterGroup(ICCV2021) The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation [paper link][codes|official PyTorch based on mmpose and HigherHRNet]
-
⭐SWAHR(CVPR2021) Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation [arxiv link][Codes|official pytorch based on HigherHRNet]
-
⭐DEKR(CVPR2021) Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression [arxiv link][Codes|official pytorch]
-
PINet(NIPS2021) Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference [paper link][codes|official PyTorch][First Author: Dongkai Wang][For
Crowded Scenes
, FollowingHigherHRNet
andDEKR
] -
DAC(arxiv2022) Bottom-Up 2D Pose Estimation via Dual Anatomical Centers for Small-Scale Persons [arxiv link][
Dual Anatomical Centers (Head + Body)
] -
CoupledEmbedding(ECCV2022) Regularizing Vector Embedding in Bottom-Up Human Pose Estimation [paper link][codes|official PyTorch]
-
DEKRv2(ICIP2022) DEKRv2: More Fast or Accurate than DEKR [paper link][codes|official PyTorch]
-
HrHRNet-CF(CVPR2023) A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation [paper link]
-
BUCTD(ICCV2023) Rethinking Pose Estimation in Crowds: Overcoming the Detection Information Bottleneck and Ambiguity [paper link][project link][arxiv link][code|official][
EPFL
]
-
DirectPose(arxiv2019) DirectPose: Direct End-to-End Multi-Person Pose Estimation [arxiv link][
DirectPose proposes to directly regress the instance-level keypoints by considering the keypoints as a special bounding-box with more than two corners.
] -
SPM(ICCV2019) Single-Stage Multi-Person Pose Machines [arxiv link][Codes|PyTorch(offical not released)][Codes|Tensorflow(unoffical)][CSDN blog]
-
CenterNet(arxiv2019) Objects as Points [arxiv link]
-
Point-Set Anchors(ECCV2020) Point-Set Anchors for Object Detection, Instance Segmentation and Pose Estimation [paper link]
-
POET(arxiv2021) End-to-End Trainable Multi-Instance Pose Estimation with Transformers [arxiv link][
DETR-based
,regression
] -
TFPose(arxiv2021) TFPose: Direct Human Pose Estimation with Transformers [arxiv link][project link][
It adopts Detection Transformers to estimate the cropped single-person images as a query-based regression task
][end2end top-down
] -
InsPose(ACMMM2021) InsPose: Instance-Aware Networks for Single-Stage Multi-Person Pose Estimation [paper link][code|official][
It designs instance-aware dynamic networks to adaptively adjust part of the network parameters for each instance
] -
DeepDarts(CVPRW2021) DeepDarts: Modeling Keypoints as Objects for Automatic Scorekeeping in Darts using a Single Camera [paper link]
-
⭐FCPose(CVPR2021) FCPose: Fully Convolutional Multi-Person Pose Estimation With Dynamic Instance-Aware Convolutions [paper link][codes|official]
-
PRTR(CVPR2021) Pose Recognition With Cascade Transformers [paper link][codes|official][
transformer-based
,high input resolution and stacked attention modules
,high complexity and require huge memory during the training phase
][end2end top-down
] -
⭐KAPAO(ECCV2022) Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation [arxiv link][codes|(official pytorch using YOLOv5)]
-
YOLO-Pose(CVPRW2022) YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss [paper link][codes|official edgeai-yolox][codes|official edgeai-yolov5]
-
⭐AdaptivePose(AAAI2022) AdaptivePose: Human Parts as Adaptive Points [paper link][codes|official PyTorch]
-
⭐AdaptivePose++(TCSVT2022) AdaptivePose++: A Powerful Single-Stage Network for Multi-Person Pose Regression [paper link][codes|official PyTorch]
-
LOGO-CAP(CVPR2022) Learning Local-Global Contextual Adaptation for Multi-Person Pose Estimation [paper link][codes|official PyTorch]
-
PETR(CVPR2022) End-to-End Multi-Person Pose Estimation With Transformers [paper link][codes|official PyTorch][
transformer-based
,high input resolution and stacked attention modules
,high complexity and require huge memory during the training phase
][fully end2end
] -
QueryPose(NIPS2022) QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query [openreview link][arxiv link][code|official][
fully end2end
] -
⭐ED-Pose(ICLR2023) Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation [arxiv link][openreview link][code|official][
IDEA-Research
][fully end2end
] -
PolarPose(TIP2023) PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates [paper link]
-
👍GroupPose(ICCV2023) Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation [paper link][arxiv link][code|official Paddle][code|official PyTorch]
-
👍RTMO(arxiv2023.12) RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation [arxiv link][code|official][
Tsinghua Shenzhen International Graduate School
andShanghai AI Laboratory
; the code is released byOpen-MMLab
]
-
⭐Mask R-CNN(ICCV2017)(multi-task) Mask R-CNN [paper link]
-
⭐PersonLab(ECCV2018)(multi-task) PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model [arxiv link][Codes|Keras&Tensorflow(unoffical by octiapp)][Codes|Tensorflow(unoffical)]
-
ACPNet(ICME2019) ACPNet: Anchor-Center Based Person Network for Human Pose Estimation and Instance Segmentation [paper link][
based on Mask R-CNN
] -
Pose2Seg(CVPR2019) Pose2Seg: Detection Free Human Instance Segmentation [paper link][codes|official]
-
PointSetNet(ECCV2020) Point-Set Anchors for Object Detection, Instance Segmentation and Pose Estimation [paper link][
Not a multi-task end-to-end network
,The proposed Point-Set Anchors can be applied to object detection, instance segmentation and human pose estimation tasks separately
] -
MG-HumanParsing(CVPR2021) Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing [paper link][code|official]
-
Multitask-CenterNet(ICCVW2021) MultiTask-CenterNet (MCN): Efficient and Diverse Multitask Learning Using an Anchor Free Approach [paper link][
based on the CenterNet
] -
MDSP(IVS2022 Oral) Multitask Network for Joint Object Detection, Semantic Segmentation and Human Pose Estimation in Vehicle Occupancy Monitoring [paper link]
-
PosePlusSeg(AAAI2022) Joint Human Pose Estimation and Instance Segmentation with PosePlusSeg [paper link][codes|official tensorflow][
similarly with the PersonLab
,Niaz Ahmad
,suspected of plagiarism
] -
MultiPoseSeg(ICPR2022) MultiPoseSeg: Feedback Knowledge Transfer for Multi-Person Pose Estimation and Instance Segmentation [paper link][code|official][
similarly with the PersonLab
,Niaz Ahmad
,suspected of plagiarism
] -
HCQNet(Human-Centric Query)(arxiv2023.03) Object-Centric Multi-Task Learning for Human Instances [paper link][based on the
Mask2Former (CVPR2022) (Masked-attention mask transformer for universal image segmentation)
]
-
mvpose(CVPR2019)(monocular multi-view) Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views [arxiv link][project link][Codes|Torch&Tensorflow(offical)]
-
EpipolarPose(CVPR2019)(monocular multi-view) Self-Supervised Learning of 3D Human Pose using Multi-view Geometry [arxiv link][project link][Codes|PyTorch(offical)]
-
SMAP(ECCV2020) SMAP: Single-Shot Multi-person Absolute 3D Pose Estimation [paper link][project link][codes|official PyTorch]
-
(multi-views)(ICCV2021) Shape-aware Multi-Person Pose Estimation from Multi-View Images [paper link][project link][codes|official]
-
MVP(NIPS2021) Direct Multi-view Multi-person 3D Pose Estimation [paper link][codes|official PyTorch]
-
InverseKinematics(ECCV2022) Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement [paper link][datasets
3DPW
,MuCo-3DHP
andAGORA
][transformer
] -
HUPOR(ECCV2022) Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation [arxiv link][paper link][code|official]
-
POTR3D(ICCV2023) Towards Robust and Smooth 3D Multi-Person Pose Estimation from Monocular Videos in the Wild [paper link][arxiv link][
Seoul National University
]
-
PoseTrack(CVPR2017) PoseTrack: Joint Multi-Person Pose Estimation and Tracking [arxiv link][Codes|Matlab&Caffe]
-
Detect-and-Track(CVPR2018) Detect-and-Track: Efficient Pose Estimation in Videos [arxiv link][project link][Codes|Detectron(offical)][codes|official]
-
PoseFlow(BMVC2018) Pose Flow: Efficient Online Pose Tracking [arxiv link][Codes|AlphaPose(offical)]
-
DensePose(CVPR2018) DensePose: Dense Human Pose Estimation In The Wild [arxiv link][project link][Codes|Caffe2(offical)]
-
RF-Pose(CVPR2018)(radio frequency) Through-Wall Human Pose Estimation Using Radio Signals [paper link][project link]
-
👍LIP_JPPNet(TPAMI2019) Look into Person: Joint Body Parsing & Pose Estimation Network and a New Benchmark [paper link][Lab Homepage][code|official][
Joint Body Parsing & Pose Estimation
] -
DoubleFusion(TPAMI2019)(3D single-view real-time depth-sensor) DoubleFusion: Real-time Capture of Human Performances with Inner Body Shapes from a Single Depth Sensor [arxiv link]
-
Keypoint-Communities(ICCV2019) Keypoint Communities [paper link][
Model all keypoints belonging to a human or an object (the pose) as a graph
] -
BlazePose (CVPRW2020) BlazePose: On-device Real-time Body Pose tracking [paper link][project link]
-
ODKD(arxiv2021) Orderly Dual-Teacher Knowledge Distillation for Lightweight Human Pose Estimation [paper link][
Knowledge Distillation of MPPE based on HRNet
] -
DDP(3DV2021) Direct Dense Pose Estimation [paper link][
Dense human pose estimation
] -
MEVADA(ICCV2021) Single View Physical Distance Estimation using Human Pose [paper link][project link]
-
Unipose+(TPAMI2022) UniPose+: A Unified Framework for 2D and 3D Human Pose Estimation in Images and Videos [paper link][author given link]
-
👍HTCorrM(Human Task Correlation Machine)(TPAMI2022) On the Correlation among Edge, Pose and Parsing [paper link][pdf link][
Multi-tasks Learning
] -
PoseTrack21(CVPR2022) PoseTrack21: A Dataset for Person Search, Multi-Object Tracking and Multi-Person Pose Tracking [paper link][codes|official][
jointly person search, multi-object tracking and multi-person pose tracking
] -
PoseTrans(ECCV2022) PoseTrans: A Simple Yet Effective Pose Transformation Augmentation for Human Pose Estimation [paper link]
-
⭐DeciWatch(ECCV2022) DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation [paper link][code|official][project link][
Video based human pose estimation
] -
PPT(ECCV2022) PPT: Token-Pruned Pose Transformer for Monocular and Multi-view Human Pose Estimation [paper link][code|official]
-
QuickPose(SIGGRAPH2022) QuickPose: Real-time Multi-view Multi-person Pose Estimation in Crowded Scenes [paper link][
ZJU
] -
TDMI-ST(CVPR2023) Mutual Information-Based Temporal Difference Learning for Human Pose Estimation in Video [paper link][
PoseTrack2017, PoseTrack2018, and PoseTrack21
,video-based HPE
] -
MG-HumanParsing(TPAMI2023) Differentiable Multi-Granularity Human Parsing [paper link][code|official][
Human Parsing
] -
Obj2Seq(NIPS2022) Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks [openreview link][arxiv link][code|official][
ViT-based
,Multi-task model
] -
👍AutoLink(NIPS2022) AutoLink: Self-supervised Learning of Human Skeletons and Object Outlines by Linking Keypoints [arxiv link][openreview link][project link]
Domain Adaptive / Unsupervised / Self-Supervised / Semi-Supervised / Weakly-Supervised / Generalizable
- VL4Pose(BMVC2022) VL4Pose: Active Learning Through Out-Of-Distribution Detection For Pose Estimation [arxiv link][code|official][with tasks of single
human pose
andhand pose
]
-
👍SynPose(ICASSP2022) Synpose: A Large-Scale and Densely Annotated Synthetic Dataset for Human Pose Estimation in Classroom [paper link][project link][
Based on GTA-V, CycleGAN, ST-GCN and DEKR
] -
👍CC-PoseNet(ICASSP2023) CC-PoseNet: Towards Human Pose Estimation in Crowded Classrooms [paper link]
-
WS-CDA(ICCV2019) Cross-Domain Adaptation for Animal Pose Estimation [paper link][arxiv link][project link][code|official][
Animal Pose Dataset
,Leverages human pose data and a partially annotated animal pose dataset to perform semi-supervised domain adaptation
] -
👍CC-SSL(CVPR2020) Learning From Synthetic Animals [paper link][arxiv link][code|official][
Animal Pose
][It proposed invariance and equivariance consistency learning with respect to transformations as well as temporal consistency learning with a video
;It employs a single end-to-end trained network
] -
👍MDAM, UDA-Animal-Pose(CVPR2021) From Synthetic to Real: Unsupervised Domain Adaptation for Animal Pose Estimation [paper link][codes|PyTorch][
Animal Pose
][ResNet + Hourglass
][It proposed a refinement module and a self-feedback loop to obtain reliable pseudo labels
;It addresses the teacher-student paradigm alongside a novel pseudo-label strategy
] -
⚡DeepLabCut (Nature Methods 2022) Multi-animal pose estimation, identification and tracking with DeepLabCut [paper link]
-
⚡Social LEAP Estimates Animal Poses (SLEAP) (Nature Methods 2022) SLEAP: A deep learning system for multi-animal pose tracking [paper link]
-
SemiMultiPose(arxiv2022) SemiMultiPose: A Semi-supervised Multi-animal Pose Estimation Framework [paper link][
Semi-Supervised Keypoint Localization
] -
AnimalKingdom (CVPR2022) Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding [paper link][project link][arxiv link][code|official]
-
⭐ScarceNet(CVPR2023) ScarceNet: Animal Pose Estimation With Scarce Annotations [paper link][arxiv link][code|official][
Animal Pose
,Semi-Supervised Keypoint Localization
, based onHRNet
][small-loss trick for reliability check
+agreement check to identify reusable samples
+student-teacher network (Mean Teacher) to enforce a consistency constraint
] -
AnimalTrack (IJCV2023) AnimalTrack: A Benchmark for Multi-Animal Tracking in the Wild [arxiv link][project link][download page][
Animal dataset
] -
LoTE-Animal (ICCV2023) LoTE-Animal: A Long Time-span Dataset for Endangered Animal Behavior Understanding [paper link][project link][Animal dataset]
-
Animal3D (ICCV2023) Animal3D: A Comprehensive Dataset of 3D Animal Pose and Shape [paper link][arxiv link][project link][based on the
SMAL
model, Animal dataset] -
⚡Social Behavior Atlas (SBeA) (Nature Machine Intelligence 2024) Multi-animal 3D social pose estimation, identification and behaviour embedding with a few-shot learning framework [paper link]
-
(CVIU2017) Hand Pose Estimation through Semi-Supervised and Weakly-Supervised Learning [paper link][arxiv link][
Universite de Lyon
; using thedepth
input] -
(ECCV2018) Weakly-supervised 3D Hand Pose Estimation from Monocular RGB Images [paper link][
No code is available
,Nanyang Technological University
,a weakly-supervised method with the aid of depth images
,3D Hand Pose Estimation
,Keypoints
] -
SO-HandNet(ICCV2019) SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation With Semi-Supervised Learning [paper link][
No code is available
,Wuhan University
,3D Hand Pose Estimation
,Keypoints
, based onSO-Net
and 3D point clouds] -
weak_da_hands(CVPR2020) Weakly-Supervised Domain Adaptation via GAN and Mesh Model for Estimating 3D Hand Poses Interacting Objects [paper link][code|official (not available)]
-
SemiHand(ICCV2021) SemiHand: Semi-Supervised Hand Pose Estimation With Consistency [paper link][
No code is available
,semi-supervised hand pose estimation
] -
MarsDA(TCSVT2022) Multibranch Adversarial Regression for Domain Adaptative Hand Pose Estimation [paper link][based on
RegDA
,hand datasets (RHD→H3D)
,It applies a teacher-student approach to edit RegDA
] -
👍C-GAC(ECCV2022) Domain Adaptive Hand Keypoint and Pixel Localization in the Wild [paper link][arxiv link][project link][based on
Stacked Hourglass
,all compared methods are reproduced by the author
,no code is available
] -
DM-HPE(CVPR2023) Cross-Domain 3D Hand Pose Estimation With Dual Modalities [paper link][
No code is available
,cross-domain semi-supervised hand pose estimation
,Dual Modalities
]
belonging to the Domain Adaptive Regression (DGA)
or Semi-Supervised Rotation Regression
problem
-
PADACO(ICCV2019) Deep Head Pose Estimation Using Synthetic Images and Partial Adversarial Domain Adaption for Continuous Label Spaces [paper link][code|official][An adversarial training approach based on
domain adversarial neural networks
is used to force the extraction of domain-invariant features] -
👍Gaze360(ICCV2019) Gaze360: Physically Unconstrained Gaze Estimation in the Wild [paper link][arxiv link][project link][
dataset Gaze360
,Domain Adaptive Gaze Estimation
] -
few_shot_gaze(ICCV2019 oral) Few-Shot Adaptive Gaze Estimation [paper link][arxiv link][code|official][
Domain Adaptive Gaze Estimation
] -
DeepDAR(SpringerBook2020) Deep Domain Adaptation for Regression [paper link][pdf link][
Domain Adaptive Regression (DGA)
theory,Age Estimation
andHead Pose Estimation
][book titleDevelopment and Analysis of Deep Learning Architectures
] -
DAGEN(ACCV2020) Domain Adaptation Gaze Estimation by Embedding with Prediction Consistency [paper link][arxiv link][
Eye Gaze Estimation
] -
(FG2021) Relative Pose Consistency for Semi-Supervised Head Pose Estimation [paper link][pdf link][
Semi-Supervised
] -
PnP-GA(ICCV2021) Generalizing Gaze Estimation With Outlier-Guided Collaborative Adaptation [paper link][arxiv link][code|official][
Domain Adaptive Gaze Estimation
] -
👍RSD(ICML2021) Representation Subspace Distance for Domain Adaptation Regression [paper link][code|official][
Domain Adaptive Regression (DGA)
theory,Mingsheng Long
, datasets dSprites(a standard 2D synthetic dataset for deep representation learning) and MPI3D(a simulation-to-real dataset of 3D objects)] -
DINO-INIT & DINO-TRAIN(NIPS2022) Distribution-Informed Neural Networks for Domain Adaptation Regression [paper link][
Domain Adaptive Regression (DGA)
theory] -
SynGaze(CVPRW2022) Learning-by-Novel-View-Synthesis for Full-Face Appearance-Based 3D Gaze Estimation [paper link][arxiv link][
The University of Tokyo
,Eye Gaze Estimation
, No code] -
RUDA(CVPR2022) Generalizing Gaze Estimation With Rotation Consistency [paper link][
Eye Gaze Estimation
, No code] -
CRGA(CVPR2022) Contrastive Regression for Domain Adaptation on Gaze Estimation [paper link][
SJTU
,Eye Gaze Estimation
, No code] -
(TBIOM2023) Domain Adaptation for Head Pose Estimation Using Relative Pose Consistency [paper link]
-
AdaptiveGaze(arxiv2023.05) Domain-Adaptive Full-Face Gaze Estimation via Novel-View-Synthesis and Feature Disentanglement [arxiv link][code|official][
The University of Tokyo
,Eye Gaze Estimation
] -
👍DARE-GRAM(CVPR2023) DARE-GRAM: Unsupervised Domain Adaptation Regression by Aligning Inverse Gram Matrices [paper link][code|official][HPE domain transfer test for Male --> Female on
BIWI
dataset] -
(AAAI2023) Learning a Generalized Gaze Estimator from Gaze-Consistent Feature [paper link]
-
👍UnReGA(CVPR2023) Source-Free Adaptive Gaze Estimation by Uncertainty Reduction [paper link][paperswithcode link][code|official (not released)]
-
PnP-GA+(TPAMI2023) PnP-GA+: Plug-and-Play Domain Adaptation for Gaze Estimation using Model Variants [paper link][
Domain Adaptive Gaze Estimation
, extended based onPnP-GA(ICCV2021)
]
-
pose-hg-3d(ICCV2017) Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach [paper link][code|official][
3D keypoints detection
,weakly-supervised domain adaptation with a 3D geometric constraint-induced loss
] -
3DKeypoints-DA(ECCV2018) Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency [paper link][arxiv link][code|official][
It utilizes view-consistency to regularize predictions from unlabeled target domain in 3D keypoints detection, but depth scans and images from different views are required on the target domain
] -
(ACMMM2019) Unsupervised Domain Adaptation for 3D Human Pose Estimation [paper link][
3D keypoints detection
] -
(CVPR2020) Weakly-Supervised 3D Human Pose Learning via Multi-View Images in the Wild [paper link][arxiv link][
NVIDIA
,It focuses on unlabelled multi-view images
,Self-supervised learning for 3D human pose estimation
] -
AdaptPose(CVPR2022) AdaptPose: Cross-Dataset Adaptation for 3D Human Pose Estimation by Learnable Motion Generation [paper link][
3D keypoints detection
] -
FewShot3DKP(CVPR2023) Few-Shot Geometry-Aware Keypoint Localization [paper link][project link][
Few-Shot Learning
,3D Keypoint Localization
,human faces, eyes, animals, cars, and never-before-seen mouth interior (teeth) localization tasks
] -
ACSM-Plus(CVPR2023) Learning Articulated Shape With Keypoint Pseudo-Labels From Web Images [paper link][
2D Keypoints for downstream application
,3D Reconstruction / Shape Recovery from 2D images
] -
PoseDA (ICCV2023) Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation [paper link][arxiv link][code|official][
ZJU
] -
👍3D-Pose-Transfer (ICCV2023) Weakly-supervised 3D Pose Transfer with Keypoints [paper link][arxiv link][project link][code|official][
National University of Singapore
] -
UAO(arxiv2024.02) Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation [arxiv link][
Peking University, Shenzhen
]
-
DataDistill, Pseudo-Labeling, PL(CVPR2018) Data Distillation: Towards Omni-Supervised Learning [paper link][arxiv link][
Omni-Supervised Learning
,a special regime of semi-supervised learning
, with taskshuman keypoint detection
andgeneral object detection
] -
MONET(ICCV2019) MONET: Multiview Semi-Supervised Keypoint Detection via Epipolar Divergence [paper link][arxiv link][code|official][
University of Minnesota
, multi-view inputs] -
Pose_DomainAdaption(ACMMM2020) Alleviating Human-level Shift: A Robust Domain Adaptation Method for Multi-person Pose Estimation [paper link][Codes|PyTorch (not available)][(TMM2022 extended journal version) Structure-enriched Topology Learning for Cross-domain Multi-person Pose estimation]
-
⭐SSKL(ICLR2021) Semi-supervised Keypoint Localization [openreview link][arxiv link][code|official][author Olga Moskvyak's homepage][
single hand datasets
,single person datasets
,Semi-Supervised Keypoint Localization
] -
⭐Semi_Human_Pose(ICCV2021) An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation [paper link][arxiv link][codes|official PyTorch][
Semi-Supervised 2D Human Pose Estimation
] -
👍❤RegDA(CVPR2021) Regressive Domain Adaptation for Unsupervised Keypoint Detection [paper link][project library][code|official][
hand datasets (RHD→H3D)
,human datasets (SURREAL→Human3.6M, SURREAL→LSP)
][ResNet101 + Simple Baseline
][based on the DA classification method disparity discrepancy (DD) (ICML2019, authors including Mingsheng Long and Michael Jordan)][It utilizes one shared feature extractor and two separate regressors
;It made changes in DD for human and hand pose estimation tasks, which measures discrepancy by estimating false predictions on the target domain
] -
👍HPE-AdaptOR(arxiv2021.08)(Medical Image Analysis2022) Unsupervised domain adaptation for clinician pose estimation and instance segmentation in the operating room [paper link][arxiv link][code|official]
-
TransPar(TIP2022) Learning Transferable Parameters for Unsupervised Domain Adaptation [paper link][arxiv link][evaluation on tasks
image classification
andregression tasks (keypoint detection)
][hand datasets (RHD→H3D)
,It emphasizes transferable parameters using a similar structure as RegDA which has one shared feature extractor and two separate regressors
] -
👍❤UniFrame, UDA_PoseEstimation(ECCV2022) A Unified Framework for Domain Adaptive Pose Estimation [paper link][arxiv link][code|official][
hand datasets (RHD→H3D)
,human datasets (SURREAL→Human3.6M, SURREAL→LSP)
,animal datasets (SynAnimal→TigDog, SynAnimal→AnimalPose)
, based onRegDA
][ResNet101 + Simple Baseline
][AdaIN (ICCV2017)for image style transfer
+Mean Teacher for student model updating
;It modifies the classic Mean-Teacher model by combining it with style transfer AdaIN
] -
⭐iart-semi-pose(ACMMM2022) Semi-supervised Human Pose Estimation in Art-historical Images [arxiv link][code|official][
Germany
,Semi-Supervised 2D Human Pose Estimation
] -
⭐PLACL(ICLR2022) Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization [openreview link][arxiv link][author Sheng Jin's homepage][
Semi-Supervised Keypoint Localization
, backboneHRNet-w32
,Curriculum Learning
+Reinforcement Learning
, slightly better thanSSKL(ICLR2020)
][largely based on(Curriculum-Labeling, AAAI2021) Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning
] -
ADHNN(AAAI2022) Adaptive Hypergraph Neural Network for Multi-person Pose Estimation [paper link][Codes|PyTorch (not available)]
-
(WACV2022) Transfer Learning for Pose Estimation of Illustrated Characters [paper link][arxiv link][codes|official PyTorch]
-
CD_HPE(ICASSP2022) Towards Accurate Cross-Domain in-Bed Human Pose Estimation [paper link][arxiv link][code|official]
-
EdgeTrans4Mark(ECCV2022) One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement [paper link][arxiv link][code|official][
PKU
,Landmark Localization
,Medical Image
] -
⭐SSPCM(CVPR2023) Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module [paper link][arxiv link][code|official][
Semi-Supervised 2D Human Pose Estimation
] -
SCAI(self-correctable and adaptable inference)(CVPR2023) Self-Correctable and Adaptable Inference for Generalizable Human Pose Estimation [paper link][arxiv link][
Domain Generalization
][It works as a play-in-plug for top-down human pose estimation methods like SimpleBaseline and HRNet
, the same author ofSCIO
] -
Full-DG(full-view data generation)(TNNLS2023) Overcoming Data Deficiency for Multi-Person Pose Estimation [paper link][Full-DG can help improve pose estimators’
robustness
andgeneralizability
] -
MAPS(arxiv2023.02) MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection [arxiv link][code|official][
hand datasets (RHD→H3D)
,human datasets (SURREAL→LSP)
,animal datasets (SynAnimal→TigDog, SynAnimal→AnimalPose)
, based onRegDA
andUniFrame
] -
ImSty(Implicit Stylization)(ICLRW2023) Implicit Stylization for Domain Adaptation [openreview link][pdf link][workshop homepage]
-
⭐SF-DAPE(ICCV2023) Source-free Domain Adaptive Human Pose Estimation [paper link][arxiv link][code|official][
Source-free Domain Adaptation
,hand datasets (RHD→H3D, RHD→FreiHand)
,human datasets (SURREAL→Human3.6M, SURREAL→LSP)
] -
POST(ICCV2023) Prior-guided Source-free Domain Adaptation for Human Pose Estimation [paper link][arxiv link][
Source-free Domain Adaptation
,Self-training
,human datasets (SURREAL→Human3.6M, SURREAL→LSP)
] -
Pseudo-Heatmaps(arxiv2023.10) Denoising and Selecting Pseudo-Heatmaps for Semi-Supervised Human Pose Estimation [arxiv link][based on the
DualPose (ICCV2021)
, do not compare withSSPCM(CVPR2023)
] -
MDSs(arxiv2023.10)(under review in ICLR2024) Modeling the Uncertainty with Maximum Discrepant Students for Semi-supervised 2D Pose Estimation [arxiv link][code|official][based on the
DualPose (ICCV2021)
, do not compare withSSPCM(CVPR2023)
]
Large Language Model / Large Vision Model / Vision-Language Model for Human / Animals / Anything
-
👍CLAMP(CVPR2023) CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose [paper link][arxiv link][code|official][
CLIP
,Tao Dacheng
, trained and tested on datasetAP-10K
, also seeAPT-36K
] -
PoseFix(ICCV2023) PoseFix: Correcting 3D Human Poses with Natural Language [paper link][arxiv link][code|official]
-
UniAP(arxiv2023.08) UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning [arxiv link][
CLIP
,ZJU
,Few-shot Learning
,various perception tasks including pose estimation, segmentation, and classification tasks
] -
KDSM(arxiv2023.10) Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face [arxiv link][
CLIP
,XJU + Shanghai AI Lab
,Open-Vocabulary Keypoint Detection
] -
UniPose(arxiv2023.10)(under review in ICLR2024) UniPose: Detecting Any Keypoints [openreview link][arxiv link][project link][code|official][
IDEA-Research
,using visual or textual prompts
] -
VLPosee(arxiv2024.02) VLPose: Bridging the Domain Gap in Pose Estimation with Language-Vision Tuning [arxiv link][by
CUHK
,Language-Vision Model
, on datasetsCOCO
andHumanArt
][VLPose leverages the synergy betweenlanguage
andvision
to extend thegeneralization
androbustness
of pose estimation models beyond the traditional domains.]
Motion Synthesis / Motion Diffusion Model
-
MoFusion(CVPR2023) MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis [paper link][arxiv link][project link][code is not avaliable][
MPII
] -
GMD(ICCV2023) Guided Motion Diffusion for Controllable Human Motion Synthesis [paper link][arxiv link][project link][code|official][
ETH
] -
PhysDiff(ICCV2023 Oral) PhysDiff: Physics-Guided Human Motion Diffusion Model [paper link][arxiv link][project link][code is not avaliable][
NVIDIA
] -
InterDiff(ICCV2023) InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion [paper link][arxiv link][project link][code|official][
University of Illinois at Urbana-Champaign
,Human-Object Interactions
] -
OmniControl(arxiv2023.10) OmniControl: Control Any Joint at Any Time for Human Motion Generation [arxiv link][project link][code|official][
Northeastern University + Google Research
]