Skip to content

Latest commit

 

History

History
205 lines (200 loc) · 104 KB

humans-face-body-pose-gesture-movement.md

File metadata and controls

205 lines (200 loc) · 104 KB

CVPR-2023-Papers

Application App
New collections Conference

Humans: Face, Body, Pose, Gesture, Movement

Section Papers Preprint Papers Papers with Open Code Papers with Video

Title Repo Paper Video
Micron-BERT: BERT-Based Facial Micro-Expression Recognition GitHub thecvf
arXiv
YouTube
NIKI: Neural Inverse Kinematics With Invertible Neural Networks for 3D Human Pose and Shape Estimation GitHub thecvf
arXiv
YouTube
A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation thecvf
Executing Your Commands via Motion Diffusion in Latent Space GitHub Page
GitHub
thecvf
arXiv
YouTube
MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID GitHub thecvf
arXiv
Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation GitHub thecvf
arXiv
Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation GitHub thecvf
arXiv
Dynamic Aggregated Network for Gait Recognition GitHub thecvf YouTube
Object Pop-Up: Can We Infer 3D Objects and Their Poses From Human Interactions Alone? GitHub Page
GitHub
thecvf
arXiv
YouTube
Unsupervised Sampling Promoting for Stochastic Human Trajectory Prediction GitHub thecvf
arXiv
YouTube
ECON: Explicit Clothed Humans Optimized via Normal Integration
CVPR - Highlight
GitHub Page
GitHub
thecvf
arXiv
YouTube
Neuron Structure Modeling for Generalizable Remote Physiological Measurement GitHub thecvf
arXiv
YouTube
Continuous Sign Language Recognition With Correlation Network GitHub thecvf
arXiv
YouTube
Parametric Implicit Face Representation for Audio-Driven Facial Reenactment GitHub thecvf
arXiv
YouTube
CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model GitHub thecvf
arXiv
PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation GitHub thecvf
arXiv
YouTube
3D Human Mesh Estimation From Virtual Markers GitHub thecvf
arXiv
YouTube
3D Human Pose Estimation via Intuitive Physics GitHub thecvf
arXiv
YouTube
ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation GitHub thecvf
arXiv
YouTube
Generating Holistic 3D Human Motion From Speech GitHub Page
GitHub
thecvf
arXiv
HARP: Personalized Hand Reconstruction From a Monocular RGB Video GitHub thecvf
arXiv
Learning Locally Editable Virtual Humans GitHub Page
GitHub
thecvf
arXiv
YouTube
Reconstructing Signing Avatars From Video Using Linguistic Priors GitHub Page
GitHub
thecvf
arXiv
YouTube
DrapeNet: Garment Generation and Self-Supervised Draping GitHub thecvf
arXiv
YouTube
X-Avatar: Expressive Human Avatars GitHub Page
GitHub
thecvf
arXiv
YouTube
Hi4D: 4D Instance Segmentation of Close Human Interaction GitHub thecvf
arXiv
YouTube
Vid2Avatar: 3D Avatar Reconstruction From Videos in the Wild via Self-Supervised Scene Decomposition GitHub Page
GitHub
thecvf
arXiv
YouTube
CloSET: Modeling Clothed Humans on Continuous Surface With Explicit Template Decomposition GitHub thecvf
arXiv
YouTube
Graphics Capsule: Learning Hierarchical 3D Face Representations From 2D Images thecvf
arXiv
Rethinking the Learning Paradigm for Dynamic Facial Expression Recognition thecvf
arXiv
YouTube
HandNeRF: Neural Radiance Fields for Animatable Interacting Hands thecvf
arXiv
YouTube
Relightable Neural Human Assets From Multi-View Gradient Illuminations GitHub Page thecvf
arXiv
YouTube
Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training
CVPR - Highlight
GitHub thecvf
arXiv
YouTube
DeFeeNet: Consecutive 3D Human Motion Prediction With Deviation Feedback thecvf
arXiv
BioNet: A Biologically-Inspired Network for Face Recognition GitHub thecvf YouTube
Boosting Detection in Crowd Analysis via Underutilized Output Features GitHub thecvf
arXiv
YouTube
Learning Analytical Posterior Probability for Human Mesh Recovery GitHub thecvf YouTube
Listening Human Behavior: 3D Human Pose Estimation With Acoustic Signals GitHub thecvf YouTube
Detecting and Grounding Multi-Modal Media Manipulation GitHub thecvf
arXiv
YouTube
RelightableHands: Efficient Neural Relighting of Articulated Hand Models GitHub Page thecvf
arXiv
MEGANE: Morphable Eyeglass and Avatar Network GitHub Page thecvf
arXiv
YouTube
SunStage: Portrait Reconstruction and Relighting Using the Sun as a Light Stage GitHub thecvf
arXiv
YouTube
TryOnDiffusion: A Tale of Two UNets GitHub thecvf
arXiv
YouTube
Semi-Supervised Hand Appearance Recovery via Structure Disentanglement and Dual Adversarial Discrimination thecvf
arXiv
YouTube
POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery GitHub thecvf
arXiv
YouTube
Scene-Aware Egocentric 3D Human Pose Estimation GitHub thecvf
arXiv
YouTube
PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation With Progressive Video Transformers thecvf
arXiv
YouTube
Trajectory-Aware Body Interaction Transformer for Multi-Person Pose Forecasting GitHub thecvf
arXiv
YouTube
A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation From a Single RGB Image GitHub thecvf
arXiv
YouTube
TRACE: 5D Temporal Regression of Avatars With Dynamic Cameras in 3D Environments GitHub Page
GitHub
thecvf
arXiv
YouTube
Skinned Motion Retargeting With Residual Perception of Motion Semantics & Geometry GitHub thecvf
arXiv
YouTube
Generating Human Motion From Textual Descriptions With Discrete Representations GitHub Page
GitHub
thecvf
arXiv
YouTube
Learning Human Mesh Recovery in 3D Scenes GitHub thecvf
arXiv
AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction GitHub Page thecvf
arXiv
YouTube
3D-Aware Face Swapping GitHub thecvf YouTube
Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos GitHub thecvf
arXiv
YouTube
GFPose: Learning 3D Human Pose Prior With Gradient Fields GitHub thecvf
arXiv
YouTube
Rethinking Feature-Based Knowledge Distillation for Face Recognition thecvf
One-Stage 3D Whole-Body Mesh Recovery With Component Aware Transformer GitHub thecvf
arXiv
YouTube
Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization thecvf YouTube
Ego-Body Pose Estimation via Ego-Head Pose Estimation
CVPR - Award
GitHub Page
GitHub
thecvf
arXiv
YouTube
TOPLight: Lightweight Neural Networks With Task-Oriented Pretraining for Visible-Infrared Recognition thecvf
StyleIPSB: Identity-Preserving Semantic Basis of StyleGAN for High Fidelity Face Swapping GitHub thecvf YouTube
Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues
CVPR - Highlight
thecvf YouTube
FLEX: Full-Body Grasping Without Full-Body Grasps GitHub thecvf
arXiv
YouTube
EDGE: Editable Dance Generation From Music GitHub Page
GitHub
thecvf
arXiv
YouTube
Complete 3D Human Reconstruction From a Single Incomplete Image thecvf YouTube
Zero-Shot Pose Transfer for Unrigged Stylized 3D Characters GitHub Page thecvf
arXiv
YouTube
Hand Avatar: Free-Pose Hand Animation and Rendering From Monocular Video GitHub thecvf
arXiv
YouTube
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes GitHub thecvf
arXiv
YouTube
Learning Neural Proto-Face Field for Disentangled 3D Face Modeling in the Wild thecvf
CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose GitHub thecvf
arXiv
YouTube
Invertible Neural Skinning GitHub Page thecvf
arXiv
YouTube
DiffusionRig: Learning Personalized Priors for Facial Appearance Editing GitHub Page
GitHub
thecvf
arXiv
YouTube
Harmonious Feature Learning for Interactive Hand-Object Pose Estimation GitHub thecvf YouTube
Leapfrog Diffusion Model for Stochastic Trajectory Prediction GitHub thecvf
arXiv
YouTube
NeuFace: Realistic 3D Neural Face Rendering From Multi-View Images GitHub thecvf
arXiv
YouTube
DiffSwap: High-Fidelity and Controllable Face Swapping via 3D-Aware Masked Diffusion GitHub thecvf
GFIE: A Dataset and Baseline for Gaze-Following From 2D to 3D in Indoor Environments GitHub Page
GitHub
thecvf YouTube
Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition From Egocentric RGB Videos GitHub thecvf
arXiv
YouTube
Decompose More and Aggregate Better: Two Closer Looks at Frequency Representation Learning for Human Motion Prediction thecvf YouTube
Human Pose As Compositional Tokens GitHub thecvf
arXiv
YouTube
Normal-Guided Garment UV Prediction for Human Re-Texturing
CVPR - Highlight
thecvf
arXiv
YouTube
Dynamic Graph Learning With Content-Guided Spatial-Frequency Relation Reasoning for Deepfake Detection thecvf YouTube
VGFlow: Visibility Guided Flow Network for Human Reposing thecvf
arXiv
Mutual Information-Based Temporal Difference Learning for Human Pose Estimation in Video thecvf
arXiv
YouTube
PREIM3D: 3D Consistent Precise Image Attribute Editing From a Single Image GitHub thecvf
arXiv
YouTube
HuManiFlow: Ancestor-Conditioned Normalising Flows on SO(3) Manifolds for Human Pose and Shape Distribution Estimation GitHub thecvf
arXiv
YouTube
Implicit Identity Driven Deepfake Face Swapping Detection GitHub thecvf YouTube
Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion GitHub thecvf
arXiv
YouTube
3D-Aware Facial Landmark Detection via Multi-View Consistent Training on Synthetic Data thecvf YouTube
SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments GitHub Page
GitHub
thecvf
arXiv
YouTube
Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation thecvf
arXiv
YouTube
AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation GitHub thecvf
arXiv
YouTube
UDE: A Unified Driving Engine for Human Motion Generation GitHub Page
GitHub
thecvf
arXiv
YouTube
CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior GitHub thecvf
arXiv
YouTube
Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module GitHub thecvf
arXiv
YouTube
Learning Personalized High Quality Volumetric Head Avatars From Monocular RGB Videos GitHub Page thecvf
arXiv
YouTube
HOOD: Hierarchical Graphs for Generalized Modelling of Clothing Dynamics GitHub thecvf
arXiv
YouTube
ACR: Attention Collaboration-Based Regressor for Arbitrary Two-Hand Reconstruction GitHub thecvf
arXiv
YouTube
HumanBench: Towards General Human-Centric Perception With Projector Assisted Pretraining GitHub thecvf
arXiv
YouTube
CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions GitHub Page thecvf
arXiv
YouTube
Human Pose Estimation in Extremely Low-Light Conditions GitHub Page
GitHub
thecvf
arXiv
DistilPose: Tokenized Pose Regression With Heatmap Distillation GitHub thecvf
arXiv
YouTube
Human Body Shape Completion With Implicit Shape and Flow Learning thecvf YouTube
Source-Free Adaptive Gaze Estimation by Uncertainty Reduction GitHub thecvf YouTube
Music-Driven Group Choreography GitHub Page
GitHub
thecvf
arXiv
YouTube
Robust Model-Based Face Reconstruction Through Weakly-Supervised Outlier Segmentation GitHub thecvf
arXiv
YouTube
MARLIN: Masked Autoencoder for Facial Video Representation LearnINg GitHub thecvf
arXiv
YouTube
Transformer-Based Unified Recognition of Two Hands Manipulating Objects GitHub thecvf YouTube
Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization GitHub thecvf
arXiv
YouTube
ScarceNet: Animal Pose Estimation With Scarce Annotations GitHub thecvf
arXiv
YouTube
FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction GitHub thecvf
arXiv
YouTube
MoDi: Unconditional Motion Synthesis From Diverse Data GitHub thecvf
arXiv
YouTube
Feature Representation Learning With Adaptive Displacement Generation and Transformer Fusion for Micro-Expression Recognition thecvf
arXiv
YouTube
MeMaHand: Exploiting Mesh-Mano Interaction for Single Image Two-Hand Reconstruction thecvf
arXiv
YouTube
Stimulus Verification Is a Universal and Effective Sampler in Multi-Modal Human Trajectory Prediction GitHub thecvf YouTube
TokenHPE: Learning Orientation Tokens for Efficient Head Pose Estimation via Transformers GitHub thecvf YouTube
Handy: Towards a High Fidelity 3D Hand Shape and Appearance Model GitHub Page
GitHub
thecvf YouTube
CIRCLE: Capture in Rich Contextual Environments GitHub Page
GitHub
thecvf
arXiv
YouTube
Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention GitHub thecvf
arXiv
YouTube
Implicit Neural Head Synthesis via Controllable Local Deformation Fields GitHub Page thecvf
arXiv
YouTube
Continuous Intermediate Token Learning With Implicit Motion Manifold for Keyframe Based Motion Interpolation GitHub thecvf
arXiv
JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking WEB Page thecvf
arXiv
STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection GitHub thecvf
arXiv
GM-NeRF: Learning Generalizable Model-Based Neural Radiance Fields From Multi-View Images GitHub thecvf
arXiv
YouTube
Decoupled Multimodal Distilling for Emotion Recognition
CVPR - Highlight
GitHub thecvf
arXiv
YouTube
HaLP: Hallucinating Latent Positives for Skeleton-Based Self-Supervised Learning of Actions GitHub thecvf
arXiv
YouTube
ReDirTrans: Latent-to-Latent Translation for Gaze and Head Redirection thecvf YouTube
QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation
CVPR - Highlight
GitHub thecvf
arXiv
YouTube
Multi-Modal Gait Recognition via Effective Spatial-Temporal Feature Fusion thecvf YouTube
Probabilistic Knowledge Distillation of Face Ensembles thecvf YouTube
Learning Semantic-Aware Disentangled Representation for Flexible 3D Human Body Editing GitHub thecvf YouTube
Parameter Efficient Local Implicit Image Function Network for Face Segmentation thecvf
arXiv
HumanGen: Generating Human Radiance Fields With Explicit Priors thecvf
arXiv
YouTube
Biomechanics-Guided Facial Action Unit Detection Through Force Modeling thecvf
Decoupling Human and Camera Motion From Videos in the Wild GitHub thecvf
arXiv
YouTube
Overcoming the Trade-Off Between Accuracy and Plausibility in 3D Hand Shape Reconstruction thecvf
arXiv
YouTube
Instant-NVR: Instant Neural Volumetric Rendering for Human-Object Interactions From Monocular RGBD Stream GitHub Page
GitHub
thecvf
arXiv
YouTube
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation GitHub thecvf
arXiv
YouTube
Analyzing and Diagnosing Pose Estimation With Attributions thecvf YouTube
Unsupervised Visible-Infrared Person Re-Identification via Progressive Graph Matching and Alternate Learning GitHub thecvf
Shape-Erased Feature Learning for Visible-Infrared Person Re-Identification GitHub thecvf
arXiv
Distilling Cross-Temporal Contexts for Continuous Sign Language Recognition thecvf
Avatars Grow Legs: Generating Smooth Human Motion From Sparse Tracking Inputs With Diffusion Model GitHub thecvf
arXiv
Local Connectivity-Based Density Estimation for Face Clustering GitHub thecvf YouTube
SelfME: Self-Supervised Motion Learning for Micro-Expression Recognition thecvf YouTube
Detecting Human-Object Contact in Images GitHub Page thecvf
arXiv
YouTube
Controllable Light Diffusion for Portraits GitHub thecvf
arXiv
YouTube
InstantAvatar: Learning Avatars From Monocular Video in 60 Seconds GitHub thecvf
arXiv
YouTube
NeMo: Learning 3D Neural Motion Fields From Multiple Video Instances of the Same Action
CVPR - Highlight
GitHub Page
GitHub
thecvf
arXiv
YouTube
Privacy-Preserving Adversarial Facial Features GitHub thecvf
arXiv
YouTube
Self-Correctable and Adaptable Inference for Generalizable Human Pose Estimation thecvf
arXiv
YouTube
DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment GitHub thecvf
arXiv
YouTube
Clothed Human Performance Capture With a Double-Layer Neural Radiance Fields GitHub Page thecvf YouTube
Continuous Landmark Detection With 3D Queries thecvf YouTube
Learning a 3D Morphable Face Reflectance Model From Low-Cost Data GitHub thecvf
arXiv
YouTube
AUNet: Learning Relations Between Action Units for Face Forgery Detection GitHub thecvf YouTube
3D Human Pose Estimation With Spatio-Temporal Criss-Cross Attention GitHub thecvf YouTube
Implicit 3D Human Mesh Recovery Using Consistency With Pose and Shape From Unseen-View thecvf
arXiv
YouTube
3D Human Keypoints Estimation From Point Clouds in the Wild Without Human Labels thecvf
arXiv
YouTube
Multi-Label Compound Expression Recognition: C-EXPR Database & Network thecvf YouTube
FlexNeRF: Photorealistic Free-Viewpoint Rendering of Moving Humans From Sparse Views GitHub Page thecvf
arXiv
YouTube
Two-Stage Co-Segmentation Network Based on Discriminative Representation for Recovering Human Mesh From Videos thecvf
Co-Speech Gesture Synthesis by Reinforcement Learning With Contrastive Pre-Trained Rewards GitHub thecvf YouTube
FeatER: An Efficient Network for Human Reconstruction via Feature Map-based TransformER GitHub Page
GitHub
thecvf
arXiv
YouTube