autoupdate paper list
-
Updated
Jun 2, 2024 - Python
autoupdate paper list
Project Page for Paper "Deep Learning-Based Object Pose Estimation: A Comprehensive Survey"
A deep learning framework for multi-animal pose tracking.
Accelerated pose estimation and tracking using semi-supervised convolutional networks.
Deep learned, NVIDIA-accelerated 3D object pose estimation
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
The collection of pre-trained, state-of-the-art AI models for ailia SDK
RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.
This repository contains an implementation of a deep learning approach for yoga pose classification using Convolutional Neural Networks (CNN) and MediaPipe for body keypoint detection. The project aims to classify various yoga poses with high accuracy and low latency, making it suitable for real-world applications.
🤗 image matching toolbox webui
This project is meant to give a concise overview of the YOLO models family. Following a series of projects, we'll go through YOLO's development history, technical breakthroughs, use cases, and practical projects. Although I plan to use several packages, our main focus will be the Ultralytics package and its API.
We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensembl…
Image Classification, Object Detection, Image Segmentation, Instance Segmentation and Pose Estimation
專題分類動作的程式
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
Code for the paper - 'Leveraging Monocular Infrastructure Cameras for Collaborative Multi-View Perception for Indoor Autonomous Mobile Robots'
This is a multi-task neural network that utilizes the GELAN backbone and ViT encoder to perform multiple tasks, maximizing multi-class classification performance.
Add a description, image, and links to the pose-estimation topic page so that developers can more easily learn about it.
To associate your repository with the pose-estimation topic, visit your repo's landing page and select "manage topics."