The repository contains the official implementation for the paper "TAIHRI: Task-Aware 3D Human Keypoints Localization for Close-Range Human-Robot Interaction".
- demo/
- demo.py — 2D/3D keypoint inference (MLLM)
- eval/
- eval.py — evaluation script
- eval_wrapper/ — wrappers, parsers, task definitions, visualization
- demo_script.sh / eval_script.sh — runnable examples
- Linux
- Python 3.10+
- CUDA-capable GPU (recommended for vLLM / FlashAttention)
pip install torch==2.8.0 torchdata==0.11.0 torchvision==0.23.0
pip install -r requirements.txt

You may download the released checkpoints from Hugging Face and place them under ./checkpoints.
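As a sketch, the checkpoints could be fetched with the Hugging Face CLI; the repository id below is a placeholder, not the actual release location — substitute the repo named in the release notes.

```shell
# Install the Hugging Face CLI, then download checkpoints into ./checkpoints.
# NOTE: <org>/<checkpoint-repo> is a placeholder, not the real repo id.
pip install -U "huggingface_hub[cli]"
huggingface-cli download <org>/<checkpoint-repo> --local-dir ./checkpoints
```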
Note: This repository references external modules (e.g., model weights, optional packages). Make sure they are available in your environment.
Run the provided script:
bash demo_script.sh

Run the provided evaluation script:
bash eval_script.sh

Key options:
- `--model_path`: local path or Hugging Face repo
- `--backend`: `transformers` or `vllm`
- `--focal_length`, `--princpt_x`, `--princpt_y`: camera intrinsics (focal length and principal point)
- `--input_path`, `--output_path`: input/output paths
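For reference, a direct invocation with these options might look like the following sketch; every value (model path, intrinsics, input/output paths) is an illustrative placeholder, not a default from the repository.

```shell
# Illustrative evaluation run; all values below are placeholders.
# --backend accepts "transformers" or "vllm".
python eval/eval.py \
  --model_path ./checkpoints \
  --backend vllm \
  --focal_length 1500.0 \
  --princpt_x 960.0 \
  --princpt_y 540.0 \
  --input_path ./data/test \
  --output_path ./outputs
```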
This project builds on and is inspired by several excellent open-source projects and tools, including:
- Rex-Omni for the Qwen-VL SFT code base.
- Qwen3-VL for the Qwen3-VL fine-tuning and inference code.
- vLLM for efficient inference acceleration.
- SAM-3D-Body for 3D body mesh recovery and visualization in the demo pipeline.
We also thank the community contributors and dataset providers who make research and evaluation possible.
See the LICENSE file for details.