Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation #24915

Open
2 tasks done
amyeroberts opened this issue Jul 19, 2023 · 4 comments
Open
2 tasks done

Comments

@amyeroberts
Copy link
Collaborator

Model description

ViTPose is used in 2D human pose estimation, a subset of the keypoint detection task #24044

It provides a simple baseline for vision transformer-based human pose estimation. It utilises a pretrained vision transformer backbone to extract features and a simple decoder head to process the extracted features. Despite no elaborate designs in the model, ViTPose obtained state-of-the-art (SOTA) performance of 80.9 AP on the MS COCO Keypoint test-dev set.

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

Code and weights: https://github.com/ViTAE-Transformer/ViTPose
Paper: https://arxiv.org/abs/2204.12484

@Annbless

@ydshieh
Copy link
Collaborator

ydshieh commented Jul 20, 2023

Glad you get something different to work on 🚀 👀 🎉

@shauray8
Copy link
Contributor

Hi, @amyeroberts, I don't know if you are working on this but if not I would be more than happy to take it up.

@ydshieh
Copy link
Collaborator

ydshieh commented Jul 20, 2023

Oh, this is the issue page, not the PR page!

@amyeroberts
Copy link
Collaborator Author

@shauray8 You're very welcome to take this up! :)

This model presents a new task for the library, so there might be some iterations and discussions on what the inputs and outputs should look like. The model translation should be fairly straightforward though, so I'd suggest starting with a PR that implements that and then on the PR we can figure out what works best.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants