Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

New datasets for human pose estimation #6698

Open
carandraug opened this issue Oct 4, 2022 · 6 comments
Open

New datasets for human pose estimation #6698

carandraug opened this issue Oct 4, 2022 · 6 comments

Comments

@carandraug
Copy link

carandraug commented Oct 4, 2022

馃殌 The feature

We propose to add the following datasets for human pose estimation to torchvision:

Motivation, pitch

These are some of our most downloaded datasets and we're trying to make them more useful by providing DataLoaders built into torchvision. Torchvision already includes some of our datasets (OxfordIIITPet, VOCDetection, VOCSegmentation, Flowers102, DTD, and FGVCAircraft) so we hope that the proposed new datastes would be accepted.

Alternatives

No response

Additional context

No response

cc @pmeier @NicolasHug

@carandraug
Copy link
Author

To make it clear, I'm proposing to do this work (and I've actually already started). The question here is if there's interested on the torchvision side to include it (and if that comes with any conditions).

carandraug added a commit to carandraug/pytorch-vision that referenced this issue Oct 5, 2022
carandraug added a commit to carandraug/pytorch-vision that referenced this issue Oct 5, 2022
carandraug added a commit to carandraug/pytorch-vision that referenced this issue Oct 5, 2022
@bjuncek
Copy link
Contributor

bjuncek commented Oct 11, 2022

Hi @pmeier , do you have input on these?

Usually, the requirements for datasets are number of citations and papers used, combined with the most recent usage (vaguely).
Having said this, I believe we might be pausing the implementation until we get the prototype datasets are ready to replace the current implementations?

Some of these (namely VOC) are a staple, but they are also quite old...

@pmeier
Copy link
Collaborator

pmeier commented Oct 12, 2022

Usually, the requirements for datasets are number of citations and papers used, combined with the most recent usage (vaguely). Having said this, I believe we might be pausing the implementation until we get the prototype datasets are ready to replace the current implementations?

Correct. Unfortunately, datasets v2 takes longer than we anticipated. It makes little sense to implement wrappers for these datasets now just to supersede them in the near future.

Still, our original plan was to support quite a few more datasets than we had before. Since we currently don't have any datasets for human pose estimation, we could start with these. cc @NicolasHug

@carandraug is it ok to ping you if we finally managed to bring datasets v2 to a stable-ish state? If yes, could you provide a link to the paper than introduced the dataset as well as a citation count, for example from Google Scholar.

Plus, since I'm not familiar with the task, could either @carandraug or @bjuncek explain what the output of the dataset would look like?

@carandraug
Copy link
Author

carandraug commented Oct 12, 2022

@carandraug is it ok to ping you if we finally managed to bring datasets v2 to a stable-ish state? If yes, could you provide a link to the paper than introduced the dataset as well as a citation count, for example from Google Scholar.

Yes, that is fine.

  • TV Human Interactions Dataset:

  • Human Pose Evaluator dataset (this is actually two datasets in one):

  • Pose estimation datasets (these are 5 different datasets):

    • dataset webpage
    • This is a bit tricky because it's split over a few papers but I can sort this up once I get to these datasets)
    • Number of citations: 100 + 573 + 173 + 79 + 40

From the number of citations of the paper it doesn't look very impressive. However, we're basing the decision of doing this for these datasets on the number of downloads from our servers. We were surprised by the high number of downloads in certain old datasets which we assumed were no longer relevant. We believe that they're mainly used in teaching because when we take them down temporarily for maintenance we mainly received emails from undergrad students doing assignments and TA preparing lectures.

@datumbox
Copy link
Contributor

One additional thing to keep in mind while discussing Pose estimation is that our Transforms V2 don't currently offer support for Keypoints. This is something we should factor in while prioritizing work.

@pmeier
Copy link
Collaborator

pmeier commented Oct 12, 2022

There is #5326 that I could revisit. #5326 (comment) states

we'll put this PR on hold until we have full support for bounding boxes and segmentation masks.

We do have support for this now, but I think priorities have changed in the mean time.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants