Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about MVImageNet Dataset (3.4 TB) #98

Open
trungpx opened this issue Jun 12, 2024 · 2 comments
Open

Question about MVImageNet Dataset (3.4 TB) #98

trungpx opened this issue Jun 12, 2024 · 2 comments

Comments

@trungpx
Copy link

trungpx commented Jun 12, 2024

Dear authors,

Could you please tell me how many images have been used for MVImageNet?

It is said on their web that: "MVImgNet contains 6.5 million frames from 219,188 videos, the total size is about 3.4 TB." So I just wondering have you used this huge full data (3.4 TB) to train AnyDoor to achieve the reported performance?

@XavierCHEN34
Copy link
Collaborator

No, we only use the subset with segmentation masks

@trungpx
Copy link
Author

trungpx commented Jun 13, 2024

Thanks so much for your reply. Could you help to elaborate more a confusion below?

In the paper MVImageNet, Table 1 lists 104,261 segmentations.
image
Figure 1. MVImageNet paper

In AnyDoor paper, Table 1 lists as follows:
image
Figure 2. AnyDoor paper

It means that AnyDoor used full 104,261 segmentations which corresponding to 219,188 videos. Is it correct?
Could you share an estimated number of videos have been used so that I can download the proper ones? Since I looked up their datasets, it contains a lot of huge files, really heavy if download all of them.

image
Figure 3. Dataset download page

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants