Question about MVImageNet Dataset (3.4 TB) #98

trungpx · 2024-06-12T15:55:39Z

Dear authors,

Could you please tell me how many images have been used for MVImageNet?

Their website states: "MVImgNet contains 6.5 million frames from 219,188 videos, the total size is about 3.4 TB." So I am wondering: did you use the full 3.4 TB dataset to train AnyDoor and achieve the reported performance?

XavierCHEN34 · 2024-06-13T09:12:46Z

No, we only use the subset with segmentation masks.

trungpx · 2024-06-13T10:03:49Z

Thanks so much for your reply. Could you help clarify one point of confusion below?

In the MVImageNet paper, Table 1 lists 104,261 segmentations.

Figure 1. MVImageNet paper

In the AnyDoor paper, Table 1 lists the following:

Figure 2. AnyDoor paper

This suggests that AnyDoor used all 104,261 segmentations, which correspond to the 219,188 videos. Is that correct?
Could you share an estimate of how many videos were used, so that I can download only the relevant ones? I looked at their dataset, and it contains many very large files, so downloading all of them would be very heavy.

Figure 3. Dataset download page
