Skip to content

Images @ 224x336 resolution

Compare
Choose a tag to compare
@woctezuma woctezuma released this 20 Jul 13:12
· 48 commits to main since this release
7466780

On July 20, 2023, images were downloaded and resized from 300x450 to 224x336 with img2dataset:

!img2dataset --url_list=urls.txt --image_size=224 --resize_mode=keep_ratio

Images are stored in two archives due to Github's maximal file size being set to 2 GB:

  • images_partA.tar.gz
  • images_partB.tar.gz

App IDs found in apps.json were these used to create urls.txt.
The matching between image indices and app IDs is entirely managed by apps.json. Keep these app IDs as a reference!


App IDs found in filtered_apps.json are these for which images were successfully downloaded.

Indices appear in image filenames, so it is more convenient to handle indices when trimming the image dataset.
The indices of filtered app IDs can be found in filtered_indices.json, and are such that:

for index, app_id in zip(filtered_indices, filtered_apps, strict=True):
  assert apps[index] == app_id