TorchImageNet

ImageNet downloader and PyTorch Dataset implementation in PyTorch.

Requirements: Please see requirements.yml for details. To install new environment called imagenet, run the following command:

> conda env create -f requirements.yml

Download imagenet data

Downloading imagenet samples works by running the script download_imagenet_images.py [num_images]. It will download the number of images specified by first downloading image urls from the ImageNet API, then randomly shuffeling all the urls, and finally downloading from these urls until [num_images] were successfully downloaded.

> python download_imagenet_images.py 100

Will download 100 images to a subdirectory with the name images.

It takes quite a while... so let it run over night ;-).

Use Data Set

To use the dataset, add the parent directory of the project to the sys.path list. For example, if you cloned this repo into /foo/bar, i.e., this repo is located at /foo/bar/torch_imagenet, then /foo/bar should be in your path:

import sys
sys.append('/foo/bar')

Afterwards, you can import the ImageNetDataset as follows:

from torch_imagenet import ImageNetDataset

Note that some of the downloaded images may be gray scale and thus only have one channel. In such cases, the dataset may get some hickups. To fix this, you can run

> python identify_bad_images.py

This script will generate a pickle file with all the filenames of images that does not have exactly three channels. The ImageNetDataset will automatically pick up the file next time the dataset is used.

To see the code in action, please see this example code.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
images		images
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
download_imagenet_images.py		download_imagenet_images.py
flickr_empty.png		flickr_empty.png
identify_bad_images.py		identify_bad_images.py
imagenet_dataset.py		imagenet_dataset.py
imagenet_label_mapping		imagenet_label_mapping
requirements.yml		requirements.yml
synset_id_to_class		synset_id_to_class

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TorchImageNet

Download imagenet data

Use Data Set

About

Releases

Packages

Languages

License

fhvilshoj/torch_imagenet

Folders and files

Latest commit

History

Repository files navigation

TorchImageNet

Download imagenet data

Use Data Set

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages