Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What does "ilsvrc12_train.txt" entail? #2

Open
BJTUJia opened this issue Mar 15, 2018 · 4 comments
Open

What does "ilsvrc12_train.txt" entail? #2

BJTUJia opened this issue Mar 15, 2018 · 4 comments

Comments

@BJTUJia
Copy link

BJTUJia commented Mar 15, 2018

Dear author, thanks a lot for your contribution! However, I can't find the file ilsvrc12_train.txt anywhere. I guess it's the training images list. I am new to deep learning and not familiar with Imagenet either. Can you kindly attach the files?

@BJTUJia BJTUJia closed this as completed Mar 15, 2018
@BJTUJia BJTUJia reopened this Mar 16, 2018
@topiaruss
Copy link
Contributor

topiaruss commented Jun 28, 2018

for a tar of tars of images:
http://www.image-net.org/challenges/LSVRC/2012/nonpub-downloads
Untar the tars, then untar those tars to get the images, record the imagenames.

Roughly this:
After download you will have ISVRC2012_img_train.tar
$ mkdir ISVRC2012_img_train
$ cd ISVRC2012_img_train
$ tar xvf ../ISVRC2012_img_train.tar
$ find . -name *tar -exec tar xvf {} ; > ilsvrc12_train.txt

that may take a while. At the end you will have all the filenames in the .txt file.

You should then run produce_small_data.py. I got this working in python3 on the master branch of my fork. There is no need for a class label in the txt file, since we are doing self-supervised learning.

@chho-work
Copy link

Sir, thanks for sharing the code!
I have the same question about "ilsvrc12_train.txt". I am not using the original Imanaget1K dataset, but a custom Imagenet dataset with fewer classes and images. So, I didn't download the tar files as you mentioned above.
Suppose I have a train dir with 100K jpg image files, all in one single dir. The file names are similar to these:
ILSVRC2012_val_00000907.JPEG
ILSVRC2012_val_00002769.JPEG
n02086240_105.JPEG
n02086240_126.JPEG
etc...

Then, the "custom_ilsvrc12_train.txt" that I need to create should contain a list of 100K file names. Each file name should be listed in each line without coma or space. Is this correct?
Your prompt response will be appreciated! Thanking you in advance!

@siyuan2018
Copy link

Sir, thanks for sharing the code!
I have the same question about "ilsvrc12_train.txt". I am not using the original Imanaget1K dataset, but a custom Imagenet dataset with fewer classes and images. So, I didn't download the tar files as you mentioned above.
Suppose I have a train dir with 100K jpg image files, all in one single dir. The file names are similar to these:
ILSVRC2012_val_00000907.JPEG
ILSVRC2012_val_00002769.JPEG
n02086240_105.JPEG
n02086240_126.JPEG
etc...

Then, the "custom_ilsvrc12_train.txt" that I need to create should contain a list of 100K file names. Each file name should be listed in each line without coma or space. Is this correct?
Your prompt response will be appreciated! Thanking you in advance!

Hi, may I ask how you solved the problem? I have the same question here. Thank you!

@Aiiiiii
Copy link

Aiiiiii commented Dec 14, 2020

Dear author, thanks a lot for your contribution! However, I can't find the file ilsvrc12_train.txt anywhere. I guess it's the training images list. I am new to deep learning and not familiar with Imagenet either. Can you kindly attach the files?

Any update for this?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants