Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What is the requirements for the data folder #7

Closed
nguyenminhduc9988 opened this issue Mar 19, 2015 · 13 comments
Closed

What is the requirements for the data folder #7

nguyenminhduc9988 opened this issue Mar 19, 2015 · 13 comments

Comments

@nguyenminhduc9988
Copy link

When I try to import data, I have this problem. My folder contain sub folder for each class. I am not sure if there are other requirements for the data folder. If there is any requirements for the data folder, could you please inform me of it

ERROR: no images added

@lukeyeager
Copy link
Member

So, there's a little tooltip next to the input field that says this:

Indicate a folder which holds subfolders full of images. Each subfolder should be named according to the desired label for the images that it holds. Can also be a URL for an apache/nginx auto-indexed folder.

That's admittedly a little cryptic. I used to have more detailed instructions on there, but they were cluttering up the page. Here is the folder layout that DIGITS is looking for:

/path/to/images/
├── cat
│   ├── 1.jpg
│   └── 2.JPEG
└── dog
    ├── 01.png
    └── 02.jpeg

So, you would enter /path/to/images in the "training images" field, and DIGITS would find two subfolders, "cat" and "dog," and use those as category names. Each category would have two images in it. Obviously, you'll want more than two images per category 😉

@lukeyeager
Copy link
Member

@ducminhnguyen, were you able to build your dataset?

@nguyenminhduc9988
Copy link
Author

I have try to layout the folder as you said, digit detect the folder and the number of image right but when it starts building the dataset file the same error occurs.

@lukeyeager lukeyeager added the bug label Mar 20, 2015
@lukeyeager
Copy link
Member

Ok, then there might be something else going wrong. I'm filing this as a bug for poor error-catching. I have plans for improving the error catching in this area, so that you get a better idea of what's going wrong.

In the meantime, what is your Pillow version? Run this: pip list | grep pillow -i. You should see that Pillow is at least 2.3.0. I've seen someone with an outdated version run into an issue like yours.

@nguyenminhduc9988
Copy link
Author

digits error

This is the bug I was getting.
I am running Pillow (2.3.0)

@nguyenminhduc9988
Copy link
Author

I have also try to add data from the Mnist example in the web installer but it get the same error

@nguyenminhduc9988
Copy link
Author

I finnally manage to import my files. The problem with it is that I use the original caffe for digits when I switch to NVIDIA/caffe it works perfectly.
Caffe is changing alot, do you have any plan to flow up on the changes or just keep the NVIDIA/caffe as it is now

@sersajur
Copy link

Is it possible to create dataset from .bmp images?
I've just tried, but failed with error: "ERROR: Did not find two valid categories".

@lukeyeager
Copy link
Member

Is it possible to create dataset from .bmp images?

Image formats other than JPEG and PNG are not currently supported. But it is trivial to add support for more. I'm opening a new issue for this, as it is unrelated to @ducminhnguyen's problem.

@cicero19
Copy link

cicero19 commented May 4, 2015

Just wondering if 16bit PNGs are also accepted? Thanks.

@lukeyeager
Copy link
Member

I haven't tried it myself, but DIGITS should load the images just fine. However, they will be converted to 8-bits before saving to the database or passing to caffe for testing. See here:
https://github.com/NVIDIA/DIGITS/blob/v1.1.0/digits/utils/image.py#L125-L126

Is that a problem? Would you like to work with 16-bit data? Have you done that with caffe before?

@cicero19
Copy link

cicero19 commented May 5, 2015

Shouldn't be a huge issue (i think). I am using it for medical images (DICOM), which tend to be 16 bit greyscale. Good to know. I'll probably stick to 8bit.

@homah
Copy link

homah commented Nov 16, 2016

hi, I could load mnist and train ,test that dataset. now I find medical Xray dataset, and build folder train and test and their classes and labels like mnist also it is in the path that mnist is there. but when I want to create dataset in digit, it cant create dataset and show this error message"ERROR: you do not have read access to folder "/home/homay/Xray/train". how can I solve this problem?
Xray is the folder and in xray there are train and test folders. in each of them there are 0and 1 folders that contains images. and in each test and train folder there are labels.txt and train.txt and test.txt.
please help me. :((((((

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants