
How to train on own dataset? #3

Closed
gauthsvenkat opened this issue Jan 16, 2019 · 11 comments

Comments

@gauthsvenkat

I want to try training LCFCN on my own dataset. What are the things I should be looking at (images, annotations, etc.) to train the model on my own dataset?

@IssamLaradji
Collaborator

IssamLaradji commented Jan 16, 2019

The __getitem__ function in the dataset loaders, such as datasets/trancos.py, shows you what LCFCN and its loss expect. They expect the following items:

    return {"images": image, "points": points,
            "counts": counts, "index": index,
            "image_path": self.path + name + ".jpg"}

where images is an RGB image with shape (1, 3, H, W); points is a matrix with a single point for each object and has shape (1, H, W); counts holds the count for each category with shape (1, K); and index is the image id. Here:

H: the image height
W: the image width
K: the number of classes
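
As a rough illustration, a custom loader's __getitem__ could look like the sketch below. This is my own sketch rather than code from the repo; the class name MyDataset, the constructor arguments, and the point-map construction are placeholders to adapt to your annotation format:

    from PIL import Image
    import torch
    from torch.utils.data import Dataset

    # Hypothetical sketch, not repo code; assumes transform includes ToTensor().
    class MyDataset(Dataset):
        def __init__(self, path, image_names, num_classes=1, transform=None):
            self.path = path
            self.image_names = image_names   # e.g. the lines of image_sets/train.txt
            self.num_classes = num_classes
            self.transform = transform

        def __len__(self):
            return len(self.image_names)

        def __getitem__(self, index):
            name = self.image_names[index]
            image = Image.open(self.path + name + ".jpg").convert("RGB")
            image = self.transform(image)                # (3, H, W) float tensor

            H, W = image.shape[1], image.shape[2]
            points = torch.zeros(1, H, W, dtype=torch.long)
            # ... set points[0, y, x] = class_id for one pixel per object ...

            # counts[0, c-1] = number of annotated points of class c
            counts = torch.stack([(points == c).sum()
                                  for c in range(1, self.num_classes + 1)]).view(1, -1)

            return {"images": image.unsqueeze(0), "points": points,
                    "counts": counts, "index": index,
                    "image_path": self.path + name + ".jpg"}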

You can create a file like trancos.py for your dataset and then load it for training. Let me know if you need help with this part. Cheers!

@gauthsvenkat
Author

First of all, thanks a ton for the fast reply!

I'll explore the file and try to reverse-engineer it as much as possible. I've never worked with PyTorch, so this is pretty new to me.

When you say a single point for each object... does that mean something like the center point of each object? For example,

    0 0 0 0
    0 1 0 0
    0 0 0 0
    0 0 0 0

would mean that the 1 corresponds to the center of an object?

If that's the case, I have the four coordinates for each object (I annotated them because I tried to solve this as an object detection challenge), so I could just take the centroid, right?

@IssamLaradji
Collaborator

Happy to help!

When you say a single point for each object... does that mean something like the center point of each object?

Yes, you can take the center of the object as a single point, just like the example you showed. The value of the point represents the class of the object.
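
For illustration, converting box annotations into such a point map could look like this sketch (my code, not the repo's; the (x1, y1, x2, y2) box format is an assumption about your annotation layout):

    import torch

    # Hypothetical helper: place one point per object at the box centroid.
    def boxes_to_points(boxes, class_ids, H, W):
        points = torch.zeros(1, H, W, dtype=torch.long)
        for (x1, y1, x2, y2), c in zip(boxes, class_ids):
            cx = min(int((x1 + x2) / 2), W - 1)   # clamp to stay inside the image
            cy = min(int((y1 + y2) / 2), H - 1)
            points[0, cy, cx] = c                 # the pixel value encodes the class
        return points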

@gauthsvenkat
Author

Okay, I've gone through the trancos.py file and mostly understand what's happening.

  1. What are the .mat files that are being loaded?
  2. Also, what are the .txt files present in the images directory? (I'm looking at the TRANCOS dataset.)
  3. What exactly is the transform function?

@IssamLaradji
Collaborator

  1. The .mat files are binary matrices that represent the regions of interest in the image. Not all datasets have them; for example, shanghai.py doesn't.

  2. The .txt files contain the paths to the images, which are used to load an image at every iteration when __getitem__ is called.

  3. The transform functions are used to flip, rotate, and/or normalize the image. Normalization is important if you are using a pretrained network like ResNet, which expects a specific input distribution (see the sketch below).
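
For example, a minimal transform for a pretrained backbone might be the following (a sketch using torchvision; the repo's actual transforms may differ):

    import torchvision.transforms as transforms

    # Sketch: the ImageNet statistics that pretrained torchvision backbones
    # such as ResNet expect; not necessarily what the LCFCN repo uses.
    transform = transforms.Compose([
        transforms.ToTensor(),                    # HWC uint8 -> CHW float in [0, 1]
        transforms.Normalize(mean=[0.485, 0.456, 0.406],
                             std=[0.229, 0.224, 0.225]),
    ])

Note that geometric transforms such as flips also have to be applied to the points matrix, otherwise the annotations no longer line up with the objects.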

@gauthsvenkat
Author

  1. So I don't necessarily need to have regions of interest included in my training images, right?

  2. I get the .txt files in image_sets/*.txt, but what about the .txt files in images/ that have the same names as the images? They have some numbers in them.

@IssamLaradji
Collaborator

IssamLaradji commented Jan 18, 2019

  1. You are right, you don't need regions of interest for your training images;
  2. The image_sets files specify which images are used for training, validation, and testing. So for training, you only load the images listed in image_sets/train.txt from images/ (see the snippet below).
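
Concretely, building the training list could be as simple as the following sketch (the file layout follows the TRANCOS convention described above):

    # Read the image names listed in image_sets/train.txt; each name is then
    # resolved to images/<name>.jpg inside __getitem__.
    with open("image_sets/train.txt") as f:
        train_names = [line.strip() for line in f if line.strip()]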

@gauthsvenkat
Author

gauthsvenkat commented Jan 21, 2019

Man, the last few days I've been breaking my head over this. I don't exactly "get" the loss function (all four losses) or how you implemented it in torch. I was hoping that if I understood the loss function, I could rewrite it in Keras (which I'm comfortable with). Is there maybe another source (like a blog post or an article) that explains how you practically implemented the loss (and the entire model in general)?

Thanks a ton for helping out!

(I'm also closing this, since you did solve the actual issue.)

@IssamLaradji
Collaborator

You're welcome! Feel free to open another issue where I can explain each part of the loss and/or architecture for you.

I don't think there is another source yet, but I am planning to create a blog post on this at some point. Sorry :(
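
In the meantime, here is a rough sketch of the first two terms (the image-level and point-level losses) as described in the LCFCN paper. This is my own illustrative PyTorch, not the repo's implementation, and it omits the split-level and false-positive terms, which need the blob/watershed machinery:

    import torch
    import torch.nn.functional as F

    # Sketch of two of the four LCFCN loss terms; the split-level and
    # false-positive terms are omitted.
    # logits: (1, K+1, H, W) raw scores, class 0 = background (assumption)
    # points: (1, H, W) long tensor, 0 = unlabeled, c = a point of class c
    def image_and_point_loss(logits, points):
        probs = F.softmax(logits, dim=1)
        probs_flat = probs.view(probs.shape[1], -1)        # (K+1, H*W)

        # Image-level: the background and every class with a point must appear
        # somewhere in the image; classes with no points must not fire anywhere.
        present = set(points.unique().tolist()) | {0}
        loss = logits.new_zeros(())
        for c in range(probs.shape[1]):
            pmax = probs_flat[c].max()
            if c in present:
                loss = loss - torch.log(pmax.clamp(min=1e-8))
            else:
                loss = loss - torch.log((1 - pmax).clamp(min=1e-8))

        # Point-level: plain cross-entropy at the annotated pixels only.
        mask = points.squeeze(0) != 0
        if mask.any():
            target = points.squeeze(0)[mask]               # (P,) class ids
            pix = probs.squeeze(0)[:, mask].t()            # (P, K+1)
            loss = loss - torch.log(
                pix.gather(1, target.unsqueeze(1)).clamp(min=1e-8)).mean()

        return loss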

@tongpinmo

  1. You are right, you don't need regions of interest for your training images;
  2. The image_sets files specify which images are used for training, validation, and testing. So for training, you only load the images listed in image_sets/train.txt from images/

I have the images and the point files; should I produce the dots.png and .mat files?

@tongpinmo

  1. The .mat files are binary matrices that represent the regions of interest in the image. Not all datasets have them; for example, shanghai.py doesn't.
  2. The .txt files contain the paths to the images, which are used to load an image at every iteration when __getitem__ is called.
  3. The transform functions are used to flip, rotate, and/or normalize the image. Normalization is important if you are using a pretrained network like ResNet, which expects a specific input distribution.

Actually, in shanghai.py, line 45, there are .mat files. So what did you mean by "shanghai.py doesn't have that"?
