
If I wana detect small object, which args should I modify? #586

Closed
albertyou2 opened this issue May 22, 2017 · 19 comments

@albertyou2

@rbgirshick
Thanks for your good work!
I want to detect very small objects, like 10 * 10, but I don't know how to do this.
I have searched the whole issue list but didn't see any complete instructions for this task.
Could you please show me a way to do this, e.g. which args to modify?

Thank you

@djdam

djdam commented May 22, 2017

I am struggling with this as well, though my objects can be in a range from +/- 5 to 100 px with different ratios. Most of my initial effort was targeted at changing the parameters going into the anchor_target_layer and proposal_layer:

  • scales: decrease these values to account for smaller boxes (see the sketch after this list)
  • ratios: adjust them depending on the shape of your ground-truth boxes
  • feat_stride: supposedly this can be modified to improve the accuracy of the generated anchors
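
As a concrete illustration of the first two bullets, here is a minimal sketch using py-faster-rcnn's lib/rpn/generate_anchors.py; the scale/ratio values are just assumptions to show the idea, not tuned recommendations:

```python
# Hedged sketch: generate smaller anchors by passing smaller scales/ratios to
# py-faster-rcnn's generate_anchors (defaults: base_size=16, ratios=[0.5, 1, 2],
# scales=2**np.arange(3, 6), i.e. 8, 16, 32). Values below are illustrative.
import numpy as np
from rpn.generate_anchors import generate_anchors  # lib/ must be on PYTHONPATH

small_anchors = generate_anchors(base_size=16,
                                 ratios=[0.5, 1, 2],          # tune to your ground-truth box shapes
                                 scales=np.array([2, 4, 8]))  # smaller than the default 8, 16, 32
print(small_anchors)  # (9, 4) array of [x1, y1, x2, y2] anchors for one feature-map position
```

The same scales/ratios then have to reach the anchor_target_layer and proposal_layer (they read them from their Python layer parameters and fall back to the defaults otherwise), so that training and proposal generation use identical anchors.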

I made a script for analysing the images & boxes in your IMDB; it is available here:

https://github.com/djdam/faster-rcnn-scenarios/blob/master/src/analysis/imdb_analyse.py

@albertyou2
Author

@djdam wow that's cool!
The script really helps!
I will try your suggestions soon. Thank you very much!

@ravikantb

@albertyou2 @djdam: Reducing the value of the following configuration setting during training may help you create a model that detects small objects. Using 4 instead of the default 16 helped in my case.
https://github.com/rbgirshick/py-faster-rcnn/blob/master/lib/fast_rcnn/config.py#L118

You may also directly try changing the value at test time for your existing model at the following line. Not sure how good it would be though.
https://github.com/rbgirshick/py-faster-rcnn/blob/master/lib/fast_rcnn/config.py#L164
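
If it helps, here is a minimal sketch of overriding both values programmatically on the global cfg object (the same can be done through a YAML file merged with cfg_from_file); 4 just mirrors the value that worked above:

```python
# Hedged sketch: overriding RPN_MIN_SIZE on py-faster-rcnn's global config
# (lib/fast_rcnn/config.py). 4 follows the suggestion above; tune for your data.
from fast_rcnn.config import cfg

cfg.TRAIN.RPN_MIN_SIZE = 4  # default 16; proposals smaller than this are discarded by the RPN during training
cfg.TEST.RPN_MIN_SIZE = 4   # default 16; the same filter applied when generating proposals at test time
```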

@albertyou2
Author

Hi @ravikantb,
Thank you for your suggestion. And I am wondering: is RPN_MIN_SIZE the only parameter that affects the detection of small objects?

I have read some issues and articles, and I came across a parameter named something like 'min scale' or 'aspect ratio', but it's very difficult for me to find them.
Sorry if this question is stupid; as you can see, I'm very new to Caffe and DL.

@ravikantb

@albertyou2: My guess is that the papers where you read about 'min scale' and 'aspect ratio' were referring to the anchor box ratios and scales. You will need to change them as well (in addition to RPN_MIN_SIZE) to detect small objects. The default anchor boxes have 3 scales and 3 ratios (9 anchor boxes in total for every position of the feature map produced by the VGG/ZF convolution layers). The following method is called whenever a reference to the anchor boxes is needed:
https://github.com/rbgirshick/py-faster-rcnn/blob/master/lib/rpn/generate_anchors.py#L37

My suggestion would be to decrease the default ratios and scales. You can also add new ratios and scales in addition to the default ones, but then you will need to make some more changes in the prototxts to account for the change in RPN output dimensions triggered by the change in the number of anchor boxes (see the sketch below). Let me know if you need any help with that.
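
To make the "more changes in prototxts" part concrete, here is a rough sketch of the bookkeeping when the anchor count changes; the extra scales below are assumptions for illustration only:

```python
# Hedged sketch: changing the anchor ratios/scales changes the number of anchors
# per feature-map position, and the RPN heads in the train/test prototxts must be
# resized to match. Values are illustrative.
import numpy as np
from rpn.generate_anchors import generate_anchors

anchors = generate_anchors(base_size=16,
                           ratios=[0.5, 1, 2],
                           scales=np.array([2, 4, 8, 16, 32]))  # 5 scales instead of the default 3
num_anchors = anchors.shape[0]  # 3 ratios * 5 scales = 15

# In the prototxts (and any hard-coded reshape dims):
#   rpn_cls_score  num_output: 2 * num_anchors  (object vs. background score per anchor)
#   rpn_bbox_pred  num_output: 4 * num_anchors  (box regression deltas per anchor)
#   rpn_cls_prob_reshape: channel dim becomes 2 * num_anchors
print(num_anchors, 2 * num_anchors, 4 * num_anchors)  # 15 30 60
```

The anchor_target_layer and proposal_layer also need the new scales/ratios passed to them so the anchors they build match the network outputs.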

@albertyou2
Author

@ravikantb

Thank you. I will give it a try following your helpful suggestion!
Have a nice day!

@hito0512

hito0512 commented Jun 2, 2017

@djdam I ran imdb_analyse.py, but it raises an error: File "imdb_analyse.py", line 30, in get_statistics
for boxes, width, height, filename in [(entry['boxes'], entry['width'], entry['height'], entry['filename']) for entry in imdb.gt_roidb()]:
KeyError: 'width'
Can you help me?

@djdam

djdam commented Jun 2, 2017

@jiayandekafei

technicaldrawings_numbers_train is the name for my dataset. You need to change it to your own dataset's name.

About the width error: you need to modify your dataset factory Python file and add the missing keys ('width', 'height', etc.) to the roidb entries.

@djdam

djdam commented Jun 2, 2017

E.g., an example of a dataset factory is https://github.com/rbgirshick/py-faster-rcnn/blob/master/lib/datasets/pascal_voc.py . You probably have your own custom one. In the case of pascal_voc, you would need to add the 'height', 'width' and 'filename' keys in this line:

https://github.com/rbgirshick/py-faster-rcnn/blob/master/lib/datasets/pascal_voc.py#L220
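
For reference, here is a rough sketch of what that could look like inside pascal_voc's _load_pascal_annotation; this is not the repo's original code, and it assumes imdb_analyse.py wants the image dimensions (read with PIL, as the repo already does in lib/datasets/imdb.py):

```python
# Hedged sketch (inside pascal_voc._load_pascal_annotation, not standalone code):
# add the keys the analysis script reads to the returned roidb entry.
import PIL.Image

filename = self.image_path_from_index(index)   # JPEG path for this index
width, height = PIL.Image.open(filename).size  # image dimensions in pixels

return {'boxes': boxes,
        'gt_classes': gt_classes,
        'gt_overlaps': overlaps,
        'flipped': False,
        'seg_areas': seg_areas,
        'width': width,
        'height': height,
        'filename': filename}
```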

@hito0512

hito0512 commented Jun 2, 2017

@djdam I added it as follows:
return {'boxes': boxes,
        'width': (x2 - x1 + 1),
        'height': (y2 - y1 + 1),
        'filename': filename,
Is it right?

@djdam

djdam commented Jun 2, 2017

Looks good. Don't forget to delete the cached .pkl file.

@hito0512

hito0512 commented Jun 3, 2017

@djdam Thank you, have a nice day!

@mysayalHan

@djdam Did you try removing the 'pool4' layer in VGG16 for small objects? I got a low score. Considering that pooling is a kind of down-sampling, why was the result worse? Thanks

@liu09114

good job

@315386775

good job!

@xxc1005

xxc1005 commented Dec 13, 2017

@ravikantb, if I have changed the anchor ratios and scales, what else should I change to go with that? Should I change the feat_stride, or something else? Thanks

@ashnair1

ashnair1 commented Apr 9, 2019

Hey @ravikantb and @djdam. I had a question regarding your modifications for detecting small objects. You said that changing ratios and decreasing scales would be a good idea. Isn't it possible to just reduce the base size of anchors while keeping all other parameters the same? For example, say the default anchors are (32,64,128,256,512), couldn't I just change it to (8,16,32,64,128) or (4,8,16,32,64) to detect smaller objects since the aspect ratios and scales are relative to the anchor size?

@zhilaAI

zhilaAI commented Apr 23, 2019

Hey everybody, I have the same problem detecting very small objects, less than 10 pixels. Would this modification help me? I am using Faster R-CNN in TensorFlow.

@HamdiTarek

Hey @ravikantb and @djdam. I had a question regarding your modifications for detecting small objects. You said that changing ratios and decreasing scales would be a good idea. Isn't it possible to just reduce the base size of anchors while keeping all other parameters the same? For example, say the default anchors are (32,64,128,256,512), couldn't I just change it to (8,16,32,64,128) or (4,8,16,32,64) to detect smaller objects since the aspect ratios and scales are relative to the anchor size?

Yes, this works for me, even for objects smaller than 10 pixels. I used (4,8,16,32,64).
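
For anyone who wants to sanity-check this in py-faster-rcnn terms, here is a small sketch (values illustrative): shrinking the base size scales every generated anchor roughly proportionally, while the ratios and scales stay untouched.

```python
# Hedged sketch: in py-faster-rcnn's generate_anchors, a smaller base_size shrinks
# all anchors by roughly the same factor (up to rounding) with unchanged ratios/scales.
import numpy as np
from rpn.generate_anchors import generate_anchors

default = generate_anchors(base_size=16, ratios=[0.5, 1, 2], scales=np.array([8, 16, 32]))
small = generate_anchors(base_size=4, ratios=[0.5, 1, 2], scales=np.array([8, 16, 32]))

print(default[:, 2] - default[:, 0] + 1)  # anchor widths with base_size=16
print(small[:, 2] - small[:, 0] + 1)      # roughly 4x smaller widths with base_size=4
```

Frameworks that expose an explicit anchor size list (like the (32,64,128,256,512) quoted above) achieve the same effect by shrinking those sizes directly.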
