
feature pyramids #555

Open
ghost opened this issue Mar 25, 2018 · 5 comments

Comments


ghost commented Mar 25, 2018

In YOLOv3 you use the concept from feature pyramid networks. Can you explain why feature pyramids like those in DSSD and RetinaNet outperform feature pyramids like the one in SSD for object detection (not segmentation)? Thanks!

@pjreddie (Owner)

Feature pyramids use backwards connections from later layers to merge information with earlier layers for predicting detections at higher resolution. This allows them to use both strong semantic signals from later layers and fine grained spatial features from earlier layers. Pretty cool design!
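The merge described above can be sketched in a few lines of NumPy: upsample the coarse, semantically strong map back to the earlier layer's resolution, then combine the two. The shapes and values here are illustrative, not the actual YOLOv3 layer dimensions; YOLOv3 merges by concatenation (a route layer), while the original FPN paper adds the maps after a 1x1 lateral convolution.

```python
import numpy as np

# Hypothetical backbone feature maps (channels, height, width):
# a coarse, semantically strong map and an earlier, spatially fine map.
coarse = np.random.rand(256, 13, 13).astype(np.float32)
fine = np.random.rand(128, 26, 26).astype(np.float32)

def upsample2x(x):
    """Nearest-neighbour 2x upsampling along the spatial axes."""
    return x.repeat(2, axis=1).repeat(2, axis=2)

# Backwards connection: bring the coarse map up to the fine map's
# resolution, then merge by concatenation (YOLOv3-style route).
merged = np.concatenate([upsample2x(coarse), fine], axis=0)

print(merged.shape)  # (384, 26, 26)
```

Predictions made on `merged` see both the later layer's semantics and the earlier layer's spatial detail, which is the point of the backwards connection.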


ghost commented Mar 26, 2018

I have a question: with the same clustering method on the same dataset, YOLOv3's anchor scales come out many times larger than YOLOv2's. Is there any way to make them have the same scale?

@pjreddie (Owner)

So in YOLOv2 I made some design-choice errors: I made the anchor box size relative to the feature map size in the last layer. Since the network downsamples by 32, anchors were in units of 32 pixels, so a 9x9 anchor was actually 288px x 288px.

In YOLOv3 anchor sizes are actual pixel values. This simplifies a lot of stuff and was only a little bit harder to implement.
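The unit change above is just a multiplication by the stride, so anchors clustered in YOLOv2 units can be converted to YOLOv3 pixel units directly. A small sketch (the anchor values here are illustrative, not the shipped VOC/COCO anchors):

```python
STRIDE = 32  # total downsampling factor of the network

# YOLOv2-style anchors (width, height) in final-feature-map cell units.
v2_anchors = [(1.19, 1.98), (4.41, 3.42), (9.0, 9.0)]

# YOLOv3-style anchors are the same shapes expressed in pixels.
v3_anchors = [(w * STRIDE, h * STRIDE) for w, h in v2_anchors]

print(v3_anchors[-1])  # (288.0, 288.0) -- the 9x9 example from above
```

This is why anchors clustered for YOLOv3 look "many times larger" than YOLOv2's on the same dataset: they describe the same boxes, just in pixels instead of 32-pixel cells.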


xiayq1 commented Jan 23, 2019

Sorry to bother you. In my training I set random=0, using VOC data, and trained for only 10,000 steps; YOLO can detect the objects. But if I set random=1, also for 10,000 steps, YOLO can't detect anything.
I'm also confused about why YOLO is trained with different input sizes. What benefit does varying the input size bring to YOLO?
If I use a fixed-size input, I would think that's better than using multiple input sizes.
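For context on what random=1 does: darknet periodically resizes the network to a new square input that is a multiple of 32 (the network stride), which trains the detector to work across scales at the cost of slower, less stable early training. A sketch of that resolution schedule, assuming the 320-608 range used with a 416x416 base config (the function name is mine, not darknet's):

```python
import random

def pick_input_size(rng, lo=320, hi=608, step=32):
    """Return a random square input resolution, a multiple of `step`."""
    return rng.randrange(lo // step, hi // step + 1) * step

rng = random.Random(0)
# darknet re-picks roughly every 10 batches; here, 5 draws for show.
sizes = [pick_input_size(rng) for _ in range(5)]

print(sizes)  # each size is in [320, 608] and divisible by 32
```

A fixed-size input does converge faster, which may explain the difference at only 10,000 steps; the multi-scale schedule pays off in robustness when test images arrive at varied resolutions.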

@computervisionlearner

> So in YOLOv2 I made some design-choice errors: I made the anchor box size relative to the feature map size in the last layer. Since the network downsamples by 32, anchors were in units of 32 pixels, so a 9x9 anchor was actually 288px x 288px.
>
> In YOLOv3 anchor sizes are actual pixel values. This simplifies a lot of stuff and was only a little bit harder to implement.
Hi, I guess YOLOv3 anchor sizes are relative to the network size in the cfg file, e.g. 608*608, not to the original image size, right?
During multi-scale training process (as),
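On the scale question: the anchors in the cfg are pixels at the network input size, and YOLOv3 decodes a predicted box width as the anchor width scaled by exp of the raw output, expressed as a fraction of the network input. A sketch of that decode (same form as the YOLOv3 box equations; the function name is mine):

```python
import math

def decode_width(t_w, anchor_w, net_w):
    """Box width as a fraction of the network input width."""
    return anchor_w * math.exp(t_w) / net_w

# With t_w = 0 the prediction is exactly the anchor width.
print(decode_width(0.0, 116.0, 608))  # ~0.1908, i.e. 116 / 608
```

Because the decoded width is a fraction of the network input, mapping it back to the original image is a separate rescaling step, so the anchors themselves never reference the original image size.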
