Add RCNNLossLayer, RCNNDetectionLayer for Faster(er) R-CNN #3496

guoshengCS · 2017-08-15T06:21:21Z

Add RCNNLossLayer, RCNNDetectionLayer for Faster(er) R-CNN.

qingqing01

@pkuyym Please help to review.

qingqing01 · 2017-08-23T11:38:29Z

paddle/gserver/layers/DetectionUtil.cpp

+                  size_t topK,
+                  real confThreshold,
+                  real nmsThreshold,
+                  vector<size_t>* indices) {


I see applyNMSFast is similar with SSD:

void applyNMSFast(const vector<NormalizedBBox>& bboxes, const real* confScoreData, size_t classIdx, size_t topK, real confThreshold, real nmsThreshold, size_t numPriorBBoxes, size_t numClasses, vector<size_t>* indices) { vector<pair<real, size_t>> scores; for (size_t i = 0; i < numPriorBBoxes; ++i) { size_t confOffset = i * numClasses + classIdx; if (confScoreData[confOffset] > confThreshold) scores.push_back(std::make_pair(confScoreData[confOffset], i)); } // ... }

觉得可以写成下面，依据confScoreData来判断：

void applyNMSFast(const vector<NormalizedBBox>& bboxes, size_t topK, real confThreshold, real nmsThreshold, size_t numClasses, vector<size_t>* indices, const real* confScoreData, size_t classIdx, size_t numPriorBBoxes) { vector<pair<real, size_t>> scores; if (confScoreData) { for (size_t i = 0; i < numPriorBBoxes; ++i) { size_t confOffset = i * numClasses + classIdx; if (confScoreData[confOffset] > confThreshold) scores.push_back(std::make_pair(confScoreData[confOffset], i)); } } else { for (size_t i = 0; i < bboxes.size(); ++i) { scores.push_back(std::make_pair(bboxes[i].first, i)); } } // ... }

qingqing01 · 2017-08-23T11:52:54Z

paddle/gserver/layers/DetectionUtil.cpp

+  }
+}
+
+size_t getDetectionIndices(


目测和applyNMSFast类似，和SSD里的代码可复用~

qingqing01 · 2017-08-23T11:57:45Z

paddle/gserver/layers/DetectionUtil.cpp

+  decodedBBox.yMax = decodedBBoxCenterY + decodedBBoxHeight / 2;
+
+  return decodedBBox;
+}


上面代码也类似，不过为了方便调试收敛效果，代码优化也行。

qingqing01 · 2017-08-23T12:13:27Z

paddle/gserver/layers/RCNNDetectionLayer.cpp

+    std::vector<real> roiLocData(4);  // RoI location
+    for (size_t j = 0; j < 4; ++j) {
+      roiLocData[j] = *(roisData + n * roiDim + 1 + j);
+    }


int batchIdx = *(roisData + n * roiDim); std::vector<real> roiLocData(4); // RoI location for (size_t j = 0; j < 4; ++j) { roiLocData[j] = *(roisData + n * roiDim + 1 + j); }

==>

roisData += roiDim; int batchIdx = *roisData; std::vector<real> roiLocData(roisData+ 1, roisData+ 5);

std::vector的初始化：http://www.cplusplus.com/reference/vector/vector/vector/

qingqing01 · 2017-08-23T12:23:34Z

paddle/gserver/layers/RCNNDetectionLayer.cpp

+      for (size_t j = 0; j < 4; ++j) {
+        predLocData[j] = *(locPredData + n * numClasses_ * 4 + c * 4 + j);
+      }
+      real predConfData = *(confPredData + n * numClasses_ + c);


同样代码可以短一些：

locPredData += numClasses_ * 4; for (size_t c = 0; c < numClasses_; ++c) { if (c == backgroundId_) continue; std::vector<real> predLocData(locPredData + c * 4, locPredData + c * 4 + 4); real predConfData = *(confPredData + c); // ...

qingqing01 · 2017-08-23T12:28:30Z

paddle/gserver/layers/RCNNDetectionLayer.h

+ *          contains the prior-box data. The rest two input layers are
+ *          layers for generating bounding-box location offset and the
+ *          classification confidence.
+ * - Output: The predict bounding boxes.


The predicted bounding boxes.

qingqing01 · 2017-08-23T12:30:34Z

python/paddle/trainer/config_parser.py

@@ -1781,6 +1781,39 @@ def __init__(self, name, inputs, size, input_num, num_classes,
        self.config.size = size


+@config_layer('rcnn_loss')
+class RCNNLossLayer(LayerBase):
+    def __init__(self, name, inputs, loss_ratio, num_classes, background_id=0):


, **xargs):

qingqing01 · 2017-08-23T12:35:42Z

python/paddle/trainer_config_helpers/layers.py

+                    loss_ratio,
+                    num_classes,
+                    background_id=0,
+                    name=None):


it's better to add some comments for the difference between rnn_loss and multibox_loss, if you understand.

luotao1 · 2019-02-01T04:50:54Z

感谢您给PaddlePaddle贡献代码。由于Paddle V1/V2版本已不再维护，相关代码也已从develop分支上删除，因此关闭您的PR，欢迎您向Paddle最新版-Fluid贡献代码。
Thanks for contributing to PaddlePaddle! Since V1/V2 will not be maintained anymore, and related codes have been deleted from develop branch as well, we close this PR. Welcome to contribute to Fluid——the latest version of PaddlePaddle.

Add RCNNLossLayer, RCNNDetectionLayer for Faster(er) R-CNN

417a0b2

guoshengCS requested review from pkuyym, wanghaoshuang and qingqing01 August 15, 2017 06:21

fix bugs in test_RCNNDetection on GPU test

dfd3af6

qingqing01 reviewed Aug 23, 2017

View reviewed changes

llxxxll mentioned this pull request Oct 26, 2017

图像目标检测算法（需求） #5131

Closed

luotao1 closed this Feb 1, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add RCNNLossLayer, RCNNDetectionLayer for Faster(er) R-CNN #3496

Add RCNNLossLayer, RCNNDetectionLayer for Faster(er) R-CNN #3496

guoshengCS commented Aug 15, 2017

qingqing01 left a comment

qingqing01 Aug 23, 2017

qingqing01 Aug 23, 2017

qingqing01 Aug 23, 2017

qingqing01 Aug 23, 2017

qingqing01 Aug 23, 2017

qingqing01 Aug 23, 2017

qingqing01 Aug 23, 2017

qingqing01 Aug 23, 2017

luotao1 commented Feb 1, 2019

Add RCNNLossLayer, RCNNDetectionLayer for Faster(er) R-CNN #3496

Add RCNNLossLayer, RCNNDetectionLayer for Faster(er) R-CNN #3496

Conversation

guoshengCS commented Aug 15, 2017

qingqing01 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

luotao1 commented Feb 1, 2019