error when running script_rpn_pedestrian_VGG16_caltech.m for training rpn #9

Closed
brisker opened this issue Sep 29, 2016 · 16 comments

@brisker

brisker commented Sep 29, 2016

The error reads like this:
Loading region proposals...done
Warrning: no windows proposal is loaded !
Reference to non-existent field 'sizes'.

Error in roidb_from_caltech (line 52)
height = imdb.sizes(1,1);

Error in Dataset.caltech_trainval>@(x)x.roidb_func(x,false) (line 6)
dataset.roidb_train = cellfun(@(x) x.roidb_func(x, false), dataset.imdb_train, 'UniformOutput', false);

Error in Dataset.caltech_trainval (line 6)
dataset.roidb_train = cellfun(@(x) x.roidb_func(x, false), dataset.imdb_train, 'UniformOutput', false);

Error in script_rpn_pedestrian_VGG16_caltech (line 30)
dataset = Dataset.caltech_trainval(dataset, 'train');
Here are screenshots of the dataset. Have I got something wrong?
Thanks a lot! @zhangliliang
image
image
image

@zhangliliang
Owner

zhangliliang commented Sep 29, 2016

Hi.

Try manually deleting the files in imdb/cache; they may have been left over from an earlier run that did not finish successfully.
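Editor's sketch: the cleanup suggested above can also be scripted. The Python below is only an illustration; `clear_cache` is a hypothetical helper, not part of this repo, and it assumes the cache is the `imdb/cache` directory named in the reply, relative to wherever you run it. It defaults to a dry run, so nothing is deleted unless you ask for it.

```python
from pathlib import Path


def clear_cache(cache_dir, dry_run=True):
    """Remove regular files directly inside cache_dir and return their names.

    With dry_run=True (the default) nothing is deleted; the function only
    reports which files would be removed. Subdirectories are left alone.
    """
    cache = Path(cache_dir)
    if not cache.is_dir():
        return []
    removed = []
    for path in sorted(cache.iterdir()):
        if path.is_file():
            if not dry_run:
                path.unlink()
            removed.append(path.name)
    return removed


if __name__ == "__main__":
    # "imdb/cache" comes from the reply above; the working directory being
    # the repository root is an assumption.
    for name in clear_cache("imdb/cache", dry_run=True):
        print("would remove:", name)
```

Call `clear_cache("imdb/cache", dry_run=False)` once the listed files look right, then rerun the training script so the cache is regenerated.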

@brisker
Author

brisker commented Sep 29, 2016

@zhangliliang
Thanks for your reply! I am a little confused by the annotations of the Caltech pedestrian dataset, as I am new to this. If I want to train an RPN on my own dataset, I know that I need to provide the ground-truth bounding box coordinates. But set06_V000_I00029.txt looks like this:
% bbGt version=3
person-fa 172 184 10 26 0 0 0 0 0 0 0
person-fa 188 188 8 24 0 0 0 0 0 0 0

What do these zeros mean? If I want to use my own dataset, do I also need to provide annotations in this txt format?

@zhangliliang
Owner

@brisker

This file was extracted using Dollar's toolbox, so you can find the meaning of these numbers there.

To use your own dataset, you will probably need to write your own imdb and roidb generation scripts suited to your data format.
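Editor's sketch: to make the "bbGt version=3" lines quoted earlier easier to read, here is a small Python parser. The field layout it assumes (label, box as left/top/width/height, occlusion flag, visible box as left/top/width/height, ignore flag, angle) is my reading of the toolbox's bbGt documentation and should be checked against the toolbox itself. The class and function names are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class BbGtObject:
    label: str
    bb: Tuple[float, ...]          # assumed [left, top, width, height]
    occluded: bool                 # assumed occlusion flag
    bb_visible: Tuple[float, ...]  # assumed visible-region box, may be all 0
    ignore: bool                   # assumed "ignore this box" flag
    angle: float                   # assumed orientation in degrees


def parse_bbgt(text: str) -> List[BbGtObject]:
    """Parse bbGt-style text: one object per line, '%' lines are headers."""
    objects = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("%"):
            continue
        fields = line.split()
        if len(fields) != 12:
            raise ValueError(f"expected 12 fields, got {len(fields)}: {line!r}")
        nums = [float(v) for v in fields[1:]]
        objects.append(BbGtObject(
            label=fields[0],
            bb=tuple(nums[0:4]),
            occluded=bool(nums[4]),
            bb_visible=tuple(nums[5:9]),
            ignore=bool(nums[9]),
            angle=nums[10],
        ))
    return objects


# The two annotation lines quoted in this thread.
SAMPLE = """% bbGt version=3
person-fa 172 184 10 26 0 0 0 0 0 0 0
person-fa 188 188 8 24 0 0 0 0 0 0 0
"""

if __name__ == "__main__":
    for obj in parse_bbgt(SAMPLE):
        print(obj.label, obj.bb, "occluded" if obj.occluded else "not occluded")
```

Under this reading, the trailing zeros in the sample simply say "not occluded, no visible-region box, not ignored, angle 0".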

@brisker
Author

brisker commented Sep 30, 2016

@zhangliliang
I am using a TITAN X GPU.
After nearly 20 hours, why has the training log stopped at iteration 9920? (It stopped yesterday afternoon.)
image

@zhangliliang
Owner

@brisker

It seems that training stopped after iteration 10000, since a model was saved for that iteration.

Check the MATLAB console for any error message.

@brisker
Author

brisker commented Oct 1, 2016

@zhangliliang
Is it supposed to be like this?
image

@zhangliliang
Owner

Yes.

@brisker
Author

brisker commented Oct 1, 2016

@zhangliliang
Is the RPN training code you provide specific to the pedestrian detection dataset, or can it easily be adapted to another dataset? If I want to train a new RPN with my own model definition and my own dataset, could you please tell me which parts of the code I would need to change? (Just general advice on the steps.) Thanks a lot, I am a beginner.

@brisker
Author

brisker commented Oct 1, 2016

I cannot understand why the annotations of the Caltech pedestrian dataset are so different from those of the PASCAL VOC2007 dataset. VOC has just four numbers per ground-truth bounding box, but the Caltech files contain many extra zeros. My own dataset is annotated in the PASCAL VOC2007 style and format, so I do not know how to adapt it to your code.

@zhangliliang
Owner

The RPN is suitable for detecting a specific class, for example object proposals, pedestrian detection, car detection and so on.

You can find the meaning of the zeros in Dollar's toolbox.

If you want to use your own dataset, you will probably need to change the data preparation procedure and modify the relevant scripts.
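Editor's sketch: one possible route for VOC-style annotations, if you choose to reuse the bbGt text format rather than write a new roidb loader, is to convert each (xmin, ymin, xmax, ymax) box into a bbGt line. The Python below assumes the bbGt fields are label, left, top, width, height, followed by an occlusion flag, a four-number visible box, an ignore flag and an angle, all written as 0 here; that layout is my reading of the toolbox documentation, not something confirmed in this thread. It also assumes VOC corners are inclusive pixel indices and does not touch the coordinate origin. All function names are hypothetical.

```python
def voc_box_to_bbgt_line(label, xmin, ymin, xmax, ymax, inclusive=True):
    """Convert one corner-style box to an assumed bbGt version=3 text line."""
    extra = 1 if inclusive else 0  # inclusive corners cover xmax - xmin + 1 pixels
    width = xmax - xmin + extra
    height = ymax - ymin + extra
    if width <= 0 or height <= 0:
        raise ValueError(f"degenerate box: {(xmin, ymin, xmax, ymax)}")
    # Trailing zeros: occlusion flag, visible box (4 values), ignore flag, angle.
    return f"{label} {xmin} {ymin} {width} {height} 0 0 0 0 0 0 0"


def to_bbgt_text(boxes):
    """Build the text of one annotation file from (label, xmin, ymin, xmax, ymax) tuples."""
    lines = ["% bbGt version=3"]
    lines.extend(voc_box_to_bbgt_line(*box) for box in boxes)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical VOC-style boxes for a single image.
    print(to_bbgt_text([("person", 172, 184, 181, 209),
                        ("person", 188, 188, 195, 211)]), end="")
```

Whether the repo's Caltech loader accepts files produced this way for a new dataset is untested; the owner's advice to adapt the imdb and roidb scripts still applies.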

@brisker
Author

brisker commented Oct 3, 2016

@zhangliliang
So on what kinds of object detection tasks does RPN perform better than Faster R-CNN? What features do these tasks have in common?

@zhangliliang
Owner

@brisker

Hi,

These questions are rather broad and go beyond the scope of this repo's issues.

I recommend reading the original paper for this repo, as well as the Faster/Fast R-CNN and SSD papers, to find out.

@brisker
Author

brisker commented Oct 3, 2016

@zhangliliang
In your paper you mentioned atrous convolution. Where in the code is it used? In the prototxt?

@zhangliliang
Owner

Yes.

In the prototxt used when extracting features for BF training and testing.

@brisker
Author

brisker commented Oct 3, 2016

@zhangliliang
Two questions:
First, in rpn_test.prototxt I found the atrous layer:
layer {
  bottom: "conv4_1_atrous"
  top: "conv4_2_atrous"
  name: "conv4_2_atrous"
  param {
    name: "conv4_2_w"
    lr_mult: 1.0
  }
  param {
    name: "conv4_2_b"
    lr_mult: 2.0
  }
  type: "Convolution"
  convolution_param {
    num_output: 512
    pad: 2
    kernel_size: 3
    filter_stride: 2
  }
}
I do not quite understand this. The layer's "type" is still "Convolution", so atrous convolution is not a newly defined layer in Caffe? Then how is it performed?
image
Where is "rate=2" defined?

Second, in the paper you mention that RoI pooling leads to "plain" features, so you use atrous convolution. But in the train_val.prototxt for RPN training I cannot find a RoI pooling layer. So what problem is atrous convolution meant to solve? Is it just to get feature maps from shallower layers?

@zhangliliang
Owner

Hi,

"rate=2" corresponds to "filter_stride: 2" in the prototxt.

The atrous trick is not used in the RPN part, only in the BF part.
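Editor's sketch: the correspondence between "rate" and `filter_stride` can be illustrated with the standard dilated-convolution arithmetic. The Python below is not the repo's Caffe implementation; the function names are hypothetical, and the only inputs taken from the thread are the layer's `kernel_size: 3`, `pad: 2` and `filter_stride: 2`. It shows why an ordinary "Convolution" layer can be atrous: the same 3 weights are applied to inputs spaced `rate` samples apart, and with pad 2 the feature map keeps its size.

```python
def effective_kernel_size(kernel_size, rate):
    """Extent covered by a kernel whose taps are spaced `rate` samples apart."""
    return kernel_size + (kernel_size - 1) * (rate - 1)


def conv_output_size(input_size, kernel_size, pad, stride=1, rate=1):
    """Output length along one axis for a (possibly dilated) convolution."""
    k_eff = effective_kernel_size(kernel_size, rate)
    return (input_size + 2 * pad - k_eff) // stride + 1


def atrous_conv1d(signal, weights, rate):
    """'Valid' 1-D correlation with taps spaced `rate` samples apart."""
    span = (len(weights) - 1) * rate
    return [
        sum(w * signal[i + j * rate] for j, w in enumerate(weights))
        for i in range(len(signal) - span)
    ]


if __name__ == "__main__":
    # Values from the layer quoted above: kernel_size 3, pad 2, filter_stride 2.
    print(effective_kernel_size(3, 2))           # 3 taps spread over 5 samples
    print(conv_output_size(40, 3, pad=2, rate=2))  # spatial size is preserved
    print(atrous_conv1d([1, 2, 3, 4, 5, 6], [1, 1, 1], rate=2))
```

Note that `filter_stride` is the field name used by the Caffe build in this repo; upstream BVLC Caffe exposes the same idea through the `dilation` field of `convolution_param`.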
