Model different to paper #2

dannyhung1128 · 2019-12-25T11:20:02Z

Hi - thank you for the good work,

I notice that your model in this repo is different from the one you presented in your paper, and that in README you mentioned one can achieve better performance by adding more D4LCN modules.

Which model is the one you used to produce the results in the paper? What's the difference in terms of performance?

Thanks

dingmyu · 2019-12-25T14:31:28Z

Hi Danny,

What do you mean by the difference between the model and the paper?

The model in this repo is consistent with the paper and can produce the same results as the paper. In the experiment, we found that adding the D4LCN module to the first three blocks achieves the best results. And the D4LCN module after block2 is the most important, the module after block1 is the least important.

For convenience, we try to provide a time-saving model, which can be trained on 11GB GPUs in one day. In README, we provide a simplified version of the model. This model only uses a D4LCN module after block2, which also produces good performance. What we want to prove is that using only one D4LCN module can also bring significant improvements. The result of this simplified model is almost the same as that in the paper, you can download and have a try :)

Thanks

dannyhung1128 · 2019-12-25T19:43:02Z

Like you mentioned you use D4LCN 3 times but in your repo only 2 times.
This line was commented out https://github.com/dingmyu/D4LCN/blob/master/models/resnet_dilate.py#L148

dannyhung1128 · 2019-12-26T00:25:46Z

just curious, have you switched your depth input into 3d depth, which contains 3d depth points x, y, z?

dingmyu · 2019-12-26T02:34:31Z

The model in this repo is consistent with the paper and can produce the same results as the paper. In the experiment, we found that adding the D4LCN module to the first three blocks achieves the best results. And the D4LCN module after block2 is the most important, the module after block1 is the least important.

For convenience, we try to provide a time-saving model, which can be trained on 11GB GPUs in one day.

Hi Danny,

As I said, the performance with and without D4LCN on block1 is similar.

Do you mean pseudo-LiDAR representations or x-y-z three channels? I didn't try pseudo-LiDAR because 3D processing should be more time-consuming than 2D. However, using three channels seems like a good idea if u want to try : )

Thanks.

dingmyu · 2019-12-27T00:50:00Z

Feel free to reopen it if you have any further questions.

dingmyu closed this as completed Dec 27, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model different to paper #2

Model different to paper #2

dannyhung1128 commented Dec 25, 2019 •

edited

Loading

dingmyu commented Dec 25, 2019

dannyhung1128 commented Dec 25, 2019 •

edited

Loading

dannyhung1128 commented Dec 26, 2019

dingmyu commented Dec 26, 2019

dingmyu commented Dec 27, 2019

Model different to paper #2

Model different to paper #2

Comments

dannyhung1128 commented Dec 25, 2019 • edited Loading

dingmyu commented Dec 25, 2019

dannyhung1128 commented Dec 25, 2019 • edited Loading

dannyhung1128 commented Dec 26, 2019

dingmyu commented Dec 26, 2019

dingmyu commented Dec 27, 2019

dannyhung1128 commented Dec 25, 2019 •

edited

Loading

dannyhung1128 commented Dec 25, 2019 •

edited

Loading