
Preparation for CLOCs #2

Closed
shaunkheng97 opened this issue Jan 14, 2021 · 12 comments
shaunkheng97 commented Jan 14, 2021

Hi, I am planning to fuse YOLOv4 with SECOND/PointPillars. Will you be providing a tutorial/guide on extracting the bounding boxes before NMS?

pangsu0613 (Owner)

Hello, for SECOND-V1.5 (newer versions of SECOND should be very similar), check the file 'voxelnet.py' (https://github.com/traveller59/second.pytorch/blob/v1.5/second/pytorch/models/voxelnet.py). At line 377, `batch_box_preds = preds_dict["box_preds"]` holds the raw output from the SECOND network, i.e. the encodings of the bounding boxes before NMS. First you need to decode them (line 387); the decoded boxes are [x, y, z, w, l, h, r] in the lidar coordinate frame. If you need them in the camera coordinate frame, use the functions in https://github.com/traveller59/second.pytorch/blob/v1.5/second/pytorch/core/box_torch_ops.py to transform them. 'box_torch_ops.py' also provides many other useful 2D/3D bounding box and coordinate transformation functions.
As for YOLOv4, I am not very familiar with the YOLOv4 codebase, but in my experience, just setting the NMS score threshold to 0 (if you are using sigmoid scores, 0 means no thresholding) also works fine.
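For reference, the standard SECOND residual box encoding can be inverted with a few lines of NumPy. This is a simplified, hypothetical sketch of what the decode in 'box_torch_ops.py' does (assuming the default encoding: x/y offsets normalized by the anchor diagonal, z by the anchor height, log-encoded sizes, and a plain yaw offset), not the library call itself:

```python
import numpy as np

def decode_second_boxes(box_preds, anchors):
    """Decode raw SECOND box regressions into [x, y, z, w, l, h, r].

    box_preds, anchors: (N, 7) arrays with columns [x, y, z, w, l, h, r].
    Simplified, illustrative re-implementation of the standard SECOND
    residual decoding, not the actual second.pytorch code.
    """
    xa, ya, za, wa, la, ha, ra = np.split(anchors, 7, axis=-1)
    xt, yt, zt, wt, lt, ht, rt = np.split(box_preds, 7, axis=-1)
    diag = np.sqrt(la ** 2 + wa ** 2)   # anchor diagonal normalizes x/y offsets
    x = xt * diag + xa
    y = yt * diag + ya
    z = zt * ha + za                    # z offset is normalized by anchor height
    w = np.exp(wt) * wa                 # sizes are log-encoded
    l = np.exp(lt) * la
    h = np.exp(ht) * ha
    r = rt + ra                         # yaw is a plain offset
    return np.concatenate([x, y, z, w, l, h, r], axis=-1)
```

With this decoding in hand, the decoded boxes are in the lidar frame and would still need the camera-frame transforms from 'box_torch_ops.py' if required.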

CodeDragon18

Hi,
when will you update your code? I am waiting to try it.

shaunkheng97 (Author) commented Jan 19, 2021

Currently I have a YOLOv4 model trained on the BDD dataset and a SECOND v1.6 model trained on the KITTI dataset. My questions are:

  1. If I plan to evaluate CLOCs on the KITTI dataset, is it recommended to train YOLOv4 on KITTI to optimize performance?
  2. From my understanding, I do not need to retrain the networks, but I would have to run inference without NMS on a dataset to produce the input for CLOCs. Is that correct?

pangsu0613 (Owner)

@shaunkheng97

  1. Yes, it is recommended to train the 2D detector (here, your YOLOv4) on the KITTI dataset. If the 2D detector performs poorly on KITTI, it will spoil the fusion.
  2. Yes, you are right. You don't need to re-train the networks; just run inference without NMS, or without NMS score thresholding. The point is to get more raw outputs from the network.
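As a concrete illustration, "inference without NMS score thresholding" just means keeping every candidate box instead of filtering by confidence. A hypothetical helper (names are illustrative, not from either codebase) could look like:

```python
import numpy as np

def raw_candidates(boxes, scores, score_thresh=0.0):
    """Filter candidate boxes by a confidence threshold.

    With sigmoid scores in (0, 1) and score_thresh=0.0, the mask keeps
    everything, so the downstream fusion stage (e.g. CLOCs) sees all of
    the detector's raw candidates rather than a post-NMS subset.
    """
    keep = scores > score_thresh
    return boxes[keep], scores[keep]
```

With `score_thresh=0.0`, the output is identical to the input; raising the threshold back up recovers the usual pre-NMS filtering.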

pangsu0613 (Owner)

@CodeDragon18
Thank you for your interest. I have been really busy these days, but I have started working on it and will upload an early version as soon as possible.

shaunkheng97 (Author)

Alright, I’ll work on it in the meantime! Thanks!

shaunkheng97 (Author)

Hi, I am just curious and confused about the training.

I used 90% of the 7480-frame KITTI training set for training and 10% for validation. If I run inference without NMS, would I have to reuse the training data as the inference set? Wouldn't it be contradictory to use the same data for both training and inference?

pangsu0613 (Owner)

Yes, you are right. Ideally, one should divide the dataset into 3 parts: part 1 for training the 3D and 2D detectors, part 2 for training CLOCs, and part 3 for validation only. But for KITTI there are two caveats. First, the 3712-frame mini-training / 3769-frame validation split is so popular that many researchers use it for their experiments, so it is good to report results on the 3769-frame validation set for comparison. Second, KITTI is a relatively small dataset; I think it is too small to divide into 3 parts. So I just use the popular 3712-frame mini-training set to train the 3D/2D detectors and CLOCs, and do validation on the 3769-frame validation set. This is NOT the most rigorous way to train, but even so, I still get some improvements. For other, larger datasets (such as nuScenes, Waymo and Argoverse), dividing into 3 parts would be the better choice.
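The three-part division described above can be sketched in a few lines (the fractions here are purely illustrative; as noted, KITTI itself is likely too small for this):

```python
import random

def three_way_split(frame_ids, fracs=(0.5, 0.3, 0.2), seed=0):
    """Shuffle frame ids and split them into three disjoint parts:
    detector training, CLOCs training, and validation.
    The fractions are illustrative, not a recommendation."""
    ids = list(frame_ids)
    random.Random(seed).shuffle(ids)          # deterministic shuffle
    n_det = int(len(ids) * fracs[0])
    n_clocs = int(len(ids) * fracs[1])
    det_train = ids[:n_det]
    clocs_train = ids[n_det:n_det + n_clocs]
    val = ids[n_det + n_clocs:]
    return det_train, clocs_train, val
```

Fixing the seed keeps the split reproducible across the detector-training and CLOCs-training runs, which matters because the three parts must stay disjoint.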

shaunkheng97 (Author)

So for now, should I retrain YOLOv4 on some random 3712 frames, then run inference again on all 7480 frames as the CLOCs input? What was SECOND's training setup like? I believe it is more ideal to train YOLOv4 in the same way as the SECOND model you used.

I might try to train on nuScenes if I can successfully train CLOCs on KITTI.

pangsu0613 (Owner)

Yes, it would be better to train YOLOv4 on the 3712 frames, provided 3712 frames are enough to train it.
Also, the 3712 + 3769 split is NOT random; it is a fixed, well-known conventional split used by many researchers so that people can compare different networks on the same validation set. I remember the split was proposed in a 2015 paper named '3D Object Proposals for Accurate Object Class Detection'. You can find the split under /CLOCs/second/data/ImageSet; there are multiple text files there. 'train.txt' contains all the frame numbers for the 3712-frame mini-training set, and 'val.txt' contains all the frame numbers for the 3769-frame validation set. SECOND uses the 3712-frame mini-training set for training.
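Those ImageSet files are plain text with one zero-padded frame number per line, so loading a split is a one-liner (a minimal sketch; the path below is the repo location mentioned above):

```python
def read_split(path):
    """Read a KITTI ImageSet split file (e.g. train.txt or val.txt) and
    return its frame numbers as a list of strings like '000123'."""
    with open(path) as f:
        return [line.strip() for line in f if line.strip()]
```

Usage would look like `frames = read_split("CLOCs/second/data/ImageSet/train.txt")`, after which `len(frames)` should be 3712 for the mini-training set.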

shaunkheng97 (Author)

Alright. I'll attempt to train YOLOv4 on the 3712-frame mini-training set first, and will get back to CLOCs soon!

FaFaLiu commented Apr 24, 2021

Did you succeed? I also want to do this.
