
How to inference #8

Closed
bigsheep2012 opened this issue Oct 16, 2018 · 14 comments

Comments

@bigsheep2012

Hello
May I know how to run inference on an example from the KITTI test dataset, or on a point cloud file in KITTI format but without an info file? I have checked second/pytorch/inference.py. It seems a little different from train.py (e.g., should I create a reduced version of the original point cloud?).

Just want to make sure I am using inference the correct way.

Thanks in advance.
Lin

@traveller59
Owner

inference.py is currently only used by the KITTI viewer; you can check the inference steps in viewer.py.
There is no way to run inference on a single example from the command line for now. You need to use evaluate in train.py to predict the entire test set, or use viewer.py to run inference in the GUI.
Inference step:

  1. Read the point cloud [N, 4] and the calib matrices from file (needed to remove points outside the camera frustum and to generate the 2D bbox / camera-frame 3D box).
  2. Use remove_outside_points to remove those points.
  3. Generate anchors (and anchor masks, if you remove empty anchors), then use prep_pointcloud to get the example dict (only voxels, num_points_per_voxel and coordinates need to be generated at predict time).
  4. Pass the example dict to net.__call__() and get the results.
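Step 2 above can be sketched with plain numpy. This is a simplified stand-in for remove_outside_points, under the assumption that the points are already in the rectified camera frame (the real function in the repo also applies the rect/Trv2c transform first):

```python
import numpy as np

def remove_outside_points_sketch(points, P2, image_shape):
    """Keep only points whose projection through the 3x4 camera matrix P2
    lands inside the image.

    Simplified stand-in for remove_outside_points: assumes `points` [N, 4]
    (x, y, z, reflectance) are already in the rectified camera frame, so
    the rect/Trv2c transform is skipped.
    """
    xyz1 = np.hstack([points[:, :3], np.ones((points.shape[0], 1))])
    uvw = xyz1 @ P2.T                          # project to pixels: [N, 3]
    in_front = uvw[:, 2] > 0                   # drop points behind the camera
    uv = uvw[:, :2] / np.maximum(uvw[:, 2:3], 1e-6)
    h, w = image_shape
    inside = (uv[:, 0] >= 0) & (uv[:, 0] < w) & (uv[:, 1] >= 0) & (uv[:, 1] < h)
    return points[in_front & inside]
```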

@bigsheep2012
Author

Thanks a lot.
I just created a new .py file following your suggestions that runs inference on a single example.
I guess I can just remove the code related to 'infos' if I do not care about the bbox in the camera's coordinates, right?

@traveller59
Owner

Yes, but you still need to create the dict that net.__call__ needs. You also need to add some code in VoxelNet.predict to return LiDAR boxes:
second/pytorch/models/voxelnet.py, line 924:

final_box_preds_camera = box_torch_ops.box_lidar_to_camera(
    final_box_preds, rect, Trv2c)
if self.lidar_only:
    predictions_dict = {
        "box3d_lidar": final_box_preds,
        "scores": final_scores,
        "label_preds": label_preds,
        "image_idx": img_idx,
    }
else:
    # camera code
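For reference, the mapping that box_lidar_to_camera applies to the box centers is x_cam = rect @ Trv2c @ x_lidar (KITTI convention). A numpy-only sketch of just that part; the real function also permutes the box dimensions and adjusts the yaw angle:

```python
import numpy as np

def lidar_to_camera_sketch(points_lidar, rect, Trv2c):
    """Map LiDAR-frame points [N, 3] into the camera frame using the KITTI
    calib matrices: x_cam = rect @ Trv2c @ x_lidar (homogeneous coords).
    Simplified sketch of the center transform inside box_lidar_to_camera.
    """
    n = points_lidar.shape[0]
    xyz1 = np.hstack([points_lidar, np.ones((n, 1))])   # homogeneous [N, 4]
    cam = xyz1 @ (rect @ Trv2c).T                        # [N, 4]
    return cam[:, :3]
```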

@bigsheep2012
Author

Thank you. Closing the issue.

@kxhit

kxhit commented Nov 25, 2018

Hi! @bigsheep2012
I'm trying to use the pretrained model to predict results on my own point cloud. Have you figured it out? Would you mind sharing your code? Thanks a lot!

@bigsheep2018

Hello @kxhit,

I am not using a pretrained model, as @traveller59 said there might be issues using a pretrained model with the SparseConv implementation by Facebook Research.

I trained the model from scratch for about one day on a 1080 Ti. I'm afraid I can't share the code because it was done at a company.

My approach was to first extract the voxel generator, target_assigner and voxelnet parts to build a small version without any data augmentation, and then add the other pieces.

Make your modifications in box_np_ops.* carefully, as there are many small changes needed if you are using your own data.
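The voxel generator mentioned above can be sketched in plain numpy. This is a hypothetical, simplified version for illustration only; the real implementation in the repo is a numba kernel with a max_voxels cap, and it returns coordinates in reversed (z, y, x) order:

```python
import numpy as np

def voxelize_sketch(points, voxel_size, pc_range, max_points=35):
    """Group points into voxels and return (voxels, coordinates,
    num_points_per_voxel), mimicking the shape of VoxelGenerator output.
    `voxel_size` is (dx, dy, dz); `pc_range` starts with (x, y, z) origin.
    """
    voxel_size = np.asarray(voxel_size)
    origin = np.asarray(pc_range[:3])
    coords = ((points[:, :3] - origin) / voxel_size).astype(np.int32)
    voxels, coordinates, counts = [], [], []
    index = {}  # voxel coordinate -> position in the output lists
    for p, c in zip(points, map(tuple, coords)):
        if c not in index:
            index[c] = len(voxels)
            voxels.append(np.zeros((max_points, points.shape[1]), points.dtype))
            coordinates.append(c)
            counts.append(0)
        i = index[c]
        if counts[i] < max_points:  # extra points beyond the cap are dropped
            voxels[i][counts[i]] = p
            counts[i] += 1
    return np.stack(voxels), np.array(coordinates), np.array(counts)
```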

@kxhit

kxhit commented Dec 2, 2018

@bigsheep2012 @bigsheep2018 @traveller59 Thanks for your reply!
I found the runtime is much higher than the time stated in the paper and on the KITTI benchmark (0.05 s).
My test output is:

[14:03:35] input preparation time: 0.2631642818450928
[14:03:36] detection time: 0.291057825088501

I want to know which measurement the 0.05 s refers to, and why it takes me so much longer. Am I doing something wrong?
Thanks in advance!

@traveller59
Owner

@kxhit Some code needs just-in-time (JIT) compilation at runtime, so the first run may take extra time. You can see the input preparation time is very long because point_to_voxel is a numba.jit function. Subsequent runs should be much faster.
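A practical consequence: when benchmarking, discard the first (warm-up) run so one-off JIT compilation does not inflate the average. A small hypothetical helper (not part of the repo):

```python
import time

def benchmark(fn, *args, warmup=1, runs=10):
    """Time `fn(*args)`, discarding `warmup` initial calls so one-off costs
    such as numba JIT compilation don't inflate the average. Returns the
    mean wall-clock seconds per call over `runs` timed calls."""
    for _ in range(warmup):
        fn(*args)                      # warm-up: triggers JIT compilation
    t0 = time.perf_counter()
    for _ in range(runs):
        fn(*args)
    return (time.perf_counter() - t0) / runs
```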

@bigsheep2018

hi @kxhit ,

Firstly, I do not think the KITTI submission by @traveller59 uses the shared GitHub version, since sparse convolution on GPU is not implemented in the GitHub code. Thus, the runtime may differ somewhat.

Secondly, if you are using the author's original code (and the reduced point clouds mentioned in the README) without any modification, you should get a faster result. In my case, on my desktop with a 1060 6G, the detection time is 100-120 ms.

@kxhit

kxhit commented Dec 3, 2018

As you said, subsequent runs cost much less time. Thanks! Testing on a TITAN Xp 12G.
So this open-source code can't reach 0.05 s for now, right?

[14:37:15] input preparation time: 0.013611555099487305
[14:37:15] detection time: 0.08942961692810059

@bigsheep2018

@kxhit
Yes, I think that is the case. You can refer to his paper, which is cited on the KITTI website. The GPU sparse convolution implemented by the author is not shared; this GitHub code uses the default sparse convolution from Facebook Research.

@traveller59
Owner

@kxhit @bigsheep2012 Currently I can't reproduce the speed on Ubuntu 18.04 with PyTorch 1.0 and the newest SparseConvNet. The forward time (not including input preparation) for point cloud 107 is 0.069 s in the current environment, but I could get 0.049 s on the previous 16.04 setup with a 1080 Ti; you can check the deprecated KittiViewer picture in README.md.
I can't even build SparseConvNet correctly after many tries. Right now I am using a wheel package built in a 16.04 docker. I have no idea about this speed problem for now.

@chandrakantkhandelwal


@bigsheep2012 I am also trying to run prediction on LiDAR data only (from a custom LiDAR); I do not have image or calib input. Could you please tell me which files I need to modify to remove the image and calib parameters?

@xieqi1996


@chandrakantkhandelwal I have the same problem. Have you solved it?
