-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Object Detection: About KITTI format #992
Comments
Hello @skyzhao3q, the 16th column ( When you create a dataset, DIGITS will encode the first 15 fields into the label database. However if you use DetectNet, only those fields are used:
You can set other fields to zero if you create your own dataset. Note that if you create your own dataset, it is important to ensure that your objects have a size of around 100 to 200 pixels in the image. |
@gheinrich thx |
hey the order is [top,left,down,right] for the 2D bounding box |
@code-Assasin I am pretty sure from looking at the Kitti images and their labels that the order is [left, top, right, bottom] |
x_min, y_min, x_max, y_max Looking at this image: https://devblogs.nvidia.com/parallelforall/wp-content/uploads/2016/07/Figure5-624x76.png |
Hey do we need matlab support for digits to get the accuracy values and all the visualizations and outputs? Since the kitti data preparation python file generates .m files. |
is there any code to convert tracklet_labels.xml file into text files? |
top,left = (xmin,ymin) |
in practical i have used this x1,y1,x2,y2 = int(bbox['xmin']), int(bbox['ymax']), int(bbox['xmax']), int(bbox['ymin'])
cv2.rectangle(image, (x1,y1), (x2,y2), (0,255,0), 2) hope it could help you. |
Did you find a solution? |
Hey, In my solution, based on kitti_raw_devtools/matlab/run_demoTracklets.m, I write a new .m file to convert tracklets.xml to 3d object detection labels format, but the \alpha field is not in tracklets.xml. The difference between \alpha and r_y is https://github.com/pratikac/kitti/blob/master/readme.tracking.txt#L89, and I am working on it. |
Hey, It's quite late. But you can extend your data by converting the Official KITTI Tracking Datasets to 3D object detection types |
From the Doc of https://github.com/NVIDIA/DIGITS/blob/v4.0.0-rc.3/digits/extensions/data/objectDetection/README.md
Actually it should be 16 columns(1+1+1+1+4+3+3+1+1) as defined below
But when I read the label txt file , It is 15 columns.
for example
Q1:
If 15 columns is right, is that the score field is not be used ?
I want to create my own label file. but I don't have information of dimensions , location and so on.
I think it should be set by default values like 0 or -1.
Q2:
What is the default value of these column ? or How should I do when I don't have the information of any column
thank you!
The text was updated successfully, but these errors were encountered: