Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Object Detection walk-through #803

Merged
merged 2 commits into from
Jun 20, 2016

Conversation

gheinrich
Copy link
Contributor

No description provided.


### Loading the data into DIGITS

To create an Object Detection dataset in DIGITS, [this version of `3.4-dev`](https://github.com/NVIDIA/DIGITS/tree/2613082eec6c207548c99bb69fa5defc9473b1e8), or any more recent version, is required.
Copy link
Member

@lukeyeager lukeyeager Jun 16, 2016

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe just say DIGITS 4.0?

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tried to train detectNet with Digits. Everything is OK after I applied the latest changes but the result is always 0... 50 zero boxes after forward_net. I don't know why.
I used the prototxt in NVIDIA/caffe/examples/kitti. Do you have any suggestion?

@lukeyeager
Copy link
Member

The screenshots for the home page and for the "New Object Detection Dataset" page need updating with the latest UI changes.

@lukeyeager lukeyeager added this to the v4.0.0-rc.2 milestone Jun 16, 2016
}
}
```

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe mention adding the Pretrained model if downloaded.
screen shot 2016-06-16 at 5 12 19 pm

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think that's mentioned on line 83

@IsaacYangSLA
Copy link
Contributor

@gheinrich I will try it on Windows and get the results.

@lukeyeager lukeyeager force-pushed the dev/object-detection-walkthrough branch 3 times, most recently from 3b9ff66 to 0c36fce Compare June 17, 2016 22:51
@lukeyeager
Copy link
Member

Isaac, if you do want to try this on Windows, check the latest updates to this pull request. I updated it to change the suggested learning rate, etc.

gheinrich and others added 2 commits June 17, 2016 15:54
UI has changed in some places
Convert all images from PNG to JPEG for sake of space
@lukeyeager lukeyeager force-pushed the dev/object-detection-walkthrough branch from 0c36fce to a81b526 Compare June 17, 2016 22:54
@lukeyeager lukeyeager merged commit 21cc8ce into NVIDIA:master Jun 20, 2016
@RSly
Copy link

RSly commented Jun 21, 2016

Hi, I was checking this walk-through to reproduce the results.
I couldn't follow the part regarding the Kitti database and .txt label format. I could download some txt files (data_object_label_2.zip, are these the good ones?), but It was not clear what is in those .txt files (looks like 3D bounding boxes?)

=> could you please provide the links (or the name of the zip files) to the images and labels to be used for this walk-through?
=>Also, in order to train with other databases, an example of such files with a little explanation of what should go to the label .txt can be of great help (can it be like [x,y, width, height, category] of a bounding box? what should it be?).

Thanks

@Greendogo
Copy link

It would definitely be helpful to more clearly define the datasets we're to download.
From here: http://www.cvlibs.net/datasets/kitti/eval_object.php
I can see that they are quite large, and I'd rather not download one only to find out it was the wrong one later:

Download left color images of object data set (12 GB)
Download right color images, if you want to use stereo information (12 GB)
Download the 3 temporally preceding frames (left color) (36 GB)
Download the 3 temporally preceding frames (right color) (36 GB)
Download Velodyne point clouds, if you want to use laser information (29 GB)
Download camera calibration matrices of object data set (16 MB)
Download training labels of object data set (5 MB)
Download object development kit (1 MB)

@douzsh
Copy link

douzsh commented Jun 23, 2016

Download left color images of object data set (12 GB)
Download training labels of object data set (5 MB)

I think these two sets are what you need.

Greendogo added a commit to Greendogo/DIGITS that referenced this pull request Jun 23, 2016
Added a link to the KITTI object detection webpage with directions as to which files are needed for the walk-through example.

Thanks to douzsh for his help: NVIDIA#803 (comment)
@gheinrich gheinrich deleted the dev/object-detection-walkthrough branch November 30, 2016 16:49
SlipknotTN pushed a commit to cynnyx/DIGITS that referenced this pull request Mar 30, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

10 participants