I want to detect objects by DPNs #12

Closed
Harold-Zhang opened this issue Sep 20, 2017 · 6 comments

Comments

@Harold-Zhang

How can I train a model to detect objects and get results? May I use the pretrained models?

@Harold-Zhang
Author

@cypw

@cypw
Owner

cypw commented Sep 21, 2017

There are many examples showing how to use a pre-trained model to do object detection in MXNet, such as:

  1. Faster R-CNN in MXNet: mx-rcnn
  2. SSD in MXNet: mxnet-ssd

You can get started by following their README files and substituting DPN for the base CNN.
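To illustrate what "substituting the base CNN" means structurally, here is a minimal, framework-free sketch. It is not MXNet code and not taken from mx-rcnn or mxnet-ssd: `Backbone`, `toy_resnet`, `toy_dpn`, `detection_head`, and `build_detector` are all hypothetical names, and the "features" are plain Python lists standing in for convolutional feature maps. The point it tries to show is only that the detection head stays fixed while the feature extractor behind it is swapped.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical stand-in: a real backbone would emit a conv feature map,
# not a flat list of floats.
Features = List[float]


@dataclass
class Backbone:
    """A named feature extractor with a fixed image -> features interface."""
    name: str
    extract: Callable[[List[float]], Features]


def toy_resnet(image: List[float]) -> Features:
    # Placeholder for the original base CNN.
    return [2.0 * px for px in image]


def toy_dpn(image: List[float]) -> Features:
    # Placeholder for the replacement base CNN (DPN in this thread).
    return [2.0 * px + 1.0 for px in image]


def detection_head(features: Features) -> float:
    # Placeholder for the RPN / box head; here it just averages the features.
    return sum(features) / len(features)


def build_detector(backbone: Backbone) -> Callable[[List[float]], float]:
    """Compose a fixed head with whichever backbone is passed in."""
    def detect(image: List[float]) -> float:
        return detection_head(backbone.extract(image))
    return detect


resnet_detector = build_detector(Backbone("resnet", toy_resnet))
dpn_detector = build_detector(Backbone("dpn", toy_dpn))
```

In a real detection codebase the equivalent change would presumably live wherever the network symbol is defined, and the head may also need its input channel count and feature stride adjusted to match the new backbone.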

@Harold-Zhang
Author

Harold-Zhang commented Sep 22, 2017

@cypw Thanks for your reply. I saw the ILSVRC2017 object detection results on ImageNet, where your team used an ensemble of DPN models to detect objects. How did that work? Did you use DPNs directly, or Faster R-CNN? Thank you~

@Harold-Zhang
Author

@cypw Could you point me to some papers about your DPNs? I want to learn more, but I couldn't find anything beyond the ILSVRC2017 workshop material. By the way, could you tell me how long it took you to train the DPN-92 model with 4 x K80? Thanks for your help.

@cypw
Owner

cypw commented Sep 28, 2017

@Harold-Zhang Thank you for your interest.

In the competition, we used the Faster R-CNN framework, substituted different kinds of DPNs for the base CNN, and ensembled the resulting models. Several tricks were used to boost performance; see our team's abstract [here] for more information.

Besides the workshop [slides], we also have a [paper] and a [video] (in Chinese) about the proposed DPNs.

As for the training time, DPN-92 takes about one day on a cluster with 40 x K80 cards. The actual training speed of each node is shown [here]. I haven't tried training DPNs on a single node, but I guess it would take you more than a week to train DPN-92 from scratch.
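As a rough sanity check on the "more than a week" guess, the reported figure can be scaled by GPU count. This is only a back-of-the-envelope sketch: the function name `estimate_days` is hypothetical, the 4-GPU target comes from the question above rather than from any measurement, and it assumes training throughput scales linearly with the number of cards, which real clusters only approximate.

```python
def estimate_days(ref_days: float, ref_gpus: int, target_gpus: int) -> float:
    """Scale a measured training time to a different GPU count.

    Assumes perfectly linear scaling: the total work is ref_days * ref_gpus
    GPU-days, divided evenly across target_gpus.
    """
    if ref_gpus <= 0 or target_gpus <= 0:
        raise ValueError("GPU counts must be positive")
    gpu_days = ref_days * ref_gpus
    return gpu_days / target_gpus


# Reported in this thread: about 1 day on 40 x K80.
# Assumed target: a single node with 4 x K80, as in the question.
single_node_days = estimate_days(ref_days=1.0, ref_gpus=40, target_gpus=4)
print(f"Estimated training time on 4 x K80: {single_node_days:.1f} days")
```

Under these assumptions the estimate comes out to about ten days, which is consistent with the guess of more than a week.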

@Harold-Zhang
Author

Thank you very much! @cypw
