yolov2 and darknet19 #62

Sunyouteng · 2018-10-25T03:21:17Z

Hello, I wanna prune yolov2's pretrained model, just wanna it to have fewer filters for each layer. But, it is not in the Torchvision'model set. Does a model have to be in Torchvision'model set if I wanna prune it? I studied your documentation for a week, and i did not find a clear way to do that. Yolov2 is first trained on ImageNet then we got Darknet19 model. And then make a little change about darknet19 network, and train it again on object detection dataset and we got yolov2. And I wanna to prune this model. I am new in Pytorch. Can I do this with Distiller? Can you give me some detailed instructions? If yes, I would like to contribute my work to the nice Distiller.

nzmora · 2018-10-25T09:31:47Z

Hi,

Great to hear that you are considering using Distiller!
Your question is packed, so I'll take it step by step:

Sure, you can use Distiller with any network, not just the models that come from TorchVision. You can see some examples of image classification models that we added here. The code that adds them is here.
If you add YoLov2, you will be the first to add an object detection model to Distiller, which will help other people! This might mean a bit of extra work for you, because you might have to adjust the image classification sample application to peform object detection. Therefore, it will probably be easier for you to simply write a new sample application, specifically for object detection, based on your current YoLo application.
When I added a network that learns a language-model I wrote a new sample application (because this is very different from image classification) and I also wrote a tutorial. This tutorial should give you a lot of details on how to add a new application. Basically, to get started you don't need much: you just add some callbacks to your exiting Pytorch YoLo application.

If you need help and guidance, feel free to post more questions and we will try to help.
Cheers,
Neta

Sunyouteng · 2018-11-03T15:27:33Z

@nzmora
Dear Mr. nzmora:
First, thanks a lot for your detailed solution for how to prune YoLov2. I feel encouaged to do this, alothough with big fear to fail.

For the past week, I spend two days looking into your tutorial, focusing on the difference between the raw main.py from pytorch/examples/word_language_model/ and your main in this project. I get the main idea of how to integrate my application with your distiller library. But, I have no idea with NLP(I am into CV), which makes me feel confused sometimes.
I spend three days learning all the pruning alogrithms in your model zoo. I want to choose L1-Norm, followed by networking thinning. Now, i can print the sparsity table of the model. Then I write a yaml file for compression. When i use distiller.config.file_config(model, None, args.compress)to config it, an error happened.

Fatal error while parsing [section:lr_schedulers] [item:exp_finetuning_lr]
Exception: <class 'TypeError'> NoneType is not an Optimizer
Fatal while creating lr_schedulers
Exception: <class 'TypeError'> NoneType is not an Optimizer
Actually i copied this part from your yaml file for network thinning.

Then I want to debug the compress_classifier.py as https://github.com/NervanaSystems/distiller/blob/master/examples/pruning_filters_for_efficient_convnets/resnet56_cifar_filter_rank.yaml did, so that i can see the detailed steps about parsing the yaml file. Then an error happened when it runs model.cuda. I know the default gpus = None, and i dont have GPU in my PC, but it still runs model.cuda. Is there anything wrong with this? I want to make sure my code can run and then i move to a server with 2 GPUs to do the experiment with SSH.
Now I have my version of yaml with target sparsity level using "pruning_filters_for_efficient_convnets" algorithm and a pretrained weights file. Since I have a pretrained model, I want to "physically" remove the pruned filters from the network, then I set the epoch=0 in policy. and start to finetune for epoch 1 to epoch 100. Is that ok? Can i visualize the process of finetuning? And for the net_thinner class, how to change the arch: 'resnet56_cifar' and dataset: 'cifar10' for my applicaition?
I wanna say something about python. It really makes life easier i think. But, sometimes it is too abstract and encapsulate to many details with so many libs like Pytorch. I like C, I can know what is the programs doing step by step. It is painful for me to fit Python. But, i have to acquire it, for it will be very useful in AI.

Last, thanks again for your your time reading my comments. I will appreciate it if you reply.
best regards!
Youteng Sun.

nzmora · 2018-11-04T13:15:13Z

Hi Sun,

I think the easiest way to proceed is if you opened a PR on a branch (e.g. call this branch "yolov2") so that I can see (and execute) your code.

(2) You can attach the YAML file for me to look at. But sharing all of the code is better, because the problem can be in the code, and not in the YAML.

(3) We currently don't support training on the CPU. We want to add this feature, but it is low priority (since training most models takes a long time on a CPU compared to a GPU). Again, if you share your code, I can run it on a GPU to see if it works (but note that I'm going to be in Beijing some of the time and won't be able to help).

(4) Using thinning in epoch=0 and then fine-tuning sounds good.
Regarding "And for the net_thinner class, how to change the arch: 'resnet56_cifar' and dataset: 'cifar10' for my application?" - this is a good question. Is the dataset COCO? I think here we might need to make some changes in the main code to support this (I need to see your code ;-).

Cheers,
Neta

Sunyouteng · 2018-11-15T06:47:02Z

@nzmora
Hello, friend:
Before I want to use this distiller, I have to get a training program for yolov2. I downloaded it from github, but there is something wrong with it,(maybe i am wrong). After debug the yolov2 pytorch version for a long time, I gave up. And I decided to write a program to implement the algorithm <> based on Darknet (C language) and prune the filter physically. And i realy made it yesterday. Thanks for the talking with me. I will go learning disitiller when i take part in the job next year I think. It is happy to learn so much from this open project.
Hope you had a good trip in Beijing. And i am in Shanghai.
Best regards
cheers,
Sun.

nzmora self-assigned this Oct 25, 2018

Sunyouteng closed this as completed Nov 15, 2018

HKLee2040 mentioned this issue Dec 13, 2018

Does quantization support custom model,like object detection model？ #106

Closed

YsYusaito mentioned this issue Jul 13, 2021

Reduce the yolov3 model size of keras(.h5) or darknet(.weight) #560

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

yolov2 and darknet19 #62

yolov2 and darknet19 #62

Sunyouteng commented Oct 25, 2018

nzmora commented Oct 25, 2018

Sunyouteng commented Nov 3, 2018

nzmora commented Nov 4, 2018

Sunyouteng commented Nov 15, 2018

yolov2 and darknet19 #62

yolov2 and darknet19 #62

Comments

Sunyouteng commented Oct 25, 2018

nzmora commented Oct 25, 2018

Sunyouteng commented Nov 3, 2018

nzmora commented Nov 4, 2018

Sunyouteng commented Nov 15, 2018