
Loss doesn't decrease and stays at 40 on my dataset #24

Open · guapizyq opened this issue Jan 22, 2019 · 9 comments

@guapizyq

I trained on my own dataset with the default script. The dataset has three object classes, with 40000 training samples and 6000 validation samples, and the batch_size is 4. The loss decreased to 49 in the first epoch, but by the 13th epoch it stays at 40 and doesn't decrease anymore. How should I change my learning rate and the other parameters?

@guapizyq (Author)

Any advice on this issue would be appreciated.

@Adamdad (Owner) commented Jan 22, 2019

Can you detect anything in the test set? If not, what are your learning rate and lr-decay patience?
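
For reference, "lr decay patience" in stock Keras is usually handled with the ReduceLROnPlateau callback. A minimal sketch (generic Keras; this repo's train script may wire it up differently):

```python
from keras.callbacks import ReduceLROnPlateau, EarlyStopping

# Halve the learning rate whenever val_loss fails to improve for
# `patience` epochs; this is the "lr decay patience" asked about above.
reduce_lr = ReduceLROnPlateau(monitor='val_loss', factor=0.5,
                              patience=3, verbose=1)
# Optionally stop training altogether after a longer plateau.
early_stop = EarlyStopping(monitor='val_loss', patience=10, verbose=1)

# These would be passed to the training call, e.g.:
# model.fit(..., validation_data=..., callbacks=[reduce_lr, early_stop])
```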

@guapizyq (Author)

I am sorry, it seems it was my mistake. So the first training stage is to get a stable loss, and the second stage, with no frozen layers, is to get a lower loss?

@Adamdad (Owner) commented Feb 18, 2019

Correct.

@guapizyq (Author)

I got the loss lower than 40, but it is still 38. The results on my test set look OK; I just do not know how to decrease the loss further.

@zhangyufei1995

@guapizyq @Adamdad I am very happy to discuss this with you. What I want to ask is: 1. What is the epochs setting at the red arrow here? 2. What is the initial_epoch setting at the two black arrows? I look forward to your answer.
[screenshot: QQ图片20190531173008]

@zhangyufei1995

And why is training divided into two stages, a first training and a second training? What is the role of each?

@Adamdad (Owner) commented May 31, 2019

The first training part is for finetuning a model quickly. It freezes most layers and trains only the last few, so we can get an acceptable detection model in a short period of time.

The second training part is for getting a complete model; all the layers are trained in this stage.

On most occasions I only use the second part. The epochs setting is not important here.
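
A minimal sketch of that two-stage schedule, which also shows where the epochs and initial_epoch values from the question above go. The dummy model, random data, 'mse' loss, and the 50/100 epoch counts are all illustrative stand-ins, not this repo's exact script:

```python
import numpy as np
from keras.models import Sequential
from keras.layers import Dense
from keras.optimizers import Adam

# Dummy model and random data standing in for the real MobileNet-YOLO
# model and the repo's data generators.
model = Sequential([Dense(32, activation='relu', input_shape=(8,)),
                    Dense(16, activation='relu'),
                    Dense(1)])
x, y = np.random.rand(64, 8), np.random.rand(64, 1)

# Stage 1: freeze everything but the head and train briefly to get an
# acceptable model fast. epochs=50 is an illustrative "red arrow" value.
for layer in model.layers[:-1]:
    layer.trainable = False
model.compile(optimizer=Adam(lr=1e-3), loss='mse')
model.fit(x, y, epochs=50, initial_epoch=0, verbose=0)

# Stage 2: unfreeze all layers, recompile with a lower lr, and resume
# counting from where stage 1 stopped; initial_epoch=50 is the
# illustrative "black arrow" value.
for layer in model.layers:
    layer.trainable = True
model.compile(optimizer=Adam(lr=1e-4), loss='mse')
model.fit(x, y, epochs=100, initial_epoch=50, verbose=0)
```

Note that changes to the trainable flags only take effect after compile() is called again, which is why each stage recompiles.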

@gzz1529657064

> The first training part is for finetuning a model quickly. It freezes most layers and trains only the last few, so we can get an acceptable detection model in a short period of time.
>
> The second training part is for getting a complete model; all the layers are trained in this stage.
>
> On most occasions I only use the second part. The epochs setting is not important here.

After training, I have a model with a size of 277 MB. It is bigger than YOLO-v3. Why? Doesn't MobileNet reduce the number of parameters? This is my training strategy on my dataset (a likely cause of the size is sketched after this list):

  1. Unfreeze all of the layers
  2. learning_rate = 0.001
  3. load_pretrained=False
  4. batch_size = 16
