Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to adjust learning rates or something to be better when using a pretrained model to train my dataset?please give me some advice #820

Open
J-LINC opened this issue Mar 22, 2023 · 5 comments

Comments

@J-LINC
Copy link

J-LINC commented Mar 22, 2023

I'm trying to train my own data set using pretrained weights, BDD100K. Can you give me some advice? (All parameters unchanged, e.g. learning rate and step of learning rate decay) Below is the result of my 12 epoch training. I think map is a little too low, but the values of each loss function shown in the diagram are very small. I am worried that if I continue without any hyper parameter changes, the model will be overfitting. I currently set the number of training rounds to 250 epoch, which is evaluated every 10 rounds, and every 50 epochs saves the model.The following image shows the losses I have trained for 10 epochs and the evaluation results on the validation set
2023-03-22 23-13-28 的屏幕截图

2023-03-22 23-16-25 的屏幕截图

@Flova
Copy link
Collaborator

Flova commented Apr 4, 2023

What pretrained weights did you use? I would not worry about overfitting that much on this dataset scale. I am preparing a PR with better hyperparameters in the near future, but for now I would just recommend longer training and maybe a slightly higher lr.

@J-LINC
Copy link
Author

J-LINC commented Apr 5, 2023

What pretrained weights did you use? I would not worry about overfitting that much on this dataset scale. I am preparing a PR with better hyperparameters in the near future, but for now I would just recommend longer training and maybe a slightly higher lr.

I did not use the pre-training model because I think BDD100k is big enough. I forgot whether the pre-training model was used when I trained COCO in the original paper. Now I am wondering whether to change the learning rate

@J-LINC
Copy link
Author

J-LINC commented Apr 5, 2023

What pretrained weights did you use? I would not worry about overfitting that much on this dataset scale. I am preparing a PR with better hyperparameters in the near future, but for now I would just recommend longer training and maybe a slightly higher lr.

you mean learning rate > 0.001 will be better? i think the initial learning rate in the yolov3-custom.cfg is too low.

@J-LINC
Copy link
Author

J-LINC commented Apr 5, 2023

What pretrained weights did you use? I would not worry about overfitting that much on this dataset scale. I am preparing a PR with better hyperparameters in the near future, but for now I would just recommend longer training and maybe a slightly higher lr.

Before your reply, I have modified ,which is not to use the pre-training model. As for the two messages I just replied to you, I plan to look at the training results tomorrow and reply to you

@Flova Flova mentioned this issue Apr 12, 2023
4 tasks
@J-LINC
Copy link
Author

J-LINC commented May 2, 2023

@Flova hi ,i think most issues about map low is because they just use one optimiser ,adam, i checked your code long time ago and i found sgd optimiser emmm,it 's better than adam in training ,at least in some aspects

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants