-
Notifications
You must be signed in to change notification settings - Fork 7.4k
Add option to load model weights from checkpoint before starting to t… #368
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add option to load model weights from checkpoint before starting to t… #368
Conversation
…rain objectdetector
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you extend the test to cover this new option?
@MarkDaoust I could add a new test case that loads one of my existing trained checkpoints, evaluates the model and verifies the test case and that weights are loaded by checking that the AP is high enough. |
I'm not sure why the CODEOWNERS file didn't assign Khanh and Lu directly. They're the real owners here. |
Cause the pattern is wrong. Here we are in a subdir that It is only covered by your global. |
Oh, right. I'll send a fix for that. |
@ziyeqinghan Could you take a look? |
Hi and thanks for this.
From what I understand this is because checkpoint for EfficientDetNetTrainHub are different and need a custom function to correctly restore them. Not sure about it though. from my terminal in model_dir path:
|
Since there is no option to create issues, I just have a question how to do multi GPU training using tflite model maker ? |
This does not seem to be in the actual code, yet I see a commit here. What is the status? |
Closing pr since it was reported not to work for other who attempted to use the code and I have no capacity to further investigate it. |
@Viktor-Nilsson This worked for me when I tried it |
I'd be keen to get this supported too, and as I am sure many others would as the ability to improve your own custom model is key without being wasteful with GPU retraining on data you've already trained with before. |
Added an optional parameter that allows passing a path to a checkpoint file when calling objectdetector.create()
If a checkpoint path is passed, the underlying tf.keras.model will load the model weights from the checkpoint before training is started.