Bug: Cannot load training states from checkpoints to resume training #374
Comments
Hello @alarivarmann, you are right about the warning; I am fixing it right now. However, if you want to use checkpoints, you need to add one inside your pipeline. I am trying your code out. Thanks for creating this issue.
I have fixed the warnings here: #378
@alarivarmann, you would need to add a checkpoint at the end of your pipeline for it to be truly resumable. However, do you really need this? It seems like you are trying to run an AutoML loop without using any checkpoints. In that case, I think you should use Pipeline instead of ResumablePipeline. You could also use the Trainer alone, without the AutoML loop. I think you simply want to save and load the pipeline you have trained. Am I right? If so, please take a look at the Step Saving documentation: https://www.neuraxle.org/stable/step_saving_and_lifecycle.html I don't think you are trying to search hyperparameters here. We need to add more examples for this. This feature is still experimental. It will be fully functional after this PR: #377
Hi Alex, thanks for your response. This code snippet was just an example. The reason I wanted to try Neuraxle is to have an AutoML pipeline whose training can be resumed from checkpoints. Does Neuraxle currently support this feature? Thanks a lot!
Yes, there is an example here: https://www.neuraxle.org/stable/examples/caching/plot_auto_ml_checkpoint.html The next release will include this fix, which adds the step saving checkpoint. Thanks for trying out Neuraxle :)
OK, thanks! When do you plan to implement Bayesian optimization/TPE similar to Optuna?
@alarivarmann Until Alexandre answers, I think you could dig into our unit tests for the AutoML hyperparameter selection algorithms. You should be able to see example usages of our TPE there! We just need to add some documentation; I think it was working. @alexbrillant, can you confirm that we have a functional and working TPE implementation? I think this should be documented very soon; it is an important point not to forget.
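For readers unfamiliar with TPE (Tree-structured Parzen Estimator), the idea can be sketched in a few lines of plain Python. This is a simplified, one-dimensional illustration only, not Neuraxle's or Optuna's implementation: the function names (`gaussian_kde`, `tpe_minimize`), the fixed bandwidth, and all default parameters are assumptions made for this example. Past trials are split into a "good" and a "bad" group by loss, a kernel density is fitted to each group, and the next point is the candidate that maximizes the ratio of the good density to the bad density.

```python
import math
import random


def gaussian_kde(points, bandwidth):
    """Return a density function: an equal-weight mixture of Gaussians."""
    norm = bandwidth * math.sqrt(2.0 * math.pi)

    def density(x):
        total = 0.0
        for p in points:
            z = (x - p) / bandwidth
            total += math.exp(-0.5 * z * z) / norm
        return total / len(points)

    return density


def tpe_minimize(objective, low, high, n_trials=40, n_startup=10,
                 gamma=0.25, n_candidates=24, seed=0):
    """Minimize a 1-D objective on [low, high] with a simplified TPE loop.

    Hypothetical helper for illustration; assumes 0 < gamma < 1.
    """
    rng = random.Random(seed)
    bandwidth = (high - low) / 10.0  # fixed bandwidth, an assumption
    n_startup = max(2, n_startup)    # need at least one good and one bad point
    history = []                     # list of (loss, x) pairs

    for trial in range(n_trials):
        if trial < n_startup:
            # Warm-up phase: plain random search.
            x = rng.uniform(low, high)
        else:
            # Split past trials into the best gamma fraction and the rest.
            ranked = sorted(history)
            n_good = min(len(ranked) - 1, max(1, int(gamma * len(ranked))))
            good = [x_i for _, x_i in ranked[:n_good]]
            bad = [x_i for _, x_i in ranked[n_good:]]
            good_density = gaussian_kde(good, bandwidth)
            bad_density = gaussian_kde(bad, bandwidth)
            # Draw candidates around good points, clipped to the bounds.
            candidates = [
                min(high, max(low, rng.gauss(rng.choice(good), bandwidth)))
                for _ in range(n_candidates)
            ]
            # Keep the candidate that looks most "good" relative to "bad".
            x = max(candidates,
                    key=lambda c: good_density(c) / (bad_density(c) + 1e-12))
        history.append((objective(x), x))

    best_loss, best_x = min(history)
    return best_x, best_loss


if __name__ == "__main__":
    best_x, best_loss = tpe_minimize(lambda x: (x - 2.0) ** 2, -5.0, 5.0)
    print(best_x, best_loss)
```

A production TPE additionally handles multiple and conditional hyperparameters, priors, and adaptive bandwidths, so the sketch should be read as a conceptual aid rather than a reference.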
@alarivarmann So here are the TPE tests and how you can use it: You may want to understand how |
Describe the bug
When training AutoML on ResumablePipeline, logs are full of messages like these:
UserWarning: Cannot Load Step /models/resumable_pipeline/AutoML/ResumablePipeline (ResumablePipeline:ResumablePipeline) With Step Saver JoblibStepSaver.
saver.__class__.__name__))
To Reproduce
Expected behavior
I am expecting that there would be no errors such as these. The goal is to be able to warm start training from old checkpoints.
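To make the expected warm-start behavior concrete, here is a minimal sketch in plain Python. It deliberately does not use Neuraxle's API: the function `train_with_checkpoints`, the pickled state dictionary, and the toy update rule are all assumptions made for illustration. The point is only that a second run picks up at the epoch where the first one stopped, instead of starting over.

```python
import os
import pickle
import tempfile


def train_with_checkpoints(total_epochs, checkpoint_path):
    """Run a toy training loop that can be resumed from checkpoint_path."""
    # Resume from the saved state if a checkpoint exists, else start fresh.
    if os.path.exists(checkpoint_path):
        with open(checkpoint_path, "rb") as f:
            state = pickle.load(f)
    else:
        state = {"epoch": 0, "weight": 0.0}

    while state["epoch"] < total_epochs:
        # Stand-in for a real training step: move the weight halfway to 1.0.
        state["weight"] += 0.5 * (1.0 - state["weight"])
        state["epoch"] += 1
        # Write to a temporary file first, then rename it, so an interrupted
        # run never leaves a half-written checkpoint behind.
        tmp_path = checkpoint_path + ".tmp"
        with open(tmp_path, "wb") as f:
            pickle.dump(state, f)
        os.replace(tmp_path, checkpoint_path)
    return state


if __name__ == "__main__":
    path = os.path.join(tempfile.mkdtemp(), "state.pkl")
    first = train_with_checkpoints(3, path)    # trains epochs 1 to 3
    resumed = train_with_checkpoints(5, path)  # resumes at epoch 4
    print(first["epoch"], resumed["epoch"])
```

A resumed run ends in the same state as an uninterrupted run of the same length, which is the property one would want from a resumable pipeline.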
Additional context