New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Jobs fails when loading previous model #3
Comments
Hello, Thank you for your interests in our research. Let us take 2D Burgers equation as an instance. Our goal is to solve the PDE for 1000 time steps. The procedure is to first initialize all the network parameters with function Hope that answer your question. Thank you! |
Thank you for your reply. I have the same problem. I observed that there was no adaptation in the code for multiple training rounds. For example, when I train a step 100 times, what should I change? How do I set the value of 'pre_model_save_path ='? |
Thank you for your question. Yes, we only show the code for 1000 time steps. When training for the 100 steps, you will directly apply the function |
Hello, how do I get the parameter pre_model_save_path? Very confused, hope to get your help, thank you very much |
Thank you for your question. |
For the first pre-training, how to train without pre_model_save_path directly using the network parameters initialized based on the function initialize_weights. |
The |
I'm still confused, because I still can't run it successfully. I read that your code also needs a network pre-training weight for the first training. As for the network initialization weight you said, I don't know how to implement it. I see that a pre-trained model is loaded in the train function defined in your code. I'm messy, can you send me a debugged code on how to get the pretrained model in the first step. Really hope to get your help. My mailbox is 2858724272@qq.com. thank you very much! |
Hi, we have tested the code. It works well. The code posted in the repo does not have bugs. You may modify it for your own purpose (e.g., for different pretraining schemes or different PDE systems). Second, for the first training, you do not need pretrained network parameters (e.g., weights). They are initialized based on the function Third, the pretrained model is loaded unless there is pretraining happening. Namely, you will only need it after the 1st pretraining. |
Thank you for your reply, this is my first training process and the error says that a pre-trained model is required. Is there any special setup required for the first pre-training? thank you |
Hi Dr.Ren, |
Hi Paul,
I hope you are doing well.
I have a question when trying to running the python script. It requires to load a previous trained model, './model/checkpoint500.pt'. Can you please tell me how to obtain this model, or how to define the weights/biases for initializing the network?
many thanks in advance.
The text was updated successfully, but these errors were encountered: