training stopped at epoch 1 #34

Open
rajpratyush opened this issue Mar 11, 2022 · 9 comments · May be fixed by #35

Comments

@rajpratyush

Can you tell me how long the training process takes to complete?

I am using a Google Colab notebook, and it has been stuck at epoch 1 for the last 20 minutes.

@hitvoice
Owner

I just ran the training on an RTX-4000:

03/12/2022 04:51:19 [Data loaded.]
03/12/2022 04:51:26 Epoch 1
03/12/2022 04:59:32 > epoch [ 1] updates[  2707] train loss[4.39872] remaining[0:00:00]

03/12/2022 05:00:20 dev EM: 54.285714285714285 F1: 65.42046261815145
03/12/2022 05:00:22 [new best model saved.]
03/12/2022 05:00:22 Epoch 2

The first epoch finishes about 10 minutes after the data are loaded.
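
For reference, the elapsed times can be read straight off the timestamps in the log above. This is only a small sketch, assuming the `MM/DD/YYYY HH:MM:SS` format shown there; the helper name `elapsed` is made up for illustration and is not part of the project.

```python
from datetime import datetime, timedelta

LOG_TIME_FORMAT = "%m/%d/%Y %H:%M:%S"  # timestamp format used in the log above


def elapsed(start: str, end: str) -> timedelta:
    """Return the time between two log timestamps (hypothetical helper)."""
    started = datetime.strptime(start, LOG_TIME_FORMAT)
    ended = datetime.strptime(end, LOG_TIME_FORMAT)
    return ended - started


# Timestamps copied from the log above.
since_data_loaded = elapsed("03/12/2022 04:51:19", "03/12/2022 05:00:22")
training_pass = elapsed("03/12/2022 04:51:26", "03/12/2022 04:59:32")

print(since_data_loaded)  # 0:09:03
print(training_pass)      # 0:08:06
# Rough throughput: 2707 updates over the training pass of epoch 1.
print(2707 / training_pass.total_seconds())
```

On a different machine, comparing the same two numbers (time per epoch and updates per second) gives a rough idea of how far off the GPU baseline a run is.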

@rajpratyush
Author

I'm training either on Linux without GPU acceleration (I do have a GPU, but I haven't set up CUDA on it and don't know how to) or in a Colab notebook.

Could you post a pretrained model?

@rajpratyush
Author

On Linux the machine hangs during prepro.py, and in Colab it has taken more than 2.5 hours and is still stuck at epoch 1.

@hitvoice
Owner

On CPU it's expected to be slow. I haven't tested on CPUs before, and I have no experience with Colab. Did prepro.py run to completion?
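
One quick way to tell whether a run is actually on a GPU is to ask the framework directly. The sketch below assumes the training code is built on PyTorch, which this thread does not state; `describe_device` is a hypothetical helper, not something from the repository.

```python
import importlib.util


def describe_device() -> str:
    """Report which device a PyTorch job would use (hypothetical helper)."""
    # Assumption: the project uses PyTorch; bail out cleanly if it is absent.
    if importlib.util.find_spec("torch") is None:
        return "PyTorch is not installed"
    import torch

    if torch.cuda.is_available():
        # Name of the first visible CUDA device.
        return "cuda: " + torch.cuda.get_device_name(0)
    return "cpu"


print(describe_device())
```

If this prints `cpu` inside Colab, the notebook is probably on a CPU runtime; the hardware accelerator is selected under Runtime > Change runtime type.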

@rajpratyush
Author

On Colab it did complete, but not on my CPU machine: there it always got stuck at the dev part.

@hitvoice
Owner

If you interrupt the kernel, what's the stack trace? It may help by indicating where it was stuck.
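
When interrupting by hand is not practical, Python's standard `faulthandler` module can write the stack of every thread to a file at a fixed interval, so a trace is left behind even if the process never returns. This is a minimal sketch, not part of the project; the function names, the log file name, and the 10-minute default are all placeholders.

```python
import faulthandler


def watch_for_hangs(log_path: str, timeout_seconds: float = 600.0):
    """Append every thread's stack trace to log_path every timeout_seconds.

    Returns the open log file; keep a reference to it while the watchdog runs.
    """
    log_file = open(log_path, "a")
    # repeat=True keeps dumping at each interval until cancelled.
    faulthandler.dump_traceback_later(timeout_seconds, repeat=True, file=log_file)
    return log_file


def stop_watching(log_file) -> None:
    """Cancel the periodic dump and close the log file."""
    faulthandler.cancel_dump_traceback_later()
    log_file.close()


# Hypothetical usage at the top of a long-running script:
#   log_file = watch_for_hangs("hang_traces.log", timeout_seconds=600)
#   ... run preprocessing or training ...
#   stop_watching(log_file)
```

The last trace in the file shows where the script was when it stalled. If the whole machine freezes rather than just the script, this will not help much, and memory usage during preprocessing is worth checking instead.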

@rajpratyush
Author

Actually, the whole system hung, so I couldn't get a stack trace. Would you mind joining a short online meeting if you are free?

@rajpratyush
Author

@hitvoice My Google Colab notebook is successfully working now. Shall I put up a pull request for those who want to implement it?

@hitvoice
Owner

hitvoice commented Mar 26, 2022

Sorry for the late reply. Pull requests are definitely welcome!

@rajpratyush rajpratyush linked a pull request Mar 26, 2022 that will close this issue