Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Runtimeerror during training #2

Closed
YangJae96 opened this issue Aug 1, 2022 · 4 comments
Closed

Runtimeerror during training #2

YangJae96 opened this issue Aug 1, 2022 · 4 comments

Comments

@YangJae96
Copy link

Hi, I am facing some error while running your codes.

There was some error preparing the setting
such as faiss-gpu, apex, ex..
but I made the code running and like below.
image

THe memory seems to be initalized but the error comes up right at the
begin of training.
image

Could you give some help?

@caposerenity
Copy link
Owner

Hi, @YangJae96 , it seems that Apex is not properly installed on your machine, you can try to re-install it. There won't be errors during memory initialization because apex amp is not employed in that process.

@YangJae96
Copy link
Author

I see. I am runnning the training but it is taking a while.
Did it take more than a day to train the model in both setting?

@caposerenity
Copy link
Owner

caposerenity commented Aug 3, 2022

Hi, @YangJae96 , it takes about 30hours to train the model with PRW as the target domain on a single A100 GPUI. The reverse setting takes much less time because it achieves the best performance at the 7th epoch, you can get more training details from our provided train_log in README.

@YangJae96
Copy link
Author

@caposerenity Okay! Thank you for your detail explanation

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants