Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

hi,thank you for the great codes. But I have met some problems. I just download the code, and run using distribute training mode, #17

Open
yangsuhui opened this issue Aug 26, 2020 · 7 comments

Comments

@yangsuhui
Copy link

No description provided.

@yangsuhui
Copy link
Author

hi,thank you for the great codes. But I have met some problems.
I just download the code, and run 35 epochs using distribute training mode, but using the eval.py to evaluate the perfermance of trained model, just got 0.0 accuracy using 30th epoch model, so i want to know what should I do?
thank you very much

@yizt
Copy link
Owner

yizt commented Aug 26, 2020

@yangsuhui run more epochs until the model is convergent; try 80 or 90 epochs

@yangsuhui
Copy link
Author

@yizt thank you, I will try train more epochs,;
when you train model with generate horizonal datas, and get the best model with 61 epochs as you put in the code; May I ask you, from which epoch the model accuracy is not zero when you train model?

@yizt
Copy link
Owner

yizt commented Aug 26, 2020 via email

@yizt
Copy link
Owner

yizt commented Aug 28, 2020

@yangsuhui hi, loss value is less than 0.01 after 30 epochs;

@yangsuhui
Copy link
Author

yangsuhui commented Aug 28, 2020

@yizt ok, understood, thank you very much!!!
one more, May I ask you how much time you trained from begining to model convergent with 61 epochs in horizonal mode?
I found the convergent speed using distribute training is slower than trained with single gpu,

@yizt
Copy link
Owner

yizt commented Aug 31, 2020

@yangsuhui i train the model using 8 RTX 2080 Ti GPUs with the image batch size of 128; the trainning process takes about 36 hours

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants