Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why is keras training slow? #13

Closed
chenzi opened this issue Apr 4, 2019 · 4 comments
Closed

Why is keras training slow? #13

chenzi opened this issue Apr 4, 2019 · 4 comments
Labels
question Further information is requested

Comments

@chenzi
Copy link

chenzi commented Apr 4, 2019

I use aws ec2 p2.xlarge server 12G gpu 60G Memory, But training is still slow.

WX20190404-155500

image

@chenzi
Copy link
Author

chenzi commented Apr 4, 2019

I get an error RuntimeWarning: Can save best model only with val_acc available, skipping.
'skipping.' % (self.monitor), RuntimeWarning)
image

@GantMan
Copy link
Owner

GantMan commented Apr 4, 2019

You're having around the same speed I have.

I think training is slower in this Keras model for 2 reasons. There is real-time image manipulation to augment the training set, and the amount of data per epoch is huge. Given the batch size, I'm using.
To get this model to 93% I trained on my machine for days.

As for your error message:
Did you change anything in the code? If so consult this link keras-team/keras#836

@GantMan GantMan added the question Further information is requested label Apr 4, 2019
@GantMan
Copy link
Owner

GantMan commented Apr 4, 2019

Another thing, since your computer you're using has more power than mine, you might want to try removing my workers=0 line. You might get a training boost.

@chenzi
Copy link
Author

chenzi commented Apr 6, 2019

I tryed set wokrder=1000 then The training speed is faster.

@GantMan GantMan closed this as completed Aug 6, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants