-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
paddle训练使用多cpu不如单cpu速度快 #923
Comments
版本是0.9.0a0 |
@janelu9 最可能的原因是batch_size设置的过小,导致计算线程大量空闲。 同时,读数据的DataProvider可能写的太慢,导致时间占用都在读数据上。 |
|
另外,不知道这些是否都是物理核个数 |
@backyes 物理核心有2个 逻辑48个 256G内存 suse12系统 |
加大batch_size等于成倍减少训练次数 肯定训练的时间会缩短了 但是精度会下降 |
@reyoung 额 不是训练次数 那是迭代次数了 不过每次迭代的计算量不一样了 |
在服务器上建立了1,2,4,8个cpu的镜像,当trainner_counter分别设置为1,2,4,8时发现速度逐渐变慢,全部设置为1时,速度相当。说明paddle并没有利用多cpu啊
The text was updated successfully, but these errors were encountered: