Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

官方例子image_classification提交MPI任务出错 #790

Closed
pkuyym opened this issue Dec 8, 2016 · 4 comments
Closed

官方例子image_classification提交MPI任务出错 #790

pkuyym opened this issue Dec 8, 2016 · 4 comments
Labels

Comments

@pkuyym
Copy link
Contributor

pkuyym commented Dec 8, 2016

日志信息如下:

Thu Dec 8 19:55:24 2016[1,0]:Python Error: <type 'exceptions.IOError'> : [Errno 2] No such file or directory: '\xff\xc4'
Thu Dec 8 19:55:24 2016[1,0]:Python Callstack:
Thu Dec 8 19:55:24 2016[1,0]: /home/disk1/normandy/maybach/237307/workspace/thirdparty/thirdparty/image_provider.py : 72
Thu Dec 8 19:55:24 2016[1,0]:Calling iterator next error

@backyes
Copy link
Contributor

backyes commented Dec 8, 2016

@pkuyym

paddle_platform自带的test样例是测试case,可能不兼容最新paddle。请依照http://deeplearning.baidu.com/doc/deprecated_doc/doc_cn/demo/index.html 中demo的示例,完成多机训练。 或者根据文档思路,自己基于开源的单机demo,自行修改成多机demo,从单机到多机迁移成本也非常低。

以后百度内部将不再针对每一个单机版本的demo,构建多机版本demo,历史demo集群版本参看 历史文档http://deeplearning.baidu.com/doc/deprecated_doc/doc_cn/demo/index.html

@backyes backyes added the Cluster label Dec 9, 2016
@pkuyym
Copy link
Contributor Author

pkuyym commented Dec 9, 2016

@backyes
demo就是依据http://deeplearning.baidu.com/doc/deprecated_doc/doc_cn/demo/index.html的步骤提交的,现在新的问题是多机版提交任务thirdpary的before_hook.sh执行出错,日志如下:

Fri Dec 9 12:02:29 2016[1,0]:Could not execute the executable "./thirdparty/thirdparty/before_hook.sh": Exec format error
Fri Dec 9 12:02:29 2016[1,0]:
Fri Dec 9 12:02:29 2016[1,0]:This could mean that your PATH or executable name is wrong, or that you do not
Fri Dec 9 12:02:29 2016[1,0]:have the necessary permissions. Please ensure that the executable is able to be
Fri Dec 9 12:02:29 2016[1,0]:found and executed.

@backyes
Copy link
Contributor

backyes commented Dec 9, 2016

这个问题的确奇怪,方便私下提供job地址吧

@pkuyym
Copy link
Contributor Author

pkuyym commented Dec 9, 2016

@backyes
已解决,before_bash.sh第一行的注释应该是#!/bin/bash,而不能是其它注释,可以close掉了,谢谢

@backyes backyes closed this as completed Dec 9, 2016
zhhsplendid pushed a commit to zhhsplendid/Paddle that referenced this issue Sep 25, 2019
* Update install_Windows_en.md

* Update install_Windows_en.md
zhhsplendid pushed a commit to zhhsplendid/Paddle that referenced this issue Sep 25, 2019
* hotfix deadlink (PaddlePaddle#811)

* Update native_infer_en.md (PaddlePaddle#787)

* Update install_Windows_en.md (PaddlePaddle#790)

* Update install_Windows_en.md

* Update install_Windows_en.md

* Update cluster_howto_en.rst (PaddlePaddle#791)

* Update cluster_howto_en.rst

* Update cluster_howto_en.rst

* Update doc/fluid/user_guides/howto/training/cluster_howto_en.rst

Co-Authored-By: acosta123 <42226556+acosta123@users.noreply.github.com>

* Update doc/fluid/user_guides/howto/training/cluster_howto_en.rst

Co-Authored-By: acosta123 <42226556+acosta123@users.noreply.github.com>

* Update cluster_howto_en.rst

* Update index_cn.rst (PaddlePaddle#813)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants