-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PySequence_Check(seq_) 检测失败问题 #3139
Comments
应该还是某几条数据错误。 可以配置跳过错误的数据 @provider(
check=True,
check_fail_continue=True,
)
.... |
可以配置跳过错误的数据 这个是说对data_provide的函数配置跳过错误数据么?我的用的paddle v1 @reyoung 几个疑问: |
如果不想查出来哪几项数据是错的,可以使用如上配置将错误数据跳过。 |
I close this issue due to inactivity. please feel free to reopen it if more information is available. |
Log line format: [IWEF]mmdd hh:mm:ss.uuuuuu threadid file:line] msg
F0801 17:12:41.648218 24026 PythonUtil.h:197] Check failed: PySequence_Check(seq_)
被这个PySequence_Check困扰很久了,问题情况是,完全一样的配置在本地跑没有任何问题,但是放到mpi集群跑,有时候就会挂,报检测失败的问题。
最头痛的是,这并不是每次都可以复现,像是随机的。
我的数据输出格式是一个字典:比如{‘input1‘:[],’input2’:[], 'label':0} (其中label是integer,所以不是sequence)
另外一个情况是,在使用dropout或l1/l2正则系数的时候,这种问题出现就更频繁了
查看代码和之前的issue,也并没有得到实质性的解决,请帮忙解释这个检测的原理,及出现随机现象的原因,谢谢
ps:在集群上paddle因为这种原因挂了的任务,也不会自己kill,而且还占用资源,这个是不是也可以优化下
The text was updated successfully, but these errors were encountered: