New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
系统邮件异常 #104
Comments
################################# 爬虫部分数据后,出现如下异常: worker: Warm shutdown (MainProcess) [2018-07-05 08:41:49,835: ERROR/ForkPoolWorker-1] failed to crawl http://weibo.com/p/1003061497124480/info?mod=pedit_more,here are details:'NoneType' object is not subscriptable, stack is File "/root/softs/weibospider-master/decorators/decorators.py", line 17, in time_limit [2018-07-05 08:41:49,850: INFO/ForkPoolWorker-1] Task tasks.user.crawl_person_infos[e73340af-df06-4745-a349-060112884736] succeeded in 1.632628456999555s: None |
第一个信息那里异常是被捕获到的
这是自己写的异常处理模块提示的信息 |
第一条日志已经删除。 可以复现,后续将发详细日志。 |
第三个问题: 回复的很及时,非常感谢! 2018-07-05 19:11:30 - storage - ERROR - DB operation error,here are details:(pymysql.err.IntegrityError) (1062, "Duplicate entry '4255964637059647' for key 'weibo_id'") [SQL: 'INSERT INTO weibo_data (weibo_id, weibo_cont, weibo_img, weibo_video, repost_num, comment_num, praise_num, uid, is_origin, device, weibo_url, create_time, comment_crawled, repost_crawled, dialogue_crawled, praise_crawled) VALUES (%(weibo_id)s, %(weibo_cont)s, %(weibo_img)s, %(weibo_video)s, %(repost_num)s, %(comment_num)s, %(praise_num)s, %(uid)s, %(is_origin)s, %(device)s, %(weibo_url)s, %(create_time)s, %(comment_crawled)s, %(repost_crawled)s, %(dialogue_crawled)s, %(praise_crawled)s)'] [parameters: {'weibo_img': 'https://tc.sinaimg.cn/images/tc.service.png', 'praise_crawled': 0, 'weibo_cont': '#上证快讯# 【进一 |
非常感谢。 但下面这个问题也是类似吗? 我看日志好像是出现了回滚? 2018-07-05 19:10:50 - crawler - INFO - the crawling url is http://weibo.com/u/1076684233?is_ori=1&is_tag=0&profile_ftype=1&page=4 |
是类似的,好像这个发布版有些错误没有回滚,就会导致这个问题,在 |
已解决,谢谢! |
master 版本关于数据回滚问题, 虽报错,但数据已入库。 |
如下:
问题:
1、抓取几百个微博账号个人信息后,使用的是163的邮箱,出现如下异常,建议对邮件发送部分做异常处理。
######################
[2018-07-05 08:10:09,864: ERROR/ForkPoolWorker-1] Failed to send emails, (535, b'Error: authentication failed') is raised, here are details: File "/root/softs/weibospider-master/utils/email_warning.py", line 48, in send_email
server.login(email_from, email_pass)
worker: Warm shutdown (MainProcess)
2018-07-05 08:10:09 - crawler - ERROR - failed to crawl http://weibo.com/p/1035051742566624/info?mod=pedit_more,here are details:'NoneType' object is not subscriptable, stack is File "/root/softs/weibospider-master/decorators/decorators.py", line 17, in time_limit
return func(*args, **kargs)
[2018-07-05 08:10:09,878: ERROR/ForkPoolWorker-1] failed to crawl http://weibo.com/p/1035051742566624/info?mod=pedit_more,here are details:'NoneType' object is not subscriptable, stack is File "/root/softs/weibospider-master/decorators/decorators.py", line 17, in time_limit
return func(*args, **kargs)
[2018-07-05 08:10:09,896: INFO/ForkPoolWorker-1] Task tasks.user.crawl_person_infos[3f4c447b-f4cf-41f1-89a6-3d929aed6bfd] succeeded in 2.4491456080004355s: None
[2018-07-05 08:10:10,906: WARNING/MainProcess] Restoring 4 unacknowledged message(s)
The text was updated successfully, but these errors were encountered: