Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

blogpostparser crashes parsing empty post #93

Open
wo opened this issue Sep 4, 2016 · 0 comments
Open

blogpostparser crashes parsing empty post #93

wo opened this issue Sep 4, 2016 · 0 comments

Comments

@wo
Copy link
Owner

wo commented Sep 4, 2016

2016-09-02 22:01:28 fetching blog post http://kazez.blogspot.com/2016/09/houston-we-have-cover.html
2016-09-02 22:01:29 no content found!
Traceback (most recent call last):
  File "bin/scraperdaemon.py", line 70, in <module>
    daemon.start()
  File "bin/scraperdaemon.py", line 38, in start
    self.run()
  File "bin/scraperdaemon.py", line 45, in run
    blogpostprocessor.run()
  File "/home/wo/opp-tools/bin/../opp/blogpostprocessor.py", line 26, in run
    process_blogpost(post)
  File "/home/wo/opp-tools/bin/../opp/blogpostprocessor.py", line 33, in process_blogpost
    blogpostparser.parse(doc)
  File "/home/wo/opp-tools/bin/../opp/docparser/blogpostparser.py", line 43, in parse
    doc.numwords = len(doc.content.split())
AttributeError: 'NoneType' object has no attribute 'split'

wo added a commit that referenced this issue Sep 25, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant