New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can't get the full article. #42
Comments
Which url are you trying to parse? |
I tried more than one article, this one for example: 'http://www.bbc.com/news/business-37618618' |
Hm, it looks like they're setting You won't be able to extract the article with |
Fixes #42, resolve issue where data would not be returned if key was in dictionary
This is fixed, version 0.8.3 is now available on pypi! The site you posted now returns: {
'site_name': u 'BBC News',
'description': u 'Samsung has ceased production of its Galaxy Note 7 smartphones after reports of devices it had deemed safe catching fire.',
'videos': [],
'title': u 'Samsung permanently stops Galaxy Note 7 production',
'url': u 'http://www.bbc.com/news/business-37618618',
'status_code': 200,
'locale': u 'en_GB',
'images': [{
'src': 'http://www.bbc.com/news/business-37618618',
'height': None,
'width': None
}, {
'src': u 'http://ichef.bbci.co.uk/news/1024/cpsprodpb/577B/production/_91759322_h2h1xuaj.jpg',
'type': u 'og:image'
}]
} |
Hi, I want to extract the article from the source url. I got only the title of the article and small parts of it under the "description" parameter.
The text was updated successfully, but these errors were encountered: