Can't get the full article. #42

yaseenox-personal · 2016-10-11T13:28:37Z

Hi, I want to extract the full article from the source URL, but I only get the title of the article and small parts of it under the "description" parameter.

michaelhelmick · 2016-10-11T14:35:33Z

Which url are you trying to parse?

yaseenox-personal · 2016-10-13T09:00:51Z

I tried more than one article; this one, for example: 'http://www.bbc.com/news/business-37618618'

michaelhelmick · 2016-10-13T19:05:18Z

Hm, it looks like they're setting the meta description as well as og:description and twitter:description.

You won't be able to extract the full article with lassie, but you will be able to get the description. I'll see if I can fix this.
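For readers following along, here is a minimal sketch of reading the description from a lassie result. `lassie.fetch(url)` is the library's documented entry point; the helper names `get_description` and `fetch_description` are hypothetical and only for illustration, and the result is assumed to be a plain dict with a 'description' key, as in this thread.

```python
def get_description(result):
    """Return the 'description' value from a lassie-style result dict, or None."""
    # .get() avoids a KeyError when the page exposes no description at all.
    return result.get('description')


def fetch_description(url):
    """Fetch page metadata with lassie and return only the description."""
    # Imported lazily so get_description() can be used without lassie installed.
    import lassie
    return get_description(lassie.fetch(url))
```

Calling `fetch_description('http://www.bbc.com/news/business-37618618')` should give the short summary the site publishes in its meta tags, not the article body.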

Fixes #42, resolve issue where data would not be returned if key was in dictionary

michaelhelmick · 2016-10-21T14:36:51Z

This is fixed; version 0.8.3 is now available on PyPI!

The site you posted now returns:

{
    'site_name': u'BBC News',
    'description': u'Samsung has ceased production of its Galaxy Note 7 smartphones after reports of devices it had deemed safe catching fire.',
    'videos': [],
    'title': u'Samsung permanently stops Galaxy Note 7 production',
    'url': u'http://www.bbc.com/news/business-37618618',
    'status_code': 200,
    'locale': u'en_GB',
    'images': [{
        'src': 'http://www.bbc.com/news/business-37618618',
        'height': None,
        'width': None
    }, {
        'src': u'http://ichef.bbci.co.uk/news/1024/cpsprodpb/577B/production/_91759322_h2h1xuaj.jpg',
        'type': u'og:image'
    }]
}
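As a rough illustration of working with a result shaped like the one above, the sketch below picks out the Open Graph image URLs. The helper name `og_image_sources` is hypothetical, and the sample dict is a trimmed copy of the output in this thread; it assumes image entries may lack a 'type' key, as the first entry does.

```python
def og_image_sources(result):
    """Return the 'src' of every image entry tagged as og:image."""
    return [
        image['src']
        for image in result.get('images', [])
        # Entries without a 'type' key (like the first one above) are skipped.
        if image.get('type') == 'og:image'
    ]


# Trimmed copy of the result shown in this thread.
sample = {
    'title': u'Samsung permanently stops Galaxy Note 7 production',
    'images': [{
        'src': 'http://www.bbc.com/news/business-37618618',
        'height': None,
        'width': None
    }, {
        'src': u'http://ichef.bbci.co.uk/news/1024/cpsprodpb/577B/production/_91759322_h2h1xuaj.jpg',
        'type': u'og:image'
    }]
}

print(og_image_sources(sample))
```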

michaelhelmick closed this as completed in 1a5121f Oct 21, 2016

michaelhelmick added a commit that referenced this issue Oct 21, 2016

Merge pull request #43 from michaelhelmick/bugs-42

c775d54

Fixes #42, resolve issue where data would not be returned if key was in dictionary
