[MRG+1] docs: update overview spider code to use toscrape.com and minor changes #2249

eliasdorneles · 2016-09-15T18:21:27Z

So, this will replace the spider example code from the overview that
scrapes questions from StackOverflow by a spider scraping quotes (much
like the one in the tutorial), and upates the text around it to be
consistent.

There are also minor wording changes plus a small Sphinx/reST syntax fix
on the features list at the bottom (it was creating a definition list,
causing one line to be bold).

Does this look good?
Thanks!

So, this will replace the spider example code from the overview that scrapes questions from StackOverflow by a spider scraping quotes (much like the one in the tutorial), and upates the text around it to be consistent. There are also minor wording changes plus a small Sphinx/reST syntax fix on the features list at the bottom (it was creating a definition list, causing one line to be bold).

stummjr · 2016-09-15T18:26:55Z

docs/intro/overview.rst

-of them as they finish.
+an argument. In the ``parse`` callback we loop through the quote elements
+using a CSS Selector, yield a Python dict with the extracted quote text and author,
+look for a link to the next page and schedules another request using the same


and schedules another request using the same: schedules should be in 1st person of plural.

codecov-io · 2016-09-15T18:30:53Z

Current coverage is 83.36% (diff: 100%)

Merging #2249 into master will not change coverage

Powered by Codecov. Last update 2f60f2a...75531e4

redapple

Nice!

kmike · 2016-09-15T19:44:04Z

docs/intro/overview.rst

-                'link': response.url,
-            }
+            next_page = response.css('li.next a::attr("href")').extract_first()
+            if next_page:


I think next_page is None could be slightly better because an empty string is a valid relative URL

eliasdorneles · 2016-09-16T19:00:06Z

hey @kmike, I'll merge this to move things forward, feel free to point out if you have any other concern.
thanks for reviewing!

stummjr approved these changes Sep 15, 2016

View reviewed changes

minor grammar fix

1d159ae

redapple approved these changes Sep 15, 2016

View reviewed changes

kmike reviewed Sep 15, 2016

View reviewed changes

use better condition in example spider

75531e4

redapple changed the title ~~docs: update overview spider code to use toscrape.com and minor changes~~ [MRG+1] docs: update overview spider code to use toscrape.com and minor changes Sep 16, 2016

eliasdorneles merged commit de1a6ac into master Sep 16, 2016

eliasdorneles mentioned this pull request Sep 16, 2016

[backport][1.1] docs: update overview spider code to use toscrape.com and minor changes #2256

Merged

redapple deleted the fix-overview-spider branch October 25, 2016 09:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MRG+1] docs: update overview spider code to use toscrape.com and minor changes #2249

[MRG+1] docs: update overview spider code to use toscrape.com and minor changes #2249

eliasdorneles commented Sep 15, 2016

stummjr Sep 15, 2016

eliasdorneles Sep 15, 2016

codecov-io commented Sep 15, 2016 •

edited

Loading

redapple left a comment

kmike Sep 15, 2016

eliasdorneles Sep 15, 2016

eliasdorneles commented Sep 16, 2016

[MRG+1] docs: update overview spider code to use toscrape.com and minor changes #2249

[MRG+1] docs: update overview spider code to use toscrape.com and minor changes #2249

Conversation

eliasdorneles commented Sep 15, 2016

stummjr Sep 15, 2016

Choose a reason for hiding this comment

eliasdorneles Sep 15, 2016

Choose a reason for hiding this comment

codecov-io commented Sep 15, 2016 • edited Loading

Current coverage is 83.36% (diff: 100%)

redapple left a comment

Choose a reason for hiding this comment

kmike Sep 15, 2016

Choose a reason for hiding this comment

eliasdorneles Sep 15, 2016

Choose a reason for hiding this comment

eliasdorneles commented Sep 16, 2016

codecov-io commented Sep 15, 2016 •

edited

Loading