Add a page which explains important Scrapy concepts in a single page #1569

kmike · 2015-10-29T13:50:29Z

As @plafl said: "Scrapy is very extensible but that has a cost too. There are too many concepts: spiders, items, middlewares, pipelines, exporters, extensions, signals, settings. As a newcomer I would like to know which problem they solve."

+1 :) I think we should add a page which will explain all this in a single place - what are these concepts, when and why to use them.

Granitosaurus · 2015-11-05T10:52:28Z

Isn't that pretty much http://doc.scrapy.org/en/latest/topics/architecture.html?highlight=scrapy%20architecture ? Which is my favorite page in the docs.

kmike · 2015-11-05T20:57:12Z

@Granitas yeah, it is close, a good catch.

This page is not targeted for beginners though, e.g. both for downloader and for spider middleware it says just "They provide a convenient mechanism for extending Scrapy functionality by plugging custom code." without explaining which custom code should go to a spider mw and which should go to downloader mw. You can figure it out by meditating over the architecture overview picture, but it is not an easy task if you're just starting. Also, it doesn't explain extensions at all.

* spiders don't have to work on specific domains; * explain what to use Downloader middleware for and what to use Spider middleware for; * Engine no longer locates spiders based on domains; * "Spider middleware output direction" step was missing. See also: GH-1569.

darshanime · 2016-09-07T07:44:45Z

I can write this page, I have a few questions to get started:

is the page is aimed at the extensions developer
should it read like an article or should it contain code
what is the desired length of the article (how many words)
what concepts should be focused on

Gallaecio · 2019-02-12T14:23:30Z

This seems to go in the lines of a question I recently tried to answer on StackOverflow: https://stackoverflow.com/q/54421455/939364

kmike added the docs label Oct 29, 2015

kmike mentioned this issue Mar 25, 2016

DOC improved Architecture overview #1879

Merged

Gallaecio added the enhancement label Aug 19, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a page which explains important Scrapy concepts in a single page #1569

Add a page which explains important Scrapy concepts in a single page #1569

kmike commented Oct 29, 2015

Granitosaurus commented Nov 5, 2015

kmike commented Nov 5, 2015

darshanime commented Sep 7, 2016

Gallaecio commented Feb 12, 2019

Add a page which explains important Scrapy concepts in a single page #1569

Add a page which explains important Scrapy concepts in a single page #1569

Comments

kmike commented Oct 29, 2015

Granitosaurus commented Nov 5, 2015

kmike commented Nov 5, 2015

darshanime commented Sep 7, 2016

Gallaecio commented Feb 12, 2019