Add a page which explains important Scrapy concepts in a single page #1569
Isn't that pretty much http://doc.scrapy.org/en/latest/topics/architecture.html?highlight=scrapy%20architecture ? Which is my favorite page in the docs.
@Granitas yeah, it is close, good catch. That page is not aimed at beginners though, e.g. for both downloader and spider middleware it just says "They provide a convenient mechanism for extending Scrapy functionality by plugging custom code." without explaining which custom code should go into a spider mw and which should go into a downloader mw. You can figure it out by meditating over the architecture overview picture, but that is not an easy task if you're just starting. Also, it doesn't explain extensions at all.
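To make the distinction concrete, here is a minimal sketch, not taken from the Scrapy docs. It assumes the documented hook names `process_request` (downloader middleware) and `process_spider_output` (spider middleware); the class names, the `example-bot/0.1` string and the `SimpleNamespace` stand-in for a request are made up so the snippet runs without Scrapy installed.

```python
from types import SimpleNamespace


class DefaultUserAgentMiddleware:
    """Downloader middleware sketch: touches requests on their way to the network."""

    def process_request(self, request, spider):
        # Returning None tells Scrapy to keep processing the request as usual.
        request.headers.setdefault("User-Agent", "example-bot/0.1")
        return None


class DropShortTitlesMiddleware:
    """Spider middleware sketch: filters what a spider callback yields."""

    def process_spider_output(self, response, result, spider):
        for entry in result:
            # Plain dicts stand in for items; anything else (e.g. follow-up
            # requests) passes through untouched.
            if isinstance(entry, dict) and len(entry.get("title", "")) < 3:
                continue
            yield entry


# Hypothetical stand-ins for a Scrapy request and a callback's output.
request = SimpleNamespace(url="https://example.com", headers={})
DefaultUserAgentMiddleware().process_request(request, spider=None)

callback_output = [{"title": "ok title"}, {"title": "x"}]
kept = list(
    DropShortTitlesMiddleware().process_spider_output(None, callback_output, spider=None)
)
```

Roughly: code that cares about HTTP traffic (headers, proxies, retries) fits the downloader side, while code that cares about what callbacks receive or produce fits the spider side.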
* spiders don't have to work on specific domains;
* explain what to use Downloader middleware for and what to use Spider middleware for;
* Engine no longer locates spiders based on domains;
* "Spider middleware output direction" step was missing.

See also: GH-1569.
I can write this page, I have a few questions to get started:
This seems to be along the lines of a question I recently tried to answer on StackOverflow: https://stackoverflow.com/q/54421455/939364
As @plafl said: "Scrapy is very extensible but that has a cost too. There are too many concepts: spiders, items, middlewares, pipelines, exporters, extensions, signals, settings. As a newcomer I would like to know which problem they solve."
+1 :) I think we should add a page which will explain all this in a single place - what are these concepts, when and why to use them.
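One possible way to show several of these concepts in a single place is by where each one is registered in a project's settings. The sketch below is only an illustration: the setting names are real Scrapy settings, but every `myproject...` path, class name and priority number is hypothetical.

```python
# Hypothetical settings.py fragment; all "myproject..." paths are made up.

# Spider middlewares: sit between the engine and spider callbacks.
SPIDER_MIDDLEWARES = {"myproject.middlewares.FilterOutputMiddleware": 543}

# Downloader middlewares: sit between the engine and the downloader.
DOWNLOADER_MIDDLEWARES = {"myproject.middlewares.CustomHeadersMiddleware": 543}

# Item pipelines: process items after a spider has yielded them.
ITEM_PIPELINES = {"myproject.pipelines.ValidatePricePipeline": 300}

# Extensions: general-purpose components, typically hooked up via signals.
EXTENSIONS = {"myproject.extensions.CrawlSummaryExtension": 500}

# Feed exports: where and in which format scraped items are written.
FEEDS = {"items.json": {"format": "json"}}
```

A concepts page could pair each of these entries with the problem the component solves.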