GitHub - scrapy/scrapy: Scrapy, a fast high-level web crawling & scraping framework for Python.

Scrapy

Overview

Scrapy is a BSD-licensed fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

Scrapy is maintained by Zyte (formerly Scrapinghub) and many other contributors.

Check the Scrapy homepage at https://scrapy.org for more information, including a list of features.

Requirements

Python 3.8+
Works on Linux, Windows, macOS, BSD

Install

The quick way:

pip install scrapy

See the install section in the documentation at https://docs.scrapy.org/en/latest/intro/install.html for more details.

Documentation

Documentation is available online at https://docs.scrapy.org/ and in the docs directory.

Code of Conduct

Please note that this project is released with a Contributor Code of Conduct.

By participating in this project you agree to abide by its terms. Please report unacceptable behavior to opensource@zyte.com.

Companies using Scrapy

See https://scrapy.org/companies/ for a list.

Commercial Support

See https://scrapy.org/support/ for details.

Name		Name	Last commit message	Last commit date
Latest commit History 10,541 Commits
.github		.github
artwork		artwork
docs		docs
extras		extras
scrapy		scrapy
sep		sep
tests		tests
tests_typing		tests_typing
.bandit.yml		.bandit.yml
.bumpversion.cfg		.bumpversion.cfg
.coveragerc		.coveragerc
.flake8		.flake8
.git-blame-ignore-revs		.git-blame-ignore-revs
.gitattributes		.gitattributes
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yml		.readthedocs.yml
AUTHORS		AUTHORS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
INSTALL.md		INSTALL.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
NEWS		NEWS
README.rst		README.rst
SECURITY.md		SECURITY.md
codecov.yml		codecov.yml
conftest.py		conftest.py
pylintrc		pylintrc
pytest.ini		pytest.ini
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scrapy

Overview

Requirements

Install

Documentation

Releases

Community (blog, twitter, mail list, IRC)

Contributing

Code of Conduct

Companies using Scrapy

Commercial Support

About

Releases 38

Packages

Used by 47.8k

Contributors 561

Languages

License

scrapy/scrapy

Folders and files

Latest commit

History

Repository files navigation

Scrapy

Overview

Requirements

Install

Documentation

Releases

Community (blog, twitter, mail list, IRC)

Contributing

Code of Conduct

Companies using Scrapy

Commercial Support

About

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases 38

Packages 0

Used by 47.8k

Contributors 561

Languages

Packages