Skip to content
@scrapy

Scrapy project

An open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way.

Pinned Loading

  1. scrapy Public

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    Python 54.7k 10.7k

  2. scrapy.org Public

    The scrapy.org website

    HTML 63 141

Repositories

Showing 10 of 27 repositories
  • scrapy Public

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    Python 54,654 BSD-3-Clause 10,713 437 (19 issues need help) 185 Updated Mar 25, 2025
  • queuelib Public

    Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python

    Python 275 BSD-3-Clause 55 3 2 Updated Mar 24, 2025
  • protego Public

    A pure-Python robots.txt parser with support for modern conventions.

    DIGITAL Command Language 61 BSD-3-Clause 28 5 (1 issue needs help) 0 Updated Mar 24, 2025
  • w3lib Public

    Python library of web-related functions

    Python 400 BSD-3-Clause 107 11 (1 issue needs help) 4 Updated Mar 24, 2025
  • itemloaders Public

    Library to populate items using XPath and CSS with a convenient API

    Python 47 BSD-3-Clause 16 17 5 Updated Mar 24, 2025
  • cssselect Public

    CSS Selectors for Python

    Python 293 61 17 5 Updated Mar 24, 2025
  • parsel Public

    Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

    Python 1,206 BSD-3-Clause 149 31 (1 issue needs help) 13 Updated Mar 24, 2025
  • itemadapter Public

    Common interface for data container classes

    Python 67 BSD-3-Clause 13 5 2 Updated Mar 24, 2025
  • form2request Public

    Python 3.8+ library to build HTTP requests out of HTML forms

    Python 4 BSD-3-Clause 3 2 0 Updated Mar 21, 2025
  • scrapyd-client Public

    Command line client for Scrapyd server

    Python 773 BSD-3-Clause 145 5 0 Updated Mar 8, 2025