Skip to content
@crwlrsoft

crwlr.software

PHP Packages for Rapid Crawler and Scraper Development

crwlr.software logo

crwlr.software - PHP Packages for Rapid Crawler and Scraper Development

crwlr.software is a collection of open source PHP composer packages that provide the necessary tools for web crawling and scraping tasks. The crawler package contains everything and helps you build crawlers as fast as possible. And there are also parts of it that you can use standalone.

Popular repositories Loading

  1. crawler crawler Public

    Library for Rapid (Web) Crawler and Scraper Development

    PHP 334 12

  2. url url Public

    Swiss Army knife for urls.

    PHP 102 7

  3. query-string query-string Public

    A library for convenient handling of query strings used in HTTP requests.

    PHP 17 3

  4. schema-org schema-org Public

    Extract schema.org objects from HTML documents

    PHP 11 2

  5. robots-txt robots-txt Public

    Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping

    PHP 9 2

  6. laravel-crawler laravel-crawler Public

    Laravel adapter for the crwlr/crawler package.

    PHP 3

Repositories

Showing 10 of 14 repositories
  • url Public

    Swiss Army knife for urls.

    crwlrsoft/url’s past year of commit activity
    PHP 102 MIT 7 0 0 Updated Oct 24, 2024
  • crawler Public

    Library for Rapid (Web) Crawler and Scraper Development

    crwlrsoft/crawler’s past year of commit activity
    PHP 334 MIT 12 1 0 Updated Oct 24, 2024
  • crawler-ext-browser Public

    Extension for the crwlr/crawler package containing steps utilizing a headless browser.

    crwlrsoft/crawler-ext-browser’s past year of commit activity
    PHP 0 MIT 0 0 0 Updated Oct 15, 2024
  • crwl-extension-utils Public

    Utils for extension packages for the crwl.io app.

    crwlrsoft/crwl-extension-utils’s past year of commit activity
    PHP 0 0 0 0 Updated Oct 15, 2024
  • crwl-ext-browser Public archive

    Extension configurations for integration of crwlr/crawler-ext-browser into the crwl.io app.

    crwlrsoft/crwl-ext-browser’s past year of commit activity
    PHP 0 MIT 0 0 0 Updated Jul 8, 2024
  • html-2-text Public

    Convert HTML to formatted plain text.

    crwlrsoft/html-2-text’s past year of commit activity
    PHP 2 MIT 0 0 0 Updated Feb 21, 2024
  • package-template Public template

    Template repository for new crwlr packages

    crwlrsoft/package-template’s past year of commit activity
    PHP 1 MIT 0 0 0 Updated Feb 5, 2024
  • schema-org Public

    Extract schema.org objects from HTML documents

    crwlrsoft/schema-org’s past year of commit activity
    PHP 11 MIT 2 0 0 Updated Nov 30, 2023
  • utils Public

    Utilities that are needed in multiple crwler packages.

    crwlrsoft/utils’s past year of commit activity
    PHP 2 MIT 1 0 0 Updated Oct 29, 2023
  • robots-txt Public

    Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping

    crwlrsoft/robots-txt’s past year of commit activity
    PHP 9 MIT 2 0 0 Updated Oct 29, 2023

Top languages

Loading…

Most used topics

Loading…