DataEnggNerd
🤠
Fail fast!

Stars

Web scraping tools

16 repositories

Fast, zero-copy HTML Parser written in Rust

Rust · 393 stars · 30 forks · Updated Aug 14, 2024

High-performance browser-grade HTML5 parser

Rust · 2,457 stars · 259 forks · Updated Dec 19, 2025

Low output latency streaming HTML parser/rewriter with CSS selector-based API

Rust · 1,900 stars · 88 forks · Updated Oct 17, 2025

Python version of the Playwright testing and automation library.

Python · 14,087 stars · 1,112 forks · Updated Dec 9, 2025

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python · 59,314 stars · 11,196 forks · Updated Dec 22, 2025

HTTP API for Scrapy spiders

Python · 872 stars · 161 forks · Updated Sep 22, 2025

Generate and download e-books from online sources.

Python · 1,980 stars · 396 forks · Updated Dec 25, 2025

HTML parsing and querying with CSS selectors

Rust · 2,281 stars · 125 forks · Updated Dec 17, 2025

DuckDB is an analytical in-process SQL database management system

C++ · 35,005 stars · 2,811 forks · Updated Dec 24, 2025

🚀 Web scraping for humans

Python · 975 stars · 66 forks · Updated Dec 1, 2024

Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.

Python · 3,391 stars · 584 forks · Updated Feb 19, 2025

Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js

Python · 3,490 stars · 649 forks · Updated Oct 29, 2024

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Python · 7,058 stars · 715 forks · Updated Jun 9, 2025

A Python library for solving reCAPTCHA v2 and v3 with Playwright

Python · 453 stars · 64 forks · Updated Nov 20, 2025

A Rust library to extract useful data from HTML documents, suitable for web scraping.

Rust · 1,014 stars · 68 forks · Updated Mar 19, 2025

🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

Python · 8,330 stars · 476 forks · Updated Dec 26, 2025