web-scraping

Star

Here are 2,644 public repositories matching this topic...

scrapy / scrapy

Star

Scrapy, a fast high-level web crawling & scraping framework for Python.

python crawler framework scraping crawling web-scraping hacktoberfest web-scraping-python

Updated Aug 2, 2024
Python

The best and simplest free open source web page change detection, website watcher, restock monitor and notification service. Restock Monitor, change detection. Designed for simplicity - Simply monitor which websites had a text change for free. Free Open source web page change detection, Website defacement monitoring, Price change notification

notifications monitoring self-hosted web-scraping website-monitor url-monitor change-alert change-detection changedetection website-change-monitor website-change-tracker website-monitoring change-monitoring website-watcher website-change-detector restock-monitor website-change-detection website-change-notification back-in-stock website-defacement-monitoring

Updated Aug 4, 2024
Python

Evil0ctal / Douyin_TikTok_Download_API

Star

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具，支持API调用，在线批量解析及下载。

Updated Jul 7, 2024
Python

alirezamika / autoscraper

Sponsor

Star

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

python crawler machine-learning scraper automation ai scraping artificial-intelligence web-scraping scrape webscraping webautomation

Updated Jun 17, 2024
Python

seleniumbase / SeleniumBase

Star

📊 Python's all-in-one framework for web crawling, scraping, testing, and reporting. Supports pytest. UC Mode provides stealth. Includes many tools.

Updated Aug 3, 2024
Python

mherrmann / helium

Star

Lighter web automation with Python

python firefox chrome webdriver selenium python3 web-scraping helium web-automation selenium-python

Updated Jul 23, 2024
Python

apify / crawlee-python

Star

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

python crawler scraper automation web-crawler headless scraping crawling pip web-scraping beautifulsoup web-crawling headless-chrome apify playwright

Updated Aug 3, 2024
Python

adbar / trafilatura

Star

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Updated Aug 2, 2024
Python

snooppr / snoop

Star

Snoop — инструмент разведки на основе открытых данных (OSINT world)

Updated Jul 30, 2024
Python

lorien / grab

Star

Web Scraping Framework

python crawler framework spider asynchronous network python-library scraping crawling http-client python3 web-scraping pycurl urllib3

Updated Mar 12, 2024
Python

yifeikong / curl_cffi

Star

Python binding for curl-impersonate via cffi. A http client that can impersonate browser tls/ja3/http2 fingerprints.

http curl https http-client web-scraping fingerprinting ja3 tls-fingerprint curl-impersonate ja3-fingerprint http2-fingerprint akamai-fingerprint

Updated Aug 3, 2024
Python

oxylabs / amazon-scraper

Star

Free Trial Amazon Scraper API for extracting search, product, offer listing, reviews, question and answers, best sellers and sellers data.

Updated Jul 23, 2024
Python

vprusso / youtube_tutorials

Sponsor

Star

Collection of scripts corresponding to LucidProgramming YouTube tutorials

python python3 web-scraping youtube-tutorial python-tutorial ctci-solutions lucidprogramming python3-tutorial technical-interview

Updated Oct 26, 2022
Python

je-suis-tm / web-scraping

Star

Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist

Updated Feb 1, 2022
Python

alecxe / scrapy-fake-useragent

Star

Random User-Agent middleware based on fake-useragent

python web-scraping scrapy

Updated Sep 18, 2023
Python

serpapi / google-search-results-python

Star

Google Search Results via SERP API pip Python Package

python scraping web-scraping google-images bing-image google-crawler serp-api serpapi

Updated Jun 19, 2024
Python

AlexMathew / scrapple

Star

A framework for creating semi-automatic web content extractors

python crawler tutorial extractor scraping web-scraper selector css-selector web-scraping scrapy scrapers beautifulsoup xpath-expression lxml selector-expression

Updated Jul 10, 2024
Python

z0m31en7 / Uscrapper

Star

Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.

python osint selenium tor information-extraction websites web-scraping webcrawler webscraping information-gathering website-scraper reconnaissance darkweb selenium-webscraper osint-python webcra osint-tool darkweb-crawler

Updated Jun 13, 2024
Python

shaikhsajid1111 / social-media-profile-scrapers

Star

Fetch user's data across social media

python social-media pinterest web-scraper web-scraping request instagram-scraper twitter-scraper facebook-scraper scrapping-python selenium-python reddit-scraper quora-scraper tiktok-scraper medium-scraper pinterest-scrapper

Updated Jul 11, 2024
Python

jaebradley / basketball_reference_web_scraper

Star

NBA Stats API via Basketball Reference

python nba web-scraper web-scraping basketball-reference

Updated Aug 3, 2024
Python

Improve this page

Add a description, image, and links to the web-scraping topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the web-scraping topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

web-scraping

Here are 2,644 public repositories matching this topic...

scrapy / scrapy

dgtlmoon / changedetection.io

Evil0ctal / Douyin_TikTok_Download_API

alirezamika / autoscraper

seleniumbase / SeleniumBase

mherrmann / helium

apify / crawlee-python

adbar / trafilatura

snooppr / snoop

lorien / grab

yifeikong / curl_cffi

oxylabs / amazon-scraper

vprusso / youtube_tutorials

je-suis-tm / web-scraping

alecxe / scrapy-fake-useragent

serpapi / google-search-results-python

AlexMathew / scrapple

z0m31en7 / Uscrapper

shaikhsajid1111 / social-media-profile-scrapers

jaebradley / basketball_reference_web_scraper

Improve this page

Add this topic to your repo