Skip to content

scrapfly/scrapfly-scrapers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ScrapFly Scrapers 🕷️

This repository contains educational example scrapers for popular web scraping targets using the ScrapFly web scraping API and Python.
Most Scrapers use a simple web scraping stack:

To learn more about web scraping see our full tutorials on how to scrape these targets (and many others) see the scrapeguide directory.

Fair Use and Legal Disclaimer

This repository contains educational reference material to illustrate how accessible web scraping can be and the provided programs are not intented to be used in web scraping production. That being said, Scrapfly team is constantly updating and improving all of this code for optimal experience.

Scrapfly does not offer legal advice and as always, consult a lawyer when creating programs that interact with other people's websites though here's a good general intro of what NOT to do:

  • Do not store PII (personally identifiable information) of EU citizens who are protected by GDPR.
  • Do not scrape and repurpose entire public datasets which can be protected by database protection laws in some countries.
  • Do not scrape at rates that could damage the website and scrape only publicly available data.

Setup and Run

  1. Install the required libraries:
$ pip install scrapfly-sdk[all] jmespath loguru nested-lookup  
  1. Export your ScrapFly API key
  • On Mac:
$ export SCRAPFLY_KEY="YOUR SCRAPFLY KEY"
  • On Windows:
$ setx SCRAPFLY_KEY "YOUR SCRAPFLY KEY"
  1. cd into the scraper directory and run the code:
$ cd ./example-scraper
$ python run.py

List of Scrapers

The following is the list of supported websites grouped by type.

E-Commerce

Fashion

Jobs and Companies

Real Estate

Reviews

Search Engines

Social Media

Travel


Aliexpress

The aliexpress.com scraper can scrape the following data:

  • Product pages for a specific product data.
  • Search pages for product listing data.
  • Product reviews.
View sample data

For the full guide, refer to our blog article How to Scrape Aliexpress.com (2023 Update)

Amazon

The amazon.com scraper can scrape the following data:

  • Product pages for a specific product data.
  • Search pages for product listing data.
  • Product reviews.
View sample data

For the full guide, refer to our blog article How to Scrape Amazon.com Product Data and Reviews

BestBuy

The bestbuy.com scraper can scrape the following data:

  • BestBuy sitemaps for URLs.
  • BestBuy product pages for product data.
  • BestBuy search pages for product data on search pages.
  • BestBuy review pages for review data
View sample data

For the full guide, refer to our blog article How to Scrape BestBuy Product, Offer and Review Data

Bing

The bing.com scraper can scrape the following data:

  • Search pages for search results (SERPs) data.
  • Search pages for keyword data.
  • Search page for rich snippet data.
View sample data

For the full guide, refer to our blog article How to Scrape Bing Search with Python

Booking

The booking.com scraper can scrape the following data:

  • Hotel pages for a specific hotel data.
  • Search pages for hotel listing data.
View sample data

For the full guide, refer to our blog article How to Scrape Booking.com (2023 Update)

Crunchbase

The crunchbase.com scraper can scrape the following data:

  • Company pages for a specific company data.
  • Investor pages for a specific investor data.
  • Search pages for hotel listing data.
View sample data

For the full guide, refer to our blog article How to Scrape Crunchbase Company and People Data (2023 Update)

Domain

The domain.com.au scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Domain.com.au Real Estate Property Data

Ebay

The ebay.com scraper can scrape the following data:

  • Product pages for a specific product data.
  • Search pages for product listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Ebay using Python

Etsy

The etsy.com scraper can scrape the following data:

  • Product pages for a specific product data.
  • Shop pages for a specific shop data.
  • Search pages for product listings data.
View sample data

For the full guide, refer to our blog article

Fashionphile

The fashionphile.com scraper can scrape the following data:

  • Product pages for product data.
  • Search pages for product listing data.
View sample data

For the full guide, refer to our blog article How to Scrape Fashionphile for Second Hand Fashion Data

Glassdoor

The glassdoor.com scraper can scrape the following data:

  • Company search for company page URLs (overiew, jobs, reviews, salaries).
  • Job pages for a specific job data.
  • Salary pages for a specific company salaries data.
  • Review pages for a specific company reviews data.
View sample data

For the full guide, refer to our blog article How to Scrape Glassdoor (2023 update)

Goat

The goat.com scraper can scrape the following data:

  • Product pages for product data.
  • Search pages for product listing data.
View sample data

For the full guide, refer to our blog article How to Scrape Goat.com for Fashion Apparel Data in Python

Homegate

The homegate.ch scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Homegate.ch Real Estate Property Data

Idealista

The idealista.com scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
  • Provinces pages for search pages URLs.
View sample data

For the full guide, refer to our blog article How to Scrape Idealista.com in Python - Real Estate Property Data

Immobilienscout24

The immobilienscout24.de scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Immobilienscout24.de Real Estate Data

Immoscout24

The immoscout24.ch scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Immoscout24.ch Real Estate Property Data

Immowelt

The immowelt.de scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Immowelt.de Real Estate Data

Indeed

The indeed.com scraper can scrape the following data:

  • Job pages for a specific job data.
  • Search pages for job listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Indeed.com (2023 Update)

Instagram

The instagram.com scraper can scrape the following data:

  • User pages for a specific user data.
  • Post Pages for a specific post data.
View sample data

For the full guide, refer to our blog article How to Scrape Instagram

Leboncoin

The leboncoin.fr scraper can scrape the following data:

  • product pages for a specific product data.
  • Search pages for product listing data.
View sample data

For the full guide, refer to our blog article How to Web Scrape Leboncoin.fr using Python

Nordstorm

The nordstorm.com scraper can scrape the following data:

  • Product pages for product data.
  • Search pages for product listing data.
View sample data

For the full guide, refer to our blog article How to Scrape Nordstrom Fashion Product Data

Realestate

The realestate.com.au scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Realestate.com.au Property Listing Data

Realtor

The realtor.com scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
  • Feed pages for newly added propery listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Realtor.com - Real Estate Property Data

Reddit

The reddit.com scraper can scrape the following data:

  • Reddit subreddit pages for subbreddit and post data.
  • Reddit post pages for post and comment data.
  • Reddit user profile pages for post data.
  • Reddit user profile pages for comment data.
View sample data

For the full guide, refer to our blog article

Redfin

The redfin.com scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Redfin Real Estate Property Data in Python

Rightmove

The rightmove.co.uk scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape RightMove Real Estate Property Data with Python

Seloger

The seloger.com scraper can scrape the following data:

  • Property pages for a specific real estate property data.
  • Search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Seloger.com - Real Estate Listing Data

Similarweb

The similarweb.com scraper can scrape the following data:

  • Similarweb.com website pages for website traffic inisghts
  • Similarweb.com website comparing pages for comparing insights
  • Similarweb.com website trend pages for trending websites data
  • Similarweb.com sitemaps for urls
View sample data

For the full guide, refer to our blog article How to Scrape SimilarWeb Website Traffic Analytics

Stockx

The stockx.com scraper can scrape the following data:

  • Product pages for a specific product data.
  • Search pages for product listings data.
View sample data

For the full guide, refer to our blog article How to Scrape StockX e-commerce Data with Python

Threads

The threads.net scraper can scrape the following data:

  • User pages for a specific user data.
  • Theads Pages for a specific thread data.
View sample data

For the full guide, refer to our blog article How to scrape Threads by Meta using Python (2023-08 Update)

TikTok

The tiktok.com scraper can scrape the following data:

  • TikTok post pages for post/video data.
  • TikTok comments for comment data on a specific post/video.
  • TikTok profile pages for profile data.
  • TikTok channel pages for video data.
  • TikTok search pages for video data on search results. It can scrape profiles as well.
View sample data

For the full guide, refer to our blog article How To Scrape TikTok in 2024

Tripadvisor

The tripadvisor.com scraper can scrape the following data:

  • Holtel pages for a specific hotel data.
  • Holtes data in a specific location.
  • Search pages for hotel listings data.
View sample data

For the full guide, refer to our blog article How to Scrape TripAdvisor.com (2023 Updated)

Trustpilot

The trustpilot.com scraper can scrape the following data:

  • Company pages for a specific company data.
  • Reviews data in a specific company reviews.
  • Search pages for company listings data.
View sample data

For the full guide, refer to out blog article How to Scrape Trustpilot.com Reviews and Company Data

Twitter(X)

The twitter.com scraper can scrape the following data:

  • Twitter tweet pages for a specific tweet data.
  • Twitter user pages for a specific user data.
View sample data

For the full guide, refer to our blog article How to Scrape X.com (Twitter) using Python (2023-11 Update)

Vestiaire collective

The vestiairecollective.com scraper can scrape the following data:

  • Product pages for product data.
  • Search pages for product listing data.
View sample data

For the full guide, refer to our blog article How to Scrape Vestiaire Collective for Fashion Product Data

G2

The g2.com scraper can scrape the following data:

  • Review pages for company reviews.
  • Search pages for company data.
  • Alternative pages for company alternatives.
View sample data

For the full guide, refer to our blog article How to Scrape G2 Company Data and Reviews

Walmart

The walmart.com scraper can scrape the following data:

  • Product pages for product data.
  • Search pages for product listing data.
View sample data

For the full guide, refer to our blog article How to Web Scrape Walmart.com (2023 Update)

Wellfound

The wellfound.com scraper can scrape the following data:

  • Company pages for company data.
  • Search pages for job listing data.
View sample data

For the full guide, refer to our blog article How to Scrape Wellfound Company Data and Job Listings

Linkedin

The linkedin.com scraper can scrape the following data:

  • Linkedin public profile pages for profile data
  • Linkedin company pages for company data
  • Linkedin job search pages for job data
  • Linkedin job pages for detailed job data
View sample data

For the full guide, refer to our blog article How to Scrape LinkedIn.com Profile, Company, and Job Data

Yellowpages

The yellowpages.com scraper can scrape the following data:

  • Business pages for business data.
  • Search pages for business listing data.
View sample data

For the full guide, refer to our blog article How to Scrape YellowPages.com Business Data and Reviews (2023 Update)

Yelp

The yelp.com scraper can scrape the following data:

  • Business pages for business data.
  • Review pages for business data.
  • Search pages for business listing data.
View sample data

For the full guide, refer to our blog article How to Web Scrape Yelp.com (2023 update)

Zillow

The zillow.com scraper can scrape the following data:

  • Zillow property pages for a specific real estate property data.
  • Zillow search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Zillow Real Estate Property Data in Python

Zoominfo

The zoominfo.com scraper can scrape the following data:

  • Company pages for company data.
  • Directory pages for company page URLs.
  • Company pages for company FAQs data.
View sample data

For the full guide, refer to our blog article How to Scrape Zoominfo Company Data (2023 Update)

Zoopla

The zoopla.co.uk scraper can scrape the following data:

  • Zoopla property pages for a specific real estate property data.
  • Zoopla search pages for real estate property listings data.
View sample data

For the full guide, refer to our blog article How to Scrape Zoopla Real Estate Property Data in Python

About

Web scrapers for popular targets powered Scrapfly.io

Topics

Resources

License

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages