hunterness-dev/BikeScraper


Amazon Bike Price Scraper

This project is a Python-based web scraper that collects bike product information from Amazon and saves the results to an Excel file.

Features

  • Scrapes multiple bike types (mountain, road, electric, hybrid, BMX, etc.)
  • Runs 10 parallel workers and rotates through 10 user agents to speed up collection and reduce the chance of being blocked
  • Handles Amazon's "Start Shopping" and first-time visit pages automatically
  • Saves results to a timestamped Excel file
  • Logs all activity to bike_scraper.log
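
The worker and user-agent rotation described above could look roughly like the following. This is a minimal sketch using only the standard library, not the code in bike_price.py: the names scrape_all, worker, and fake_fetch are hypothetical, the user-agent strings are placeholders, and the real script would pass an aiohttp-based fetch function instead of the stand-in.

```python
import asyncio
import itertools

# Assumed values: the README only states the counts (10 workers, 10 user agents).
NUM_WORKERS = 10
USER_AGENTS = [f"ExampleAgent/{i}.0" for i in range(10)]  # placeholder strings


async def worker(queue, ua_cycle, fetch, results):
    """Take URLs from the queue until cancelled, rotating the user agent per request."""
    while True:
        url = await queue.get()
        try:
            headers = {"User-Agent": next(ua_cycle)}
            results.append(await fetch(url, headers))
        except Exception as exc:
            # Record the failure so one bad page does not stop the worker.
            results.append({"url": url, "error": repr(exc)})
        finally:
            queue.task_done()


async def scrape_all(urls, fetch, num_workers=NUM_WORKERS):
    """Fan the URLs out to a fixed pool of workers and collect their results."""
    queue = asyncio.Queue()
    for url in urls:
        queue.put_nowait(url)
    ua_cycle = itertools.cycle(USER_AGENTS)
    results = []
    tasks = [
        asyncio.create_task(worker(queue, ua_cycle, fetch, results))
        for _ in range(num_workers)
    ]
    await queue.join()  # wait until every queued URL has been processed
    for task in tasks:
        task.cancel()  # workers loop forever, so stop them explicitly
    await asyncio.gather(*tasks, return_exceptions=True)
    return results


async def fake_fetch(url, headers):
    """Stand-in for a real HTTP request; returns what would have been sent."""
    await asyncio.sleep(0)
    return {"url": url, "user_agent": headers["User-Agent"]}


if __name__ == "__main__":
    demo_urls = [f"https://example.com/page/{i}" for i in range(25)]
    print(len(asyncio.run(scrape_all(demo_urls, fake_fetch))))
```

A fixed pool with a shared queue caps the number of requests in flight at the worker count, which is the usual reason for preferring it over launching one task per URL.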

Requirements

  • Python 3.8+
  • Install dependencies:
    pip install aiohttp beautifulsoup4 pandas fake-useragent openpyxl

Usage

  1. Clone the repository or copy the files to your project folder.
  2. Run the scraper:
    python bike_price.py
  3. Results will be saved as bike_data_YYYYMMDD_HHMMSS.xlsx in the project folder.
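
The timestamped file name in step 3 can be produced with strftime. A minimal sketch, assuming the pattern shown above; the helper name output_filename is hypothetical and may not match the script.

```python
from datetime import datetime


def output_filename(now=None):
    """Build a name of the form bike_data_YYYYMMDD_HHMMSS.xlsx."""
    now = now or datetime.now()
    return f"bike_data_{now.strftime('%Y%m%d_%H%M%S')}.xlsx"


if __name__ == "__main__":
    print(output_filename(datetime(2024, 1, 2, 3, 4, 5)))
    # prints bike_data_20240102_030405.xlsx
```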

Output

  • Excel file with columns: title, price, rating, reviews, url, features, and technical details
  • Log file: bike_scraper.log
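
One way to keep every row aligned with these columns is to normalize each scraped record before writing it. The sketch below is illustrative only: the exact column headers (in particular technical_details) and the helper to_row are assumptions, not taken from the script.

```python
# Assumed header names, based on the column list above.
COLUMNS = ["title", "price", "rating", "reviews", "url", "features", "technical_details"]


def to_row(record):
    """Return a dict with exactly the expected columns, in order.

    Missing fields become None; lists and dicts are flattened to strings
    so each value fits in a single spreadsheet cell.
    """
    row = {}
    for col in COLUMNS:
        value = record.get(col)
        if isinstance(value, (list, tuple)):
            value = "; ".join(str(v) for v in value)
        elif isinstance(value, dict):
            value = "; ".join(f"{k}: {v}" for k, v in value.items())
        row[col] = value
    return row


if __name__ == "__main__":
    sample = {
        "title": "Example Bike",
        "features": ["21 speeds", "disc brakes"],
        "technical_details": {"Frame": "Aluminum"},
    }
    print(to_row(sample))
```

Rows shaped like this can then be handed to pandas, for example pandas.DataFrame(rows, columns=COLUMNS).to_excel(path, index=False), which relies on openpyxl for .xlsx files.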

Notes

  • This script is for educational purposes. Scraping Amazon may violate its terms of service.
  • For production use, add proxy rotation and more robust error handling.
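
As one example of the more robust error handling mentioned above, a request can be retried with exponential backoff and jitter. This is a hedged sketch rather than part of the project: fetch_with_retry and its parameters are hypothetical, and fetch stands for any coroutine that takes a URL and raises on failure.

```python
import asyncio
import random


async def fetch_with_retry(fetch, url, retries=3, base_delay=1.0, sleep=asyncio.sleep):
    """Call fetch(url), retrying failed attempts with exponential backoff.

    The wait before retry n is base_delay * 2**n plus up to base_delay
    seconds of random jitter. The last failure is re-raised.
    """
    for attempt in range(retries + 1):
        try:
            return await fetch(url)
        except Exception:
            if attempt == retries:
                raise
            delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
            await sleep(delay)
```

The sleep parameter is injectable only so the delay can be stubbed out in tests; in normal use the default asyncio.sleep applies.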

About

An asynchronous web scraper built with Python that collects and analyzes bike price data from Amazon, featuring proxy rotation, user-agent spoofing, and robust error handling. Built with aiohttp for high-performance concurrent requests and pandas for data processing, with comprehensive logging and anti-detection measures.
