This project is a Python-based web scraper that collects bike product information from Amazon and saves the results to an Excel file.
- Scrapes multiple bike types (mountain, road, electric, hybrid, BMX, etc.)
- Uses 10 parallel workers and 10 rotating user agents for efficiency and anti-blocking
- Handles Amazon's "Start Shopping" and first-time visit pages automatically
- Saves results to a timestamped Excel file
- Logs all activity to
bike_scraper.log
- Python 3.8+
- Install dependencies:
pip install aiohttp beautifulsoup4 pandas fake-useragent openpyxl
- Clone the repository or copy the files to your project folder.
- Run the scraper:
python bike_price.py
- Results will be saved as
bike_data_YYYYMMDD_HHMMSS.xlsxin the project folder.
- Excel file with columns: title, price, rating, reviews, url, features, and technical details
- Log file:
bike_scraper.log
- This script is for educational purposes. Scraping Amazon may violate their terms of service.
- For production use, add proxy rotation and more robust error handling.