This project is a Python-based web scraper that extracts business information from a public website and automatically updates the data into a Google Sheet using the Google Sheets API.
- Extracts data such as:
- Company name
- Address
- Services
- Website or contact links
- Supports HTML saving by city
- Semi-automated for Cloudflare-protected websites
- Google Sheets API integration for live updates
- Python
- Requests, BeautifulSoup
- Google Sheets API
- Pandas
- Clone the repo
- Set up your credentials.json from Google Cloud
- Run the Python script
- Data will appear in your connected Google Sheet
This project is licensed under the MIT License.