Skip to content

Latest commit

 

History

History
33 lines (25 loc) · 1.75 KB

File metadata and controls

33 lines (25 loc) · 1.75 KB

TripAdvisor & Google Search Scraper

Overview

This project is a sophisticated web scraping tool designed to extract top restaurant data from TripAdvisor and verify search results visibility on Google Search. Utilizing a blend of powerful web scraping libraries such as Selenium, Requests, and BeautifulSoup, this tool delivers accurate and efficient data extraction for specific regions.

Features

  • TripAdvisor Scraper: Efficiently scrapes data about the best restaurants in specified regions from TripAdvisor.
  • Google Search Verification: Checks whether the scraped data appears in Google Search results, providing an additional layer of verification and utility.

Technologies

  • Selenium: Automates web browsers, providing the backbone for real-time data scraping and interaction.
  • Requests: Handles HTTP requests, allowing efficient data retrieval.
  • BeautifulSoup: Parses HTML and XML documents, making it easier to navigate and search the parse tree.

Getting Started

To use this scraper, you'll need to have Python installed on your machine along with the Selenium, Requests, and BeautifulSoup libraries. Detailed instructions on setting up the environment and running the scraper are provided below.

Prerequisites

  • Python 3.x
  • Selenium
  • Requests
  • BeautifulSoup

Installation

  1. Clone the repository to your local machine.
  2. Install the necessary libraries using pip install -r requirements.txt.
  3. Set up your web driver for Selenium (e.g., ChromeDriver for Google Chrome).

Contribution

Contributions to this project are welcome. Please ensure that your code adheres to the project's coding standards and include appropriate tests.

License

This project is licensed under the MIT License - see the LICENSE file for details.