Skip to content

plkmo/Reviews_Crawlers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Review Web Crawler

Overview

Crawls the web for social media comments of specific entity

Contents

  1. Google reviews crawler
  2. Tripadvisor reviews crawler

Requirements

Requirements: Python (3.6+), Selenium (https://selenium-python.readthedocs.io/installation.html#downloading-python-bindings-for-selenium), Selenium chromedriver (https://selenium-python.readthedocs.io/installation.html#drivers), Google Chrome browser

Google reviews crawler

This crawler crawls through google reviews, tripadvisor for a particular search entity (search_query) and extracts the review and corresponding rating (out of 5)

Usage:

main crawler.py --search_query [default: changi+city+point]  
		--mode [google_reviews, tripadvisor]
		--num_reviews [default: 6000] 

Results output file will be saved in ./data/ folder.

About

Web crawlers (Google reviews, Tripadvisor)

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages