Skip to content

ashalan/glassdoor-salary-scraper

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

glassdoor-salary-scraper

Current Version: 2.0

Glassdoor web crawler and scraper providing salary data. Forked and modified from williamxie11.

Installation

  • Python 2.7.*

  • Beautiful Soup 4 (4.4.1)

$ pip install bs4
  • Selenium Webdriver
$ pip install selenium

Usage

  1. Open the scraper Python script with a text editor of your choice.

  2. Add your Glassdoor account username and password.

  1. Specify the cities you would like to scrape in cities.txt with each city on a new line

  2. Run the scraper

$ python scraper.py

NOTE: Glassdoor will require you to insert CAPTCHA on login or during the scraping process. The script will poll until CAPTCHA is entered during scraping.

Results

response

The web scraper will output a JSON with the name "[city name].json" in the Data directory, it will also output a JSON with the name "allcities.json" that will include all the cities data. Each data point in the JSON corresponds to one salary on Glassdoor with attributes (company name, job title).

About

Web crawler/scraper for Glassdoor salary data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%