Latest commit 0452be7 Sep 9, 2018
Type Name Latest commit message Commit time
.gitignore fixed axis Aug 8, 2018
README.md added README Sep 9, 2018
analysis2.ipynb cleaned analysis code Sep 9, 2018
helper.py more analysis work Aug 29, 2018
presentation2.pdf added pdf presentation Aug 8, 2018
scrape.py cleaned analysis code Sep 9, 2018

README.md

glassdoor_scraping_project

This project had two objectives. First, I wanted to explore the current state of the data-related job market. Second, I wanted to gain first-hand experience in web scraping.

Glassdoor is an American company established in 2007. It started off as a platform where employees could anonymously post reviews about salary and workplace environment. It has grown to be one of the most trusted websites for company research and job hunting. Glassdoor provides its own salary estimate for many job postings on the website, and also provides a rating for each company.

I wrote two Python scripts, "scrape.py" and "helper.py", to responsibly scrape the Glassdoor website. I sent queries for Data Scientist, Data Engineer, and Data Analyst jobs in select U.S. cities to the website and used the Selenium package to collect information about the job postings.
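
The query loop described above could look roughly like the sketch below. It is not the code in "scrape.py"; it only illustrates the idea of crossing job titles with cities and pausing between requests. The city list, the function names, the base URL, and the "keyword"/"location" parameter names are all hypothetical placeholders, not Glassdoor's real URL scheme. The only Selenium calls it relies on are `driver.get(url)` and the `driver.page_source` attribute.

```python
import itertools
import random
import time
from urllib.parse import urlencode

JOB_TITLES = ["Data Scientist", "Data Engineer", "Data Analyst"]
# Placeholder cities for illustration; the actual list used in the project is not shown here.
CITIES = ["New York, NY", "San Francisco, CA"]


def build_queries(titles, cities):
    """Return every (title, city) pair to search for."""
    return list(itertools.product(titles, cities))


def build_url(base_url, title, city):
    """Build a search URL. The parameter names are hypothetical."""
    return base_url + "?" + urlencode({"keyword": title, "location": city})


def scrape_queries(driver, base_url, queries,
                   min_delay=2.0, max_delay=5.0, sleep=time.sleep):
    """Load one results page per query, pausing a random interval between requests.

    `driver` is any object with a Selenium-style `get(url)` method and a
    `page_source` attribute. The random pause keeps the request rate low.
    """
    pages = {}
    for title, city in queries:
        driver.get(build_url(base_url, title, city))
        pages[(title, city)] = driver.page_source
        sleep(random.uniform(min_delay, max_delay))
    return pages
```

With Selenium installed, a real driver can be created with `from selenium import webdriver` followed by `driver = webdriver.Chrome()` and passed to `scrape_queries`. Passing `sleep` as a parameter makes it possible to try the loop without actually waiting.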

I ran my analysis on about 10,000 job postings. The analysis code is in the Jupyter notebook "analysis2.ipynb".
