Skip to content

water2015sit/Glassdoor_jobs_scraper

Repository files navigation

Data Scientist Projects

Code and Resources Used

original author: Ömer Sakarya , Oct 15, 2019

git : https://github.com/arapfaik/scraping-glassdoor-selenium

original tutorial: https://towardsdatascience.com/selenium-tutorial-scraping-glassdoor-com-in-10-minutes-3d0915c6d905

(youtube: KenJee): https://www.youtube.com/watch?v=GmW4F6MHqqs&list=PL2zq7klxX5ASFejJj80ob9ZAnBHdz5O1t (github): https://github.com/wizrox/dsSlryProj/blob/master/glassdoor_scrapper.py

Overview

  1. Scraped over 1000 job descriptions from glassdoor using python and selenium
  2. Engineered features from the text of each job description to quantify the value companies put on python, excel, aws, and spark.
  3. Optimized Linear, Lasso, and Random Forest Regressors using GridsearchCV to reach the best model.

About

Repo for the data science glassdoor salary project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published