Skip to content

Latest commit

 

History

History
16 lines (11 loc) · 791 Bytes

README.md

File metadata and controls

16 lines (11 loc) · 791 Bytes

Data Scientist Projects

Code and Resources Used

original author: Ömer Sakarya , Oct 15, 2019

git : https://github.com/arapfaik/scraping-glassdoor-selenium

original tutorial: https://towardsdatascience.com/selenium-tutorial-scraping-glassdoor-com-in-10-minutes-3d0915c6d905

(youtube: KenJee): https://www.youtube.com/watch?v=GmW4F6MHqqs&list=PL2zq7klxX5ASFejJj80ob9ZAnBHdz5O1t (github): https://github.com/wizrox/dsSlryProj/blob/master/glassdoor_scrapper.py

Overview

  1. Scraped over 1000 job descriptions from glassdoor using python and selenium
  2. Engineered features from the text of each job description to quantify the value companies put on python, excel, aws, and spark.
  3. Optimized Linear, Lasso, and Random Forest Regressors using GridsearchCV to reach the best model.