Skip to content

Scrape Indeed for job listings and Indeed & Glassdoor for company reviews. Topic model the reviews.

License

Notifications You must be signed in to change notification settings

henridwyer/company-reviews

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Company Reviews by Employees

Scrape Indeed for all recent jobs listings matching given keywords, and then scrape Indeed and Glassdoor the reviews of the companies offering jobs.

Plot the ratings, and use topic modeling on the reviews to understand how employees see their company. Visualize the topics.

Written in python by Henri Dwyer.

Getting Started

Two ipython notebooks show how to use the library for scraping and plotting the ratings. Run them locally or see them hosted:

Topic Modeling

For topic modeling, you can use a Hierarchical Dirichlet Process. I modified the library written by Chong Wang and David Blei in C++:

This ipython notebook shows how to prepare the data for use with the HDP library, and how to explore the topics form the results.

Further reading

An example: Data science jobs in New York

About

Scrape Indeed for job listings and Indeed & Glassdoor for company reviews. Topic model the reviews.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages