An R package to extract information from popular job site indeed.com
- rvest
- stringr
the package can be installed with devtools as:
# install.packages("devtools")
devtools::install_github("gpoudel/JobsInR")
#Also do not forget to install/load dependencies
#install.packages('rvest')
#install.packages('stringr')
library('JobsInR')
library('rvest')
library('stringr')
As of now the package has the following functions:
This function returns the total number of jobs available.
Eg:
jobs_count('python', 'texas')
[1] 2693
This function returns a dataframe. The datafrmae consists of following columns
- Date
- Title
- Company
- Location
- Summary
- URL
Note: This function also eliminates any duplicate entries listed on website (based on original URL), thus the number of rows in dataframe can be less than that got from using the function 'jobs_count()'.
Eg:
delphi_texas <- jobs_df('delphi', 'texas')