Skip to content

Created a SparkML RandomForest model to predict total employee compensation. Queried data with SparkSQL, ran PySpark scripts to run EDA, pre-process data, and train model achieving with 0.98 R2 score.

Notifications You must be signed in to change notification settings

mandira-sawkar/getPaid

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

Objective: Building a ML system using PySpark by analysing the employee data and build a model to predict employee compensation.

About

Created a SparkML RandomForest model to predict total employee compensation. Queried data with SparkSQL, ran PySpark scripts to run EDA, pre-process data, and train model achieving with 0.98 R2 score.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published