This project predicts the deliquency rate in the Freddie Mac dataset in a distributed way using Dask and PySpark.
forked from rajeevdixit19/Scaleable-Ml
-
Notifications
You must be signed in to change notification settings - Fork 0
arjunsawhney1/scalable-ML
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
In this repo, I build a LogisticRegression prediction model with Dask and PySpark and initialize an AWS EMR cluster to run the entire pipeline.
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- Python 94.5%
- Shell 5.5%