PySparkFeatureEngineeringSelectionIris

Azure Databricks notebooks that use the Iris dataset from scikit-learn to demonstrate feature engineering of continuous values and feature selection.

Run the notebook "Engineering and Feature Selection of Iris Dataset from SKLearn" to kick off the following steps:

0Mount Data -

Mounts containers for storing processed files
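A minimal sketch of what a mount step can look like, assuming the containers are Azure Blob Storage containers accessed with an account key. The function name `mount_container` and all of its parameters are hypothetical and do not come from the notebooks. `dbutils` is only available inside a Databricks notebook, so it is passed in as an argument here.

```python
def mount_container(dbutils, storage_account, container, account_key, mount_point):
    """Mount an Azure Blob container at mount_point unless it is already mounted.

    All argument names are illustrative assumptions, not taken from the notebooks.
    """
    source = f"wasbs://{container}@{storage_account}.blob.core.windows.net"
    conf_key = f"fs.azure.account.key.{storage_account}.blob.core.windows.net"

    # dbutils.fs.mounts() lists existing mounts; skip if the mount point is taken.
    already_mounted = any(m.mountPoint == mount_point for m in dbutils.fs.mounts())
    if not already_mounted:
        dbutils.fs.mount(
            source=source,
            mount_point=mount_point,
            extra_configs={conf_key: account_key},
        )
    return source
```

In a real workspace the account key would normally be read from a secret scope rather than written into the notebook.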

1Preprocess Data -

Reads the Iris dataset from scikit-learn, builds three dataframes (Features, Targets, and Features + Targets), and saves each dataframe as Parquet files in the mounted containers.
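The three dataframes described above could be built roughly as follows. This is a sketch using pandas rather than the notebook's own code; the function name `build_iris_frames` and the column name `target` are assumptions.

```python
import pandas as pd
from sklearn.datasets import load_iris


def build_iris_frames():
    """Return (features, targets, features_and_targets) as pandas DataFrames."""
    iris = load_iris(as_frame=True)
    features = iris.data                          # four continuous columns
    targets = iris.target.to_frame(name="target")  # integer class labels
    combined = pd.concat([features, targets], axis=1)
    return features, targets, combined


features, targets, combined = build_iris_frames()
print(combined.head())
```

Each frame could then be written with `DataFrame.to_parquet` to a path under the mounted container; the actual paths used by the notebooks are not given in this README.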

2ProfileFeaturesAndTarget -

Reads the Features + Targets dataframe from the Parquet files in the mounted containers and performs Pandas Profiling on the entire dataframe. The profile identifies which columns are not highly correlated with the target and with each other, and flags duplicate rows and missing rows / columns.
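The checks that the profiling report surfaces can be approximated with plain pandas. The sketch below is not the Pandas Profiling report itself; it loads the data directly from scikit-learn instead of the Parquet files, and the variable names are illustrative.

```python
from sklearn.datasets import load_iris

# Features plus a 'target' column in a single DataFrame.
df = load_iris(as_frame=True).frame

# Pairwise Pearson correlations between all columns, including the target.
corr = df.corr()

# Absolute correlation of each feature with the target, strongest first.
target_corr = corr["target"].drop("target").abs().sort_values(ascending=False)

# Count of fully duplicated rows and of missing values per column.
n_duplicates = int(df.duplicated().sum())
missing_per_column = df.isna().sum()

print(target_corr)
print("duplicate rows:", n_duplicates)
print(missing_per_column)
```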

3EngineeringContinousFeatures

Reads the Features dataframe from the Parquet files in the mounted containers and scales the values using several different methods.
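This README does not name the scaling methods, so the following is only an illustration with three common scikit-learn scalers (standardization, min-max, and robust scaling). The choice of scalers is an assumption, and the data is loaded directly from scikit-learn rather than from Parquet.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.preprocessing import MinMaxScaler, RobustScaler, StandardScaler

X = load_iris().data  # shape (150, 4), all continuous

scalers = {
    "standard": StandardScaler(),  # zero mean, unit variance per column
    "minmax": MinMaxScaler(),      # rescales each column to [0, 1]
    "robust": RobustScaler(),      # centers on the median, scales by the IQR
}

# Fit each scaler on the features and keep the transformed arrays by name.
scaled = {name: scaler.fit_transform(X) for name, scaler in scalers.items()}

for name, values in scaled.items():
    print(name, np.round(values.min(axis=0), 2), np.round(values.max(axis=0), 2))
```

Comparing the column ranges side by side makes it easier to see how each method treats outliers and differing units.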

4FeatureSelection

Reads the Features + Targets dataframe from the Parquet files in the mounted containers, then computes and plots the importance of each feature.
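One common way to compute per-feature importance is the impurity-based score of a random forest. The README does not say which method the notebook uses, so the model and its settings below are assumptions chosen for illustration.

```python
import pandas as pd
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

iris = load_iris(as_frame=True)
X, y = iris.data, iris.target

# Fixed random_state so repeated runs give the same ranking.
model = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)

# Impurity-based importances, one per feature, sorted strongest first.
importances = pd.Series(model.feature_importances_, index=X.columns)
importances = importances.sort_values(ascending=False)
print(importances)
```

With matplotlib installed, `importances.plot.barh()` draws the ranking as a horizontal bar chart.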

