# Reference

## Legend

![image33.png](attachment:image33.png)

## Index

- [Data Science & The Nature of Data](#DataScienceTheNatureofData)
    - [Import a library](#Importalibrary)
    - [Load a dataframe](#Loadadataframe)
    - [List rows of data](#Listrowsofdata)
    - [List columns of data](#Listcolumnsofdata)
    - [Filter data](#Filterdata)
- [Plotting](#Plotting)
    - [Generate a scatterplot](#Generateascatterplot)
    - [Label graph](#Labelgraph)
    - [Generate a bar plot](#Generateabarplot)
    - [Group datapoints](#Groupdatapoints)
    - [Generate a line plot](#Generatealineplot)
    - [Generate a histogram](#Generateahistogram)
- [Descriptive Statistics](#DescriptiveStatistics)
    - [Calculate the mean of a dataset](#Calculatethemeanofadataset)
    - [Calculate the mode of a dataset](#Calculatethemodeofadataset)
    - [Calculate the median of a dataset](#Calculatethemedianofadataset)
    - [Summary of a dataset](#Summaryofadataset)
    - [Get a calculation of a random sample set](#Getacalculationofarandomsampleset)
- [Measures of Association](#MeasuresofAssociation)
    - [Plot a labeled correlation heatmap](#Plotalabeledcorrelationheatmap)
    - [Create a contingency table](#Createacontingencytable)
- [Clustering](#Clustering)
    - [Load data with identifier (ID)](#Loaddatawithidentifier)
    - [Create a dendrogram](#Createadendogram)
    - [Plot Clusters](#PlotClusters)
- [KNN Classification](#KNNClassification)
    - [Split X and Y into train and test sets](#SplitXY)
    - [Define the LDA model](#DefineLDA)
    - [Train the LDA model](#TrainLDA)
    - [Predict classification with the LDA model](#PredictLDA)
    - [Define the KNN classification model](#DefineKNNC)
    - [Train the KNN classification model](#TrainKNNC)
    - [Predict classification with the KNN model](#PredictKNNC)
    - [Get confusion matrix of a model](#ConfusionMatrix)
    - [Get accuracy of a model](#AccuracyofModel)
    - [Get recall and precision of a model](#RecallandPrecision)
- [KNN Regression](#KNNRegression)
    - [Create a pipeline with a scaler and KNN regression](#KNNRPipeline)
    - [Train the KNN regression model](#TrainKNNR)
    - [Predict values using the KNN regression model](#PredictKNNR)
    - [Model scatterplot of observed data](#ModelKNNR)
    - [Model scatterplot of predictions](#ModelKNNR)
- [Simple Linear Regression](#SimpleLinearRegression)
    - [Define the linear regression model](#CreateLRM)
    - [Train the linear regression model](#TrainLRM)
- [Multiple Linear Regression](#MultipleLinearRegression)
    - [Define the multiple linear regression model](#CreateMLR)
    - [Train the multiple linear regression model](#TrainMLR)
    - [Get predictions from the multiple linear regression model](#PredictMLR)
    - [Check linearity and equal variance](#LinearityMLR)
- [Logistic Regression](#LogisticRegression)
    - [Define the logistic regression model](#CreateLRM)
- [Ridge and Lasso Regression](#Ridge&Lasso)
    - [Train the logistic regression model with ridge penalty](#DefLRRP)
    - [Get predictions from the ridge regression model](#PredRRM)
    - [Train the logistic regression model with lasso penalty](#TrainLRLP)
    - [Get predictions from the lasso regression model](#PredLRL)
- [Decision Trees](#DecisionTree)
    - [Build a decision tree](#DT)
    - [Create the decision tree model](#CreateDT)
    - [Fit the decision tree model](#FitDT)
    - [Visualize the decision tree model & interpret algorithm fit](#VisDT)
- [Regression Trees](#RegTrees)
    - [Prepare train/test sets](#PrepReggTree)
    - [Fit the model](#FitRegTree)
    - [Get predictions](#PredReggTree)
    - [Evaluate the model](#EvalPreddRegg)
    - [Visualize the model](#VisReggTree)
- [Random Forest](#RandomForest)
    - [Create bagging classifier](#Bag)
    - [Fit and score bagging classifier](#FitBag)
    - [Get out of bag (OOB) score](#GetBagScore)
    - [Create random forest classifier](#CreateRandFore)
    - [Fit and score random forest](#FitRandFore)
    - [Visualize ensemble model](#VisEnsem)
- [Gradient Boosting](#GradBoost)
    - [Create gradient boosting classifier](#CreateGradBoost)
    - [Fit the classifier](#FitClassGB)
    - [Get predictions](#GetPredGB)
    - [Evaluate the model](#EvalGB)
    - [Visualize results](#VisGB)
    - [Detecting overfitting with deviance](#OverfitDev)
- [Support Vector Machines](#SVM)
    - [Prepare train/test sets](#PrepSVM)
    - [Fit the linear SVM model](#FitSVM)
    - [Predict using the linear SVM model](#PredSVM)
    - [Evaluate the linear SVM model](#EvalSVM)
    - [Fit the nonlinear SVM model](#FitNonlinSVM)
    - [Predict with the nonlinear SVM model](#PredNonlinSVM)
    - [Evaluate the nonlinear SVM model](#EvalNonlinSVM)
- [Naive Bayes](#NaiveBayes)
    - [Prepare train/test sets](#PrepNB)
    - [Fit the model](#FitNB)
    - [Get predictions](#PredNB)
    - [Evaluate the model](#EvalNB)
- [Crossvalidation and Nested crossvalidation](#CrossVal)
    - [Prepare train/test sets with crossvalidation](#PrepCross)
    - [Train model & predict with crossvalidation](#TrainCross)
    - [Evaluate the model](#EvalCross)
    - [Train model & predict with nested crossvalidation](#TrainNCross)
    - [Evaluate the model](#EvalNCross)
- [Data cleaning, transformations, & versioning](#Data)
    - [Missing data](#MissingData)
    - [Transforming data](#Transform)
    - [Common scale](#Common)
    - [Non-normality](#NonNormal)
    - [Create a new variable](#NewVar)




## Data Science & The Nature of Data<a id='DataScienceTheNatureofData'></a>

### Import a library<a id='Importalibrary'></a>

This command imports a library and lets you refer to it by a shortened alias.

![import.png](attachment:import.png)

![image46.png](attachment:image46.png)

### Load a dataframe<a id='Loadadataframe'></a>

This command loads a dataset into the container dataframe.

![variables.png](attachment:variables.png)   ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![text.png](attachment:text.png)      ![insertText.png](attachment:insertText.png)

![image17.png](attachment:image17.png)     

Output:

![image36.png](attachment:image36.png)
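
For readers who prefer text to block images, here is a minimal Python sketch of the same idea. It assumes the blocks generate pandas calls; the inline CSV text and its column names are made up so the example runs without a file on disk.

```python
import io

import pandas as pd  # "pd" is the shortened alias for the library

# Hypothetical CSV content standing in for a real data file.
csv_text = """name,height,weight
Ann,160,55
Bob,175,72
Cy,168,64
"""

# read_csv accepts a file path or, as here, a file-like object.
dataframe = pd.read_csv(io.StringIO(csv_text))
print(dataframe)
```

With a real dataset you would pass the file's path to `read_csv` instead of the `StringIO` object.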

### List rows of data<a id='Listrowsofdata'></a>

This command lists a user-defined range of rows from the dataset.

![variables.png](attachment:variables.png)     ![variable.png](attachment:variable.png)

![lists.png](attachment:lists.png)

![listvarSublist.png](attachment:listvarSublist.png)

![math.png](attachment:math.png)   ![123.png](attachment:123.png)

![image45.png](attachment:image45.png)

Output:

![image44.png](attachment:image44.png)
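
A minimal text sketch of row selection, assuming a pandas dataframe. The data and the chosen range are hypothetical.

```python
import pandas as pd

dataframe = pd.DataFrame({"x": [10, 20, 30, 40, 50], "y": [1, 2, 3, 4, 5]})

# Slice the rows at positions 1, 2 and 3; the end position (4) is excluded.
rows = dataframe[1:4]
print(rows)
```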

### List columns of data<a id='Listcolumnsofdata'></a>

This command lists a user-defined selection of columns from the dataset.

![lists.png](attachment:lists.png)

![dictVariable.png](attachment:dictVariable.png)     ![createListWith.png](attachment:createListWith.png)

![text.png](attachment:text.png)    ![insertText.png](attachment:insertText.png)

![image1.png](attachment:image1.png)

Output:

![image42.png](attachment:image42.png)
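
A minimal text sketch of column selection, assuming a pandas dataframe. The column names are hypothetical.

```python
import pandas as pd

dataframe = pd.DataFrame({
    "name": ["Ann", "Bob", "Cy"],
    "height": [160, 175, 168],
    "weight": [55, 72, 64],
})

# A list of column names inside the brackets keeps only those columns.
columns = dataframe[["name", "weight"]]
print(columns)
```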

### Filter data<a id='Filterdata'></a>

This command filters the data so that only entries satisfying the condition (those for which it evaluates to true) are shown.

![lists.png](attachment:lists.png)     ![dictVariable.png](attachment:dictVariable.png)

![logic.png](attachment:logic.png)    ![equality.png](attachment:equality.png)

![text.png](attachment:text.png)      ![insertText.png](attachment:insertText.png)

![image29.png](attachment:image29.png)

Output:

![image4.png](attachment:image4.png)
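
A minimal text sketch of filtering, assuming a pandas dataframe. The data and the condition are hypothetical.

```python
import pandas as pd

dataframe = pd.DataFrame({
    "species": ["cat", "dog", "cat", "bird"],
    "age": [3, 5, 7, 2],
})

# The comparison produces one True/False value per row;
# indexing with it keeps only the rows marked True.
cats = dataframe[dataframe["species"] == "cat"]
print(cats)
```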

## Plotting<a id='Plotting'></a>

### Generate a scatterplot<a id='Generateascatterplot'></a>

This command generates a scatterplot using the plotly.express library.

![variables.png](attachment:variables.png)      ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)       ![freestyle1.png](attachment:freestyle1.png)

![image40.png](attachment:image40.png)

Output:

![image47.png](attachment:image47.png)

### Label graph<a id='Labelgraph'></a>

This command adds a title and color coding to elements of the plot.

![variables.png](attachment:variables.png)       ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)      ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)       ![freestyle1.png](attachment:freestyle1.png)

![image5.png](attachment:image5.png)

Output:

![image28.png](attachment:image28.png)

### Generate a bar plot<a id='Generateabarplot'></a>

This command generates a bar plot using the plotly.express library.

![variables.png](attachment:variables.png)

![withVarDo.png](attachment:withVarDo.png)

![image18.png](attachment:image18.png)

Output:

![image9.png](attachment:image9.png)

### Group datapoints<a id='Groupdatapoints'></a>

This command groups datapoints by a user-defined subset.

![variables.png](attachment:variables.png)        ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![text.png](attachment:text.png)      ![insertText.png](attachment:insertText.png)

![image16.png](attachment:image16.png)
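
A minimal text sketch of grouping, assuming the blocks call pandas `groupby`. The data is hypothetical, and taking the mean per group is only one possible follow-up step.

```python
import pandas as pd

dataframe = pd.DataFrame({
    "group": ["a", "b", "a", "b"],
    "score": [1.0, 2.0, 3.0, 6.0],
})

# Rows that share a value in "group" are collected together.
grouped = dataframe.groupby("group")

# Any aggregate can then be computed once per group.
means = grouped["score"].mean()
print(means)
```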

### Generate a line plot<a id='Generatealineplot'></a>

This command generates a line plot using the plotly.express library.

![variables.png](attachment:variables.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)      ![createListWith.png](attachment:createListWith.png)

![image30.png](attachment:image30.png)

Output:

![image24.png](attachment:image24.png)

### Generate a histogram<a id='Generateahistogram'></a>

This command generates a histogram using the plotly.express library.

![variables.png](attachment:variables.png)      ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)      ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)     ![freestyle1.png](attachment:freestyle1.png)

![image19.png](attachment:image19.png)

Output:

![image10.png](attachment:image10.png)

## Descriptive Statistics<a id='DescriptiveStatistics'></a>

### Calculate the mean of a dataset<a id='Calculatethemeanofadataset'></a>

This command calculates the mean of all columns in the dataset dataframe.


![variables.png](attachment:variables.png)

![image37.png](attachment:image37.png)

Output:

![image7.png](attachment:image7.png)

### Calculate the mode of a dataset<a id='Calculatethemodeofadataset'></a>

This command calculates the mode of all columns in the dataset dataframe.

![variables.png](attachment:variables.png)

![image13.png](attachment:image13.png)

Output:

![image8.png](attachment:image8.png)

### Calculate the median of a dataset<a id='Calculatethemedianofadataset'></a>

This command calculates the median of all columns in the dataset dataframe.

![variables.png](attachment:variables.png)

![image48.png](attachment:image48.png)

Output:

![image5.png](attachment:image5.png)

### Summary of a dataset<a id='Summaryofadataset'></a>

This command calculates the total number of entries, mean, standard deviation, minimum entry, first quartile, median (second quartile), third quartile, and maximum entry of all columns in the dataset dataframe.

![variables.png](attachment:variables.png)

![image41.png](attachment:image41.png)

Output:

![image6.png](attachment:image6.png)
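
The descriptive statistics in this section can be sketched in text as follows, assuming a pandas dataframe. The data is hypothetical.

```python
import pandas as pd

dataframe = pd.DataFrame({"a": [1, 2, 2, 7], "b": [10, 20, 30, 40]})

means = dataframe.mean()      # one mean per column
medians = dataframe.median()  # one median per column
# mode() returns one row per mode; columns with fewer modes are padded with NaN.
modes = dataframe.mode()
# describe() reports count, mean, std, min, quartiles and max per column.
summary = dataframe.describe()

print(summary)
```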

### Get a calculation of a random sample set<a id='Getacalculationofarandomsampleset'></a>

This command draws 10 random entries from the dataset dataframe.

![image9.png](attachment:image9.png)      ![image10.png](attachment:image10.png)

![image11.png](attachment:image11.png)               ![image14.png](attachment:image14.png)

![image2.png](attachment:image2.png)

Output:

![image1.png](attachment:image1.png)
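
A minimal text sketch of drawing a random sample and computing a statistic on it, assuming pandas `sample`. The data and the `random_state` seed are hypothetical; the seed only makes the draw repeatable.

```python
import pandas as pd

dataframe = pd.DataFrame({"value": range(100)})

# Draw 10 distinct rows at random (sampling without replacement).
sample = dataframe.sample(n=10, random_state=0)

# Any calculation can then be run on the sample alone.
sample_mean = sample["value"].mean()
print(sample_mean)
```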

## Measures of Association<a id='MeasuresofAssociation'></a>

### Plot a labeled correlation heatmap<a id='Plotalabeledcorrelationheatmap'></a>

This command displays a correlation matrix as a heatmap.

![variables.png](attachment:variables.png)

![setVariable.png](attachment:setVariable.png)        ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![fromVarGet.png](attachment:fromVarGet.png)

![lists.png](attachment:lists.png)       ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)       ![freestyle3.png](attachment:freestyle3.png)

![image14.png](attachment:image14.png)

Output:

![image49.png](attachment:image49.png)

### Create a contingency table<a id='Createacontingencytable'></a>

This command displays a frequency distribution of user-defined factors.

![variables.png](attachment:variables.png)     ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)       ![createListWith.png](attachment:createListWith.png)

![dictVariable.png](attachment:dictVariable.png)

![text.png](attachment:text.png)      ![insertText.png](attachment:insertText.png)

![image39.png](attachment:image39.png)

Output:

![image12.png](attachment:image12.png)
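
A minimal text sketch of a contingency table, assuming the blocks call pandas `crosstab`. The two factors and their values are hypothetical.

```python
import pandas as pd

dataframe = pd.DataFrame({
    "sex": ["f", "m", "f", "m", "f"],
    "smoker": ["no", "no", "yes", "yes", "no"],
})

# Count how often each combination of the two factors occurs.
table = pd.crosstab(dataframe["sex"], dataframe["smoker"])
print(table)
```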

## Clustering<a id='Clustering'></a>

### Load data with identifier (ID)<a id='Loaddatawithidentifier'></a>

This command loads a dataset and designates an identifier (ID) column, usually a variable whose values never repeat.

![variables.png](attachment:variables.png)    ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![text.png](attachment:text.png)      ![insertText.png](attachment:insertText.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle1.png](attachment:freestyle1.png)

![image34.png](attachment:image34.png)

### Create a dendrogram<a id='Createadendogram'></a>

This command creates a dendrogram showing the hierarchical clustering of a given dataset.

![variables.png](attachment:variables.png)     ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle1.png](attachment:freestyle1.png)

![image35.png](attachment:image35.png)

Output:

![image7.png](attachment:image7.png)
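
A minimal text sketch of hierarchical clustering. It uses SciPy as a stand-in, which may not be the library the blocks above call; the points, labels, and the `ward` linkage method are hypothetical choices.

```python
import numpy as np
from scipy.cluster.hierarchy import dendrogram, linkage

points = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
labels = ["a", "b", "c", "d"]

# Each row of the linkage matrix records one merge of two clusters.
links = linkage(points, method="ward")

# no_plot=True returns the tree layout without drawing it;
# drop that argument (with matplotlib installed) to draw the dendrogram.
tree = dendrogram(links, labels=labels, no_plot=True)
print(tree["ivl"])  # leaf labels in left-to-right order
```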

### Plot Clusters<a id='PlotClusters'></a>

This series of commands runs a k-means clustering algorithm.

#### 1) Set up a variable to run k-means using column names from the dataset

![variables.png](attachment:variables.png)     ![setVariable.png](attachment:setVariable.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)       

![dictVariable.png](attachment:dictVariable.png)

![text.png](attachment:text.png)      ![insertText.png](attachment:insertText.png)

![image31.png](attachment:image31.png)

Output:

![image26.png](attachment:image26.png)

#### 2) Set the number of clusters (k) and create a cluster model

![variables.png](attachment:variables.png)   ![setVariable.png](attachment:setVariable.png)

![withVarCreate.png](attachment:withVarCreate.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![image25.png](attachment:image25.png)

#### 3) Fit the cluster model to the data and predict a cluster for each datapoint.

![variables.png](attachment:variables.png)

![setVariable.png](attachment:setVariable.png)       ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![image21.png](attachment:image21.png)

Output:

![image8.png](attachment:image8.png)

#### 4) Add cluster assignments as a new column.

![variables.png](attachment:variables.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle1.png](attachment:freestyle1.png)

![text.png](attachment:text.png)    ![insertText.png](attachment:insertText.png)

![image38.png](attachment:image38.png)

Output:

![image27.png](attachment:image27.png)

#### 5) Create a scatterplot of clusters.

![variables.png](attachment:variables.png)    ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)     ![freestyle1.png](attachment:freestyle1.png)

![image20.png](attachment:image20.png)

Output:

![image23.png](attachment:image23.png)
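
The five steps above can be sketched in text as follows, assuming scikit-learn's `KMeans` and a pandas dataframe. The data, column names, and variable names are hypothetical, and the final scatterplot is left as a comment.

```python
import pandas as pd
from sklearn.cluster import KMeans

dataframe = pd.DataFrame({
    "x": [1.0, 1.2, 0.8, 8.0, 8.3, 7.9],
    "y": [1.0, 0.9, 1.1, 8.0, 8.2, 7.8],
})

# 1) Pick the columns to cluster on.
features = dataframe[["x", "y"]]

# 2) Set the number of clusters (k) and create the model.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)

# 3) Fit the model and predict a cluster for each datapoint.
clusters = kmeans.fit_predict(features)

# 4) Add the cluster assignments as a new column.
dataframe["cluster"] = clusters

# 5) A scatterplot colored by "cluster" would then show the groups.
print(dataframe)
```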

## KNN Classification<a id='KNNClassification'></a>

### Split X and Y into train and test sets<a id='SplitXY'></a>

This command splits pre-defined variables X and Y into train and test sets. It outputs four dataframes: Xtrain, Xtest, Ytrain, and Ytest.

![variables.png](attachment:variables.png)     ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)                    ![variable.png](attachment:variable.png)

![lists.png](attachment:lists.png)      ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png) ![freestyle1.png](attachment:freestyle1.png)

![1.png](attachment:1.png)
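
A minimal text sketch of the split, assuming scikit-learn's `train_test_split`. The data, the 20% test size, and the `random_state` seed are hypothetical.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

X = pd.DataFrame({"a": range(10), "b": range(10, 20)})
Y = pd.Series([0, 1] * 5, name="label")

# Rows are shuffled, then 20% are held out as the test set.
Xtrain, Xtest, Ytrain, Ytest = train_test_split(
    X, Y, test_size=0.2, random_state=0
)
print(len(Xtrain), len(Xtest))
```

The rows of `Xtrain` and `Ytrain` stay aligned, so each predictor row keeps its own label.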

### Define the LDA model<a id='DefineLDA'></a>

This command sets up a base LDA model.

![variables.png](attachment:variables.png)        ![setVariable.png](attachment:setVariable.png)

![withVarCreate.png](attachment:withVarCreate.png)

![lists.png](attachment:lists.png)         ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)           ![freestyle1.png](attachment:freestyle1.png)

![2.png](attachment:2.png)

Output:

![2o.png](attachment:2o.png)

### Train the LDA model<a id='TrainLDA'></a>

This command trains the previously defined LDA model with user-provided data.

![variables.png](attachment:variables.png)     ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)      ![freestyle1.png](attachment:freestyle1.png)

![3.png](attachment:3.png)

Output:

![3o.png](attachment:3o.png)

### Predict classification with the LDA model<a id='PredictLDA'></a>

This command predicts classification using the trained LDA model.

![variables.png](attachment:variables.png)        ![variable.png](attachment:variable.png)

![setVariable.png](attachment:setVariable.png)     ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)      ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)      ![freestyle1.png](attachment:freestyle1.png)

![4.png](attachment:4.png)
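
The define, train, and predict steps for LDA can be sketched in text as follows, assuming scikit-learn. The variable names `lda_iris` and `lda_iris_predict` follow this reference; loading iris through `load_iris` and the split settings are assumptions made so the example is self-contained.

```python
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import train_test_split

X, Y = load_iris(return_X_y=True)
Xtrain, Xtest, Ytrain, Ytest = train_test_split(
    X, Y, test_size=0.2, random_state=0
)

# Define the base model.
lda_iris = LinearDiscriminantAnalysis()

# Train it on the training set.
lda_iris.fit(Xtrain, Ytrain)

# Predict a class for each row of the test set.
lda_iris_predict = lda_iris.predict(Xtest)
print(lda_iris_predict)
```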

### Define the KNN classification model<a id='DefineKNNC'></a>

This command sets up a base KNN model.

![variables.png](attachment:variables.png)     ![setVariable.png](attachment:setVariable.png)

![withVarCreate.png](attachment:withVarCreate.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)      ![freestyle1.png](attachment:freestyle1.png)

![5.png](attachment:5.png)

### Train the KNN classification model<a id='TrainKNNC'></a>

This command trains the previously defined KNN model with user-provided data.

[See __Train the LDA Model__. Replace variable "lda_iris" with "knnclass_iris".]

Output:

![5o.png](attachment:5o.png)

### Predict classification with the KNN model<a id='PredictKNNC'></a>

This command predicts classification using the trained KNN model.

[See __Predict Classification with the LDA Model__. Replace "lda_iris_predict" with "knnclass_iris_predict" and replace "lda_iris" with "knnclass_iris".]

Output:

![5oo.png](attachment:5oo.png)

### Get confusion matrix of a model<a id='ConfusionMatrix'></a>

This command outputs an array representing the confusion matrix of a prediction model. This is used to assess the predictive value (performance) of the model.

![variables.png](attachment:variables.png)     ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)      ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)       ![freestyle1.png](attachment:freestyle1.png)

![6.png](attachment:6.png)

Output:

![6o.png](attachment:6o.png)

### Get accuracy of a model<a id='AccuracyofModel'></a>

This command returns the accuracy of the prediction model as a proportion between 0 and 1.

![variables.png](attachment:variables.png)         ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![6.png](attachment:6.png)

Output:

![6o.png](attachment:6o.png)

### Get recall and precision of a model<a id='RecallandPrecision'></a>

This command shows a classification report of the prediction model. The report includes precision, recall, and F1-score (each a proportion between 0 and 1), along with support (the number of samples in each class).

![variables.png](attachment:variables.png)        ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![text.png](attachment:text.png)     print

![freestyle.png](attachment:freestyle.png)       ![freestyle1.png](attachment:freestyle1.png)

![8.png](attachment:8.png)

Output:

![8o.png](attachment:8o.png)
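
The three evaluation commands above can be sketched in text as follows, assuming scikit-learn's `metrics` module. The true and predicted labels are hypothetical and small enough to check by hand.

```python
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

y_true = [0, 0, 1, 1, 1, 0]
y_pred = [0, 1, 1, 1, 0, 0]

# Rows are the true classes, columns are the predicted classes.
matrix = confusion_matrix(y_true, y_pred)

# Share of predictions that match the true label (4 of 6 here).
accuracy = accuracy_score(y_true, y_pred)

# Text table of precision, recall, F1-score and support per class.
report = classification_report(y_true, y_pred)

print(matrix)
print(accuracy)
print(report)
```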

## KNN Regression<a id='KNNRegression'></a>

### Create a pipeline with a scaler and KNN regression<a id='KNNRPipeline'></a>

![variables.png](attachment:variables.png)       ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![withVarCreate.png](attachment:withVarCreate.png)

![lists.png](attachment:lists.png)      ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)     ![freestyle1.png](attachment:freestyle1.png)

![9.png](attachment:9.png)

### Train the KNN regression model <a id='TrainKNNR'></a>

[See __Train the LDA Model__. Replace variable "lda_iris" with "std_clf".]

Output:

![9o.png](attachment:9o.png)

### Predict values using the KNN regression model<a id='PredictKNNR'></a>

[See __Predict classification with the LDA Model__. Replace "lda_iris_predict" with "predictions" and replace "lda_iris" with "std_clf".]

Output:

![9oo.png](attachment:9oo.png)
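
The pipeline, training, and prediction steps for KNN regression can be sketched in text as follows, assuming scikit-learn. The names `std_clf` and `predictions` follow this reference; the synthetic data, the step names, and `n_neighbors=3` are hypothetical.

```python
import numpy as np
from sklearn.neighbors import KNeighborsRegressor
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(50, 2))
Y = X[:, 0] + 2 * X[:, 1]

# The scaler runs first so that both predictors weigh equally in the distances.
std_clf = Pipeline([
    ("scaler", StandardScaler()),
    ("knn", KNeighborsRegressor(n_neighbors=3)),
])

# Train on the first 40 rows, predict the remaining 10.
std_clf.fit(X[:40], Y[:40])
predictions = std_clf.predict(X[40:])
print(predictions)
```

Each prediction is the average target of the three nearest training rows, so it always lies within the range of the training targets.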

### Model scatterplot of observed data<a id='ModelKNNR'></a>

![variables.png](attachment:variables.png)     ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![10.png](attachment:10.png)

Output:

![10o.png](attachment:10o.png)

### Model scatterplot of predictions<a id='ModelKNNR'></a>

![variables.png](attachment:variables.png)      ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)     ![freestyle1.png](attachment:freestyle1.png)

![11.png](attachment:11.png)

Output:

![11o.png](attachment:11o.png)

## Simple Linear Regression<a id='SimpleLinearRegression'></a>

### Define the linear regression model<a id='CreateLRM'></a>

![variables.png](attachment:variables.png)     ![setVariable.png](attachment:setVariable.png)

![withVarCreate.png](attachment:withVarCreate.png)

![12.png](attachment:12.png)

### Train the linear regression model<a id='TrainLRM'></a>

![variables.png](attachment:variables.png)     ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![dictVariable.png](attachment:dictVariable.png)

![text.png](attachment:text.png)      ![insertText.png](attachment:insertText.png)

![13.png](attachment:13.png)

Output:

![13o.png](attachment:13o.png)
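
A minimal text sketch of defining and training a simple linear regression, assuming scikit-learn's `LinearRegression`. The data (which follows y = 2x + 1 exactly) and the variable name `lm` are hypothetical.

```python
import pandas as pd
from sklearn.linear_model import LinearRegression

dataframe = pd.DataFrame({"x": [0.0, 1.0, 2.0, 3.0], "y": [1.0, 3.0, 5.0, 7.0]})

# Define the model.
lm = LinearRegression()

# Train it: predictors go in as a dataframe (double brackets), the response as a column.
lm.fit(dataframe[["x"]], dataframe["y"])

print(lm.coef_, lm.intercept_)  # slope and intercept of the fitted line
```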

## Multiple Linear Regression<a id='MultipleLinearRegression'></a>

### Define the multiple linear regression model<a id='CreateMLR'></a>

This command creates a multiple linear regression model.

[See __Define the linear regression model__; the command is the same.]

### Train the multiple linear regression model<a id='TrainMLR'></a>

This command trains a multiple linear regression model.

![variables.png](attachment:variables.png)       ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![dictVariable.png](attachment:dictVariable.png)

![text.png](attachment:text.png)      ![insertText.png](attachment:insertText.png)

![14.png](attachment:14.png)

Output:

![14o.png](attachment:14o.png)

### Get predictions from the multiple linear regression model<a id='PredictMLR'></a>

This command gets predictions of data using a multiple linear regression model.

![variables.png](attachment:variables.png)   ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![dictVariable.png](attachment:dictVariable.png)

![text.png](attachment:text.png)       ![insertText.png](attachment:insertText.png)

![freestyle.png](attachment:freestyle.png)       ![freestyle3.png](attachment:freestyle3.png)

![15.png](attachment:15.png)

Output:

![15o.png](attachment:15o.png)

### Check linearity and equal variance<a id='LinearityMLR'></a>

This command checks the linearity and equal variance of a multiple linear regression model.

![variables.png](attachment:variables.png)      ![variable.png](attachment:variable.png)

![setVariable.png](attachment:setVariable.png)

![lists.png](attachment:lists.png)       ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)     ![freestyle1.png](attachment:freestyle1.png)

![16.png](attachment:16.png)

Output:

![16o.png](attachment:16o.png)
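
The multiple linear regression steps can be sketched in text as follows, assuming scikit-learn. The data and variable names are hypothetical, and the diagnostic plot is left as a comment: the usual check is a scatterplot of residuals against predictions.

```python
import pandas as pd
from sklearn.linear_model import LinearRegression

dataframe = pd.DataFrame({
    "a": [1, 2, 3, 4, 5, 6],
    "b": [2, 1, 4, 3, 6, 5],
    "y": [9.1, 7.9, 19.2, 17.8, 29.1, 27.9],
})

# Train on two predictors instead of one.
lm = LinearRegression()
lm.fit(dataframe[["a", "b"]], dataframe["y"])

# Predictions for the same rows, and the leftover error per row.
predictions = lm.predict(dataframe[["a", "b"]])
residuals = dataframe["y"] - predictions

# Plotting residuals against predictions should show no curve (linearity)
# and a roughly constant spread (equal variance).
print(residuals)
```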

## Logistic Regression<a id='LogisticRegression'></a>

### Define the logistic regression model <a id='CreateLRM'></a>

This command creates the logistic regression model.

![variables.png](attachment:variables.png)     ![variable.png](attachment:variable.png)

![setVariable.png](attachment:setVariable.png)     ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)      ![createListWith.png](attachment:createListWith.png)

![dictVariable.png](attachment:dictVariable.png)

![freestyle.png](attachment:freestyle.png)      ![freestyle1.png](attachment:freestyle1.png)

![17.png](attachment:17.png)

Output:

![17o.png](attachment:17o.png)

## Ridge and Lasso Regression<a id='Ridge&Lasso'></a>

### Train the logistic regression model with ridge penalty<a id='DefLRRP'></a>

This command builds a pipeline so that the data can be scaled and the model trained in one step.

![variables.png](attachment:variables.png)     ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![withVarCreate.png](attachment:withVarCreate.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)       ![freestyle1.png](attachment:freestyle1.png)

![18.png](attachment:18.png)

### Get predictions from the ridge regression model<a id='PredRRM'></a>

This command uses the ridge model to make predictions about the data it is given.

![variables.png](attachment:variables.png)     ![variable.png](attachment:variable.png)

![setVariable.png](attachment:setVariable.png)     ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)           ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)     ![freestyle1.png](attachment:freestyle1.png)

![19.png](attachment:19.png)

### Train the logistic regression model with lasso penalty<a id='TrainLRLP'></a>

This command creates a logistic regression model with a lasso penalty.

![variables.png](attachment:variables.png)      ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![withVarCreate.png](attachment:withVarCreate.png)

![lists.png](attachment:lists.png)         ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)      ![freestyle1.png](attachment:freestyle1.png)

![20.png](attachment:20.png)

### Get predictions from the lasso regression model<a id='PredLRL'></a>

This command uses the logistic regression model with lasso penalty to make predictions about the data it is given.

![variables.png](attachment:variables.png)     ![variable.png](attachment:variable.png)

![setVariable.png](attachment:setVariable.png)       ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)        ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)         ![freestyle1.png](attachment:freestyle1.png)

![21.png](attachment:21.png)
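
The ridge and lasso steps can be sketched in text as follows, assuming scikit-learn, where a ridge penalty corresponds to `penalty="l2"` and a lasso penalty to `penalty="l1"`. The dataset, the `liblinear` solver, and the variable names are hypothetical choices.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, Y = load_breast_cancer(return_X_y=True)
Xtrain, Xtest, Ytrain, Ytest = train_test_split(
    X, Y, test_size=0.2, random_state=0
)

# Each pipeline scales the predictors, then fits the penalized model.
ridge = Pipeline([
    ("scaler", StandardScaler()),
    ("model", LogisticRegression(penalty="l2", solver="liblinear")),
])
lasso = Pipeline([
    ("scaler", StandardScaler()),
    ("model", LogisticRegression(penalty="l1", solver="liblinear")),
])

ridge.fit(Xtrain, Ytrain)
lasso.fit(Xtrain, Ytrain)

ridge_predict = ridge.predict(Xtest)
lasso_predict = lasso.predict(Xtest)
print(ridge_predict[:10], lasso_predict[:10])
```

Scaling matters here because both penalties act on coefficient size, which depends on the units of each predictor.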

## Decision Trees<a id='DecisionTree'></a>

### Build a decision tree<a id='DT'></a>

This command builds a generic decision tree.

![variables.png](attachment:variables.png)      ![setVariable.png](attachment:setVariable.png)

![lists.png](attachment:lists.png)         ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)     ![freestyle1.png](attachment:freestyle1.png)

![22.png](attachment:22.png)

### Create the decision tree model<a id='CreateDT'></a>

This command creates a generic decision tree model.

![variables.png](attachment:variables.png)    ![setVariable.png](attachment:setVariable.png)

![withVarCreate.png](attachment:withVarCreate.png)

![23.png](attachment:23.png)

### Fit the decision tree model<a id='FitDT'></a>

This command fits the decision tree model to the data.

![variables.png](attachment:variables.png)       ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)       ![createListWith.png](attachment:createListWith.png)

![24.png](attachment:24.png)

### Visualize the decision tree model & interpret algorithm fit<a id='VisDT'></a>

This command visualizes the model and displays algorithm interpretation for each node.

![variables.png](attachment:variables.png)      ![variable.png](attachment:variable.png)

![setVariable.png](attachment:setVariable.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)     ![freestyle1.png](attachment:freestyle1.png)

![25.png](attachment:25.png)

Output:

![25o.png](attachment:25o.png)
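
The create, fit, and visualize steps can be sketched in text as follows, assuming scikit-learn. The blocks above produce a graphical tree; this sketch prints the same split rules as text with `export_text` instead. The dataset, `max_depth=2`, and the variable name `dt` are hypothetical.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()

# Create the model; a small depth keeps the tree readable.
dt = DecisionTreeClassifier(max_depth=2, random_state=0)

# Fit it to the predictors and class labels.
dt.fit(iris.data, iris.target)

# Print one line per node: the split condition or the predicted class.
rules = export_text(dt, feature_names=list(iris.feature_names))
print(rules)
```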

## Regression Trees<a id='RegTrees'></a>

### Prepare train/test sets<a id='PrepReggTree'></a>

After X is set to a dataframe containing the two predictors we want and Y to the response, the data can be split into train and test sets.

![variables.png](attachment:variables.png) ![variable.png](attachment:variable.png)

![variable.png](attachment:variable.png)

![lists.png](attachment:lists.png) ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png) ![freestyle1.png](attachment:freestyle1.png)

![image51.png](attachment:image51.png)
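
As a rough Python equivalent, the split could be written with scikit-learn's `train_test_split`. The column names `x1`, `x2`, and `y`, the generated data, and the 80/20 split are assumptions made for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical dataframe with two predictors and one numeric response.
rng = np.random.default_rng(0)
df = pd.DataFrame({"x1": rng.normal(size=100), "x2": rng.normal(size=100)})
df["y"] = 2 * df["x1"] - df["x2"] + rng.normal(scale=0.1, size=100)

X = df[["x1", "x2"]]  # predictors
Y = df["y"]           # response

# Hold out 20% of the rows for testing.
X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.2, random_state=0
)
print(X_train.shape, X_test.shape)
```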

### Fit the model<a id='FitRegTree'></a>

Fit the regression tree model to your training data.

![variables.png](attachment:variables.png)       ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)      ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)      ![listvarGet.png](attachment:listvarGet.png)

![image52.png](attachment:image52.png)

### Get predictions<a id='PredReggTree'></a>

Predict the values for a new set of data using the model you just trained.

![variables.png](attachment:variables.png)   ![withVarDo.png](attachment:withVarDo.png)

![setVariable.png](attachment:setVariable.png)   ![variable.png](attachment:variable.png)

![lists.png](attachment:lists.png)  ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![image53.png](attachment:image53.png)

### Evaluate the model<a id='EvalPreddRegg'></a>

Evaluate your trained regression tree model using $R^2$ (the coefficient of determination), which estimates goodness of fit.

![variables.png](attachment:variables.png)    ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![text.png](attachment:text.png)   ![insertText.png](attachment:insertText.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle1.png](attachment:freestyle1.png)

![image54.png](attachment:image54.png)

Output:

![image55.png](attachment:image55.png)
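
The fit, predict, and evaluate steps can be sketched together in Python as follows, assuming scikit-learn's `DecisionTreeRegressor` and `r2_score`. The data is synthetic, and the manual calculation is included only to show what $R^2 = 1 - SS_{res} / SS_{tot}$ measures.

```python
import numpy as np
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

# Hypothetical data: two predictors and a noisy numeric response.
rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(300, 2))
Y = np.sin(X[:, 0]) + 0.5 * X[:, 1] + rng.normal(scale=0.1, size=300)
X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.2, random_state=0
)

# Fit on the training set, then predict the held-out rows.
reg = DecisionTreeRegressor(max_depth=4, random_state=0)
reg.fit(X_train, Y_train)
pred = reg.predict(X_test)

# R^2 from the library, and the same quantity computed by hand.
r2 = r2_score(Y_test, pred)
ss_res = np.sum((Y_test - pred) ** 2)           # unexplained variation
ss_tot = np.sum((Y_test - Y_test.mean()) ** 2)  # total variation
r2_manual = 1 - ss_res / ss_tot
print(r2, r2_manual)
```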

### Visualize the model<a id='VisReggTree'></a>

Display the fitted regression tree as a graphical tree diagram.

![variables.png](attachment:variables.png)    ![withVarDo.png](attachment:withVarDo.png)

![setVariable.png](attachment:setVariable.png)    ![variable.png](attachment:variable.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![image56.png](attachment:image56.png)

Output:

![image57.png](attachment:image57.png)

## Random Forests<a id='RandomForest'></a>

### Create bagging classifier<a id='Bag'></a>

![variables.png](attachment:variables.png)   ![setVariable.png](attachment:setVariable.png)

![withVarCreate.png](attachment:withVarCreate.png)

![lists.png](attachment:lists.png)  ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png) ![freestyle1.png](attachment:freestyle1.png)

![image58.png](attachment:image58.png)

### Fit and score bagging classifier<a id='FitBag'></a>

![variables.png](attachment:variables.png)   ![withVarDo.png](attachment:withVarDo.png)

![fromVarGet.png](attachment:fromVarGet.png)

![lists.png](attachment:lists.png) ![createListWith.png](attachment:createListWith.png)

![image59.png](attachment:image59.png)

Output:

![image60.png](attachment:image60.png)

### Get out of bag (OOB) score<a id='GetBagScore'></a>

![variables.png](attachment:variables.png)    ![fromVarGet.png](attachment:fromVarGet.png)

![image61.png](attachment:image61.png)

Output:

![image62.png](attachment:image62.png)
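
A minimal Python sketch of bagging with an out-of-bag score, assuming scikit-learn's `BaggingClassifier` (whose default base estimator is a decision tree). The data and the choice of 50 estimators are hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split

# Hypothetical stand-in data.
X, Y = make_classification(n_samples=300, n_features=6, random_state=0)
X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.25, random_state=0
)

# oob_score=True asks the model to score each training row using only
# the trees whose bootstrap sample did not include that row.
bag = BaggingClassifier(n_estimators=50, oob_score=True, random_state=0)
bag.fit(X_train, Y_train)

test_accuracy = bag.score(X_test, Y_test)
oob_accuracy = bag.oob_score_
print(test_accuracy, oob_accuracy)
```

The OOB score gives an accuracy estimate from the training data alone, so it can be compared with the test-set score as a sanity check.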

### Create random forest classifier<a id='CreateRandFore'></a>

![variables.png](attachment:variables.png)   ![setVariable.png](attachment:setVariable.png)

![withVarCreate.png](attachment:withVarCreate.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png) ![freestyle1.png](attachment:freestyle1.png)

![image63.png](attachment:image63.png)

### Fit and score random forest classifier<a id='FitRandFore'></a>

![variables.png](attachment:variables.png)  ![withVarDo.png](attachment:withVarDo.png)

![fromVarGet.png](attachment:fromVarGet.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![image64.png](attachment:image64.png)

Output:

![image65.png](attachment:image65.png)
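
In Python, creating, fitting, and scoring a random forest might look like the sketch below, assuming scikit-learn's `RandomForestClassifier`. The data and parameter values are hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Hypothetical stand-in data.
X, Y = make_classification(n_samples=300, n_features=6, random_state=0)
X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.25, random_state=0
)

# Create the forest, fit it, and report accuracy on the held-out rows.
rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(X_train, Y_train)
rf_accuracy = rf.score(X_test, Y_test)
print(rf_accuracy)
```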

### Visualize ensemble model<a id='VisEnsem'></a>

![variables.png](attachment:variables.png)  ![withVarDo.png](attachment:withVarDo.png)

![fromVarGet.png](attachment:fromVarGet.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle3.png](attachment:freestyle3.png)

![image66.png](attachment:image66.png)

Output:

![image67.png](attachment:image67.png)

## Gradient Boosting<a id='GradBoost'></a>

### Prepare train/test sets<a id='CreateGradBoost'></a>

![variables.png](attachment:variables.png)   ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![text.png](attachment:text.png)   ![insertText.png](attachment:insertText.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle1.png](attachment:freestyle1.png)

![image68.png](attachment:image68.png)

### Create gradient boosting classifier<a id='CreateGradBoost'></a>

![variables.png](attachment:variables.png)  ![setVariable.png](attachment:setVariable.png)

![withVarCreate.png](attachment:withVarCreate.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)  ![freestyle1.png](attachment:freestyle1.png)

![image69.png](attachment:image69.png)

### Fit the classifier<a id='FitClassGB'></a>

![variables.png](attachment:variables.png)   ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png) ![freestyle1.png](attachment:freestyle1.png)

![image70.png](attachment:image70.png)

Output:

![image71.png](attachment:image71.png)

### Get predictions<a id='GetPredGB'></a>

![variables.png](attachment:variables.png)   ![setVariable.png](attachment:setVariable.png)

![variable.png](attachment:variable.png)    ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png) ![freestyle1.png](attachment:freestyle1.png)

![image72.png](attachment:image72.png)

### Evaluate the model<a id='EvalGB'></a>

![variables.png](attachment:variables.png)  ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)     ![freestyle1.png](attachment:freestyle1.png)

![image73.png](attachment:image73.png)

Output:

![image74.png](attachment:image74.png)

### Visualize results<a id='VisGB'></a>

![variables.png](attachment:variables.png)  ![withVarDo.png](attachment:withVarDo.png)

![fromVarGet.png](attachment:fromVarGet.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle3.png](attachment:freestyle3.png)

![image75.png](attachment:image75.png)

Output:

![image76.png](attachment:image76.png)

### Detecting overfitting with deviance<a id='OverfitDev'></a>

![variables.png](attachment:variables.png)   ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![image77.png](attachment:image77.png)

Output:

![image78.png](attachment:image78.png)
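
One way to look for overfitting, sketched under the assumption that the model is scikit-learn's `GradientBoostingClassifier`, is to compute the test-set log loss (the deviance up to a constant factor) after every boosting stage and compare it with the training loss. The data below is synthetic.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import log_loss
from sklearn.model_selection import train_test_split

# Hypothetical stand-in data.
X, Y = make_classification(n_samples=400, n_features=8, random_state=0)
X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.25, random_state=0
)

gb = GradientBoostingClassifier(n_estimators=100, random_state=0)
gb.fit(X_train, Y_train)

# Training loss per stage is stored by the model during fitting.
train_loss = gb.train_score_

# Test loss per stage, computed from the staged class probabilities.
test_loss = np.array(
    [log_loss(Y_test, proba) for proba in gb.staged_predict_proba(X_test)]
)

# The stage with the lowest test loss; later stages that raise the test
# loss while the training loss keeps falling suggest overfitting.
best_stage = int(np.argmin(test_loss)) + 1
print(best_stage)
```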

## Support Vector Machines<a id='SVM'></a>

### Prepare train/test sets<a id='PrepSVM'></a>

![variables.png](attachment:variables.png)  ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)  ![dictVariable.png](attachment:dictVariable.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![text.png](attachment:text.png)  ![insertText.png](attachment:insertText.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle1.png](attachment:freestyle1.png)

![image79.png](attachment:image79.png)

### Fit the linear SVM model<a id='FitSVM'></a>

![variables.png](attachment:variables.png)  ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![image80.png](attachment:image80.png)

Output:

![image81.png](attachment:image81.png)

### Predict using the linear SVM model<a id='PredSVM'></a>

![variables.png](attachment:variables.png)    ![variable.png](attachment:variable.png)

![setVariable.png](attachment:setVariable.png)    ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)     ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)  ![freestyle1.png](attachment:freestyle1.png)

![image82.png](attachment:image82.png)

### Evaluate the linear SVM model<a id='EvalSVM'></a>

![variables.png](attachment:variables.png)   ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![image83.png](attachment:image83.png)

Output:

![image84.png](attachment:image84.png)

### Fit the nonlinear SVM model<a id='FitNonlinSVM'></a>

![variables.png](attachment:variables.png)    ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)  ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)  

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![image85.png](attachment:image85.png)

Output:

![image86.png](attachment:image86.png)

### Predict with the nonlinear SVM model<a id='PredNonlinSVM'></a>

![variables.png](attachment:variables.png)    ![withVarDo.png](attachment:withVarDo.png)

![variable.png](attachment:variable.png)      ![setVariable.png](attachment:setVariable.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![image87.png](attachment:image87.png)

### Evaluate the nonlinear SVM model<a id='EvalNonlinSVM'></a>

![variables.png](attachment:variables.png)  ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)  ![freestyle1.png](attachment:freestyle1.png)

![image88.png](attachment:image88.png)

Output:

![image89.png](attachment:image89.png)
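
The linear and nonlinear SVM steps can be compared side by side in Python. This sketch assumes scikit-learn's `SVC` with a `"linear"` kernel and an `"rbf"` kernel as the nonlinear choice; the two-moons data is a hypothetical example chosen because its classes are not linearly separable.

```python
from sklearn.datasets import make_moons
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Hypothetical data: two interleaving half circles.
X, Y = make_moons(n_samples=300, noise=0.2, random_state=0)
X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.25, random_state=0
)

# Fit one model per kernel.
linear_svm = SVC(kernel="linear").fit(X_train, Y_train)
rbf_svm = SVC(kernel="rbf").fit(X_train, Y_train)

# Predict on the held-out rows and compute accuracy for each model.
acc_linear = accuracy_score(Y_test, linear_svm.predict(X_test))
acc_rbf = accuracy_score(Y_test, rbf_svm.predict(X_test))
print(acc_linear, acc_rbf)
```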

## Naive Bayes<a id='NaiveBayes'></a>

### Prepare train/test sets<a id='PrepSVM'></a>

![variables.png](attachment:variables.png)  ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)    ![dictVariable.png](attachment:dictVariable.png)

![lists.png](attachment:lists.png)    ![createListWith.png](attachment:createListWith.png)

![text.png](attachment:text.png)    ![insertText.png](attachment:insertText.png)

![freestyle.png](attachment:freestyle.png)      ![freestyle1.png](attachment:freestyle1.png)

![image90.png](attachment:image90.png)

### Fit the model<a id='FitSVM'></a>

![variables.png](attachment:variables.png)   ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle1.png](attachment:freestyle1.png)

![image91.png](attachment:image91.png)

Output:

![image92.png](attachment:image92.png)

### Get predictions<a id='PredNonlinSVM'></a>

![variables.png](attachment:variables.png)    ![withVarDo.png](attachment:withVarDo.png)

![variable.png](attachment:variable.png)  ![setVariable.png](attachment:setVariable.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)    ![freestyle1.png](attachment:freestyle1.png)

![image93.png](attachment:image93.png)

### Evaluate the model<a id='EvalNB'></a>

![variables.png](attachment:variables.png)   ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![listvarGet.png](attachment:listvarGet.png)

![freestyle.png](attachment:freestyle.png)     ![freestyle1.png](attachment:freestyle1.png)

![image94.png](attachment:image94.png)

Output:

![image110.png](attachment:image110.png)
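
A rough Python equivalent of the Naive Bayes workflow, assuming scikit-learn's `GaussianNB` (other variants such as `MultinomialNB` exist, and the blocks may use a different one). The data is synthetic.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

# Hypothetical stand-in data.
X, Y = make_classification(n_samples=300, n_features=6, random_state=0)
X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.25, random_state=0
)

# Fit the model, predict the held-out rows, and evaluate accuracy.
nb = GaussianNB()
nb.fit(X_train, Y_train)
nb_pred = nb.predict(X_test)
nb_accuracy = accuracy_score(Y_test, nb_pred)
print(nb_accuracy)
```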

## Crossvalidation<a id='CrossVal'></a>

### Prepare train/test sets<a id='PrepCross'></a>

![variables.png](attachment:variables.png)  ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)  ![dictVariable.png](attachment:dictVariable.png)

![lists.png](attachment:lists.png)  ![createListWith.png](attachment:createListWith.png)

![text.png](attachment:text.png)    ![insertText.png](attachment:insertText.png)

![freestyle.png](attachment:freestyle.png)  ![freestyle1.png](attachment:freestyle1.png)

![image95.png](attachment:image95.png)

### Train model & predict with crossvalidation<a id='TrainCross'></a>

![variables.png](attachment:variables.png)  ![withVarDo.png](attachment:withVarDo.png)

![setVariable.png](attachment:setVariable.png)  ![variable.png](attachment:variable.png)

![lists.png](attachment:lists.png) ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)  ![freestyle1.png](attachment:freestyle1.png)

![image96.png](attachment:image96.png)

### Evaluate the model<a id='EvalCross'></a>

![variables.png](attachment:variables.png)  ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![image97.png](attachment:image97.png)

Output:

![image111.png](attachment:image111.png)
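
Cross-validated prediction and evaluation might be written in Python as below, assuming scikit-learn's `cross_val_predict`. The decision tree is only a stand-in model, and the 5 folds are an assumption.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score
from sklearn.model_selection import cross_val_predict
from sklearn.tree import DecisionTreeClassifier

# Hypothetical stand-in data and model.
X, Y = make_classification(n_samples=200, n_features=6, random_state=0)
model = DecisionTreeClassifier(max_depth=3, random_state=0)

# Each row is predicted by a model trained on the other folds.
cv_pred = cross_val_predict(model, X, Y, cv=5)

# Compare the out-of-fold predictions with the true labels.
cv_accuracy = accuracy_score(Y, cv_pred)
print(cv_accuracy)
```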

### Train model & predict with nested crossvalidation<a id='TrainNCross'></a>

![variables.png](attachment:variables.png)   ![withVarDo.png](attachment:withVarDo.png)

![variable.png](attachment:variable.png)   ![setVariable.png](attachment:setVariable.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle1.png](attachment:freestyle1.png)

![image98.png](attachment:image98.png)

### Evaluate the model<a id='EvalNCross'></a>

![variables.png](attachment:variables.png)  ![variable.png](attachment:variable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![image99.png](attachment:image99.png)

Output:

![image100.png](attachment:image100.png)
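
Nested cross-validation is commonly sketched in scikit-learn by wrapping a `GridSearchCV` (inner loop, which tunes hyperparameters) inside `cross_val_score` (outer loop, which estimates performance). The model, parameter grid, and fold counts below are hypothetical choices.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, KFold, cross_val_score
from sklearn.svm import SVC

# Hypothetical stand-in data.
X, Y = make_classification(n_samples=200, n_features=6, random_state=0)

inner_cv = KFold(n_splits=3, shuffle=True, random_state=0)
outer_cv = KFold(n_splits=5, shuffle=True, random_state=1)

# Inner loop: choose C using only the training part of each outer fold.
search = GridSearchCV(SVC(), param_grid={"C": [0.1, 1, 10]}, cv=inner_cv)

# Outer loop: score the tuned model on data it never saw during tuning.
nested_scores = cross_val_score(search, X, Y, cv=outer_cv)
print(nested_scores.mean())
```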

## Data cleaning, transformations, & versioning<a id='Data'></a>

### Missing Data<a id='MissingData'></a>

After reading in a CSV file that uses "?" to mark missing values, convert those instances to NaN. Then either delete the affected rows with dataframe.dropna() or fill them with median or mean values using dataframe.fillna(dataframe.median()) or dataframe.fillna(dataframe.mean()). The conversion from "?" to NaN is shown below.

![variables.png](attachment:variables.png)  ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![text.png](attachment:text.png)   ![insertText.png](attachment:insertText.png)

![freestyle.png](attachment:freestyle.png)  ![freestyle1.png](attachment:freestyle1.png)

![image101.png](attachment:image101.png)

Before:

![image109.png](attachment:image109.png)

After:

![image102.png](attachment:image102.png)
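
The same steps in plain pandas might look like this sketch. The small inline CSV and its column names are hypothetical; `dropna` and `fillna` are the calls named in the description above.

```python
import io

import numpy as np
import pandas as pd

# Hypothetical CSV in which "?" marks a missing value.
csv_text = "age,income\n25,50000\n?,62000\n40,?\n31,58000\n"
df = pd.read_csv(io.StringIO(csv_text))

# Convert "?" to NaN, then turn the text columns into numbers.
df = df.replace("?", np.nan)
df = df.apply(pd.to_numeric)

# Option 1: drop every row that has a missing value.
dropped = df.dropna()

# Option 2: fill each missing value with its column median.
filled = df.fillna(df.median())
print(filled)
```

Passing `na_values="?"` to `pd.read_csv` is an alternative that performs the conversion while the file is read.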

### Transforming data<a id='Transform'></a>

Outliers (values below the 1st percentile or above the 99th percentile) can either be converted to NaN or clipped to the last permitted extreme value.

![variables.png](attachment:variables.png)   ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)   ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle1.png](attachment:freestyle1.png)

![image103.png](attachment:image103.png)

Before and after transformation:

![image104b.png](attachment:image104b.png)  ![image104.png](attachment:image104.png)
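
Both options can be sketched in pandas as follows. The series is synthetic with two planted outliers, and `quantile`, `clip`, and `where` are used here as one possible way to do this, not necessarily the calls behind the blocks.

```python
import numpy as np
import pandas as pd

# Hypothetical data: 1000 normal values plus two extreme outliers.
rng = np.random.default_rng(0)
s = pd.Series(rng.normal(size=1000))
s.iloc[0] = 50.0
s.iloc[1] = -50.0

# The 1st and 99th percentiles are the permitted extremes.
low, high = s.quantile(0.01), s.quantile(0.99)

# Option 1: clip values outside the range to the nearest permitted extreme.
clipped = s.clip(lower=low, upper=high)

# Option 2: replace values outside the range with NaN.
as_nan = s.where((s >= low) & (s <= high))
print(clipped.min(), clipped.max(), as_nan.isna().sum())
```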

### Common scale<a id='Common'></a>

Convert your data to a standardized scale, so that each column has a standard deviation of 1 unit (standardizing usually also centers each column at a mean of 0).

![variables.png](attachment:variables.png)   ![withVarDo.png](attachment:withVarDo.png)

![setVariable.png](attachment:setVariable.png)   ![variable.png](attachment:variable.png)

![lists.png](attachment:lists.png)  ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)   ![freestyle1.png](attachment:freestyle1.png)

![image105.png](attachment:image105.png)

Output:

![image106.png](attachment:image106.png)
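
A minimal Python sketch of standardizing columns, assuming scikit-learn's `StandardScaler`, which subtracts each column's mean and divides by its standard deviation. The dataframe and its column names are hypothetical.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Hypothetical data measured on very different scales.
df = pd.DataFrame(
    {
        "height_cm": [150.0, 160.0, 170.0, 180.0],
        "weight_kg": [50.0, 60.0, 80.0, 90.0],
    }
)

# fit_transform returns a NumPy array, so rebuild the dataframe.
scaled = pd.DataFrame(StandardScaler().fit_transform(df), columns=df.columns)
print(scaled)
```

`StandardScaler` uses the population standard deviation, so checking the result with pandas requires `scaled.std(ddof=0)`.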

### Non-normality<a id='NonNormal'></a>

To make a skewed distribution closer to normal, try applying a log transform.

![variables.png](attachment:variables.png)  ![withVarDo.png](attachment:withVarDo.png)

![lists.png](attachment:lists.png)  ![createListWith.png](attachment:createListWith.png)

![freestyle.png](attachment:freestyle.png)  ![freestyle3.png](attachment:freestyle3.png)

![image107.png](attachment:image107.png)

Before and after transformation:

![image107b.png](attachment:image107b.png) ![image107a.png](attachment:image107a.png)
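
The effect of a log transform on skew can be checked numerically. This sketch uses a synthetic right-skewed series (the name `income` is hypothetical) and NumPy's natural log. Note that `np.log` requires strictly positive values; `np.log1p` is a common alternative when zeros are present.

```python
import numpy as np
import pandas as pd

# Hypothetical right-skewed data drawn from a lognormal distribution.
rng = np.random.default_rng(0)
income = pd.Series(rng.lognormal(mean=10, sigma=1, size=1000))

# Apply the log transform.
log_income = np.log(income)

# Skewness near 0 indicates a roughly symmetric distribution.
skew_before = income.skew()
skew_after = log_income.skew()
print(skew_before, skew_after)
```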

### Create a new variable<a id='NewVar'></a>

Create a new variable as a column in your dataset.

![variables.png](attachment:variables.png)  ![setVariable.png](attachment:setVariable.png)

![withVarDo.png](attachment:withVarDo.png)

![text.png](attachment:text.png)  ![insertText.png](attachment:insertText.png)

![freestyle.png](attachment:freestyle.png)  ![freestyle3.png](attachment:freestyle3.png)

![image108.png](attachment:image108.png)

Before and after transformation:

![image108b.png](attachment:image108b.png)    ![image108a.png](attachment:image108a.png)
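
In pandas, assigning to a column name that does not yet exist adds a new column. The dataframe and the derived `bmi` column below are hypothetical examples.

```python
import pandas as pd

# Hypothetical dataset with two existing columns.
df = pd.DataFrame({"height_m": [1.6, 1.8], "weight_kg": [64.0, 81.0]})

# Create a new variable computed from the existing columns.
df["bmi"] = df["weight_kg"] / df["height_m"] ** 2
print(df)
```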