This is a tentative roadmap for our 30 days machine learning challange. I will add more information along the way.
-
Core concepts of Machine Learning
-
Machine Learning Process
-
Basic Data Exploration
-
Working with Missing Data
a. Examining Missing Data
b. Dropping Missing Data
c. Imputing Data
d. Adding Indicator Columns
-
Your first machine-learning model
-
Working with Cleaning Data
a. Column Names
b. Replacing Missing Values
-
Model Validation
-
Data Exploration
a. Data Size
b. Summary Stats
c. Histogram
d. Scatter Plot
e. Joint Plot
f. Pair Grid
g. Box and Violin Plots
h. Comparing Two Ordinal Values
i. Correlation
j. RadViz
k. Parallel Coordinates
-
Preprocessing Data
a. Standardize
b. Scale to Range
c. Dummy Variables
d. Label Encoder
e. Frequency Encoding
f. Pulling Categories from Strings
g. Other Categorical Encoding
h. Date Feature Engineering
i. Add col _na Feature
j. Manual Feature Engineering
- Feature Selection
a. Collinear Columns
b. Lasso Regression
c. Recursive Feature Elimination
e. Mutual Information
f. Principal Component Analysis
g. Feature Importance
- Dealing with Imbalance Classes
a. Use a Different Metric
b. Tree-based Algorithms and Ensembles
c. Penalize Models
d. Upsampling Minority
e. Generate Minority Data
f. Downsampling Majority
g. Upsampling Then Downsampling
- Classification Algorithms
- Model Selection
- Metrics and Classification Evaluation
a. Confusion Matrix
b. Metrics
c. Accuracy
d. Recall
e. Precision
f. F1
g. Classification Report
h. RoC
i. Precision-Recall Curve
j. Cumulative Gains Plot
k. Lift Curve
l. Class Balance
m. Class Prediction Error
n. Discrimination Threshold
- Explaining Classification Model
- Regression Algorithms
- Metrics and Regression Evaluation
- Explaining Regression Model
- Dimensionality Reduction
- Clustering
- Implementing Pipeline
-
Neural networks
-
Artificial neural networks (ANN)
-
Project:
-
ANN walkthrough: Predicting Stock Prices
- Natural Language Processing (NLP)
-
Project:
-
NLP walkthrough: Mining Newsgroups Dataset
- Deep Learning Basics
- Problems and Solutions
- Machine Learning best practices
-
Project:
-
Building a Movie Recommendation Engine
-
Project:
-
Recognizing Faces
- Project: Predicting Online Ad Click-Through: Tree-based Algorithm
- Project: NewsGroups Dataset with Clustering and Topic Modeling