📘 Scikit-Learn Intro Study Plan

This repository contains a concise, hands-on study plan to get comfortable applying Scikit-Learn for practical machine learning tasks. The focus is on understanding the core API, building end-to-end workflows, and preparing for code interviews with minimal but effective coverage.

🎯 Goals

Understand the Scikit-Learn API and the fit/transform/predict pattern.
Build reproducible pipelines for preprocessing and modeling.
Evaluate models with proper train/test splits and metrics.
Tune models quickly with cross-validation and simple search.
Apply to small tabular datasets end-to-end.

🧱 Core Concepts to Learn

Estimators, Transformers, Pipelines
Train/test split, cross-validation
Feature preprocessing: scaling, encoding, imputation
Model evaluation: classification vs regression metrics
Model selection: GridSearchCV, RandomizedSearchCV
Saving/loading models (joblib)

🛠️ Minimal Toolkit

Pipeline, ColumnTransformer
StandardScaler, OneHotEncoder, SimpleImputer
train_test_split, cross_val_score
GridSearchCV, RandomizedSearchCV
Baselines: DummyClassifier, DummyRegressor
Models: LogisticRegression, RandomForestClassifier, RandomForestRegressor, GradientBoostingRegressor

📚 Study Sequence (3–5 hours)

Quick API tour with a toy dataset (iris, boston alternative: fetch_california_housing).
Preprocessing with ColumnTransformer (numeric vs categorical).
Pipelines: preprocessing + model in one object.
Evaluation: train_test_split, cross_val_score, metrics (accuracy, f1, roc_auc, rmse/mae).
Hyperparameter tuning with GridSearchCV/RandomizedSearchCV.
Export model with joblib and reload for inference.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
drawndata1.csv		drawndata1.csv
drawndata2.csv		drawndata2.csv
scikit_init.ipynb		scikit_init.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

📘 Scikit-Learn Intro Study Plan

🎯 Goals

🧱 Core Concepts to Learn

🛠️ Minimal Toolkit

📚 Study Sequence (3–5 hours)

🧪 Template: Tabular Classification

About

Uh oh!

Releases

Packages

Languages

victorhrls/scikit-learn-intro

Folders and files

Latest commit

History

Repository files navigation

📘 Scikit-Learn Intro Study Plan

🎯 Goals

🧱 Core Concepts to Learn

🛠️ Minimal Toolkit

📚 Study Sequence (3–5 hours)

🧪 Template: Tabular Classification

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages