# ML Basics

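A short overview of common ML tasks: supervised learning (predicting labels from labeled examples) and unsupervised learning (finding structure in unlabeled data). It includes a tiny pipeline example using the Iris dataset.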

In [None]:
# Tiny pipeline example: scale features and train a LogisticRegression on Iris
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

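# Load Iris as pandas objects: X is the feature DataFrame, y the class labels
data = load_iris(as_frame=True)
X = data.data
y = data.target

# Hold out 20% of the rows for testing; random_state makes the split reproducible
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=1)

# The pipeline fits the scaler on the training data only, then trains the classifier
pipe = make_pipeline(StandardScaler(), LogisticRegression(max_iter=200))
pipe.fit(X_train, y_train)

# Evaluate on the held-out test set
preds = pipe.predict(X_test)
print('Accuracy:', accuracy_score(y_test, preds))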

## Notes for learners
- Always start with a simple baseline model.
- Preprocessing (scaling, encoding) often affects model performance.
- Use a train/test split and cross-validation to estimate performance on unseen data.
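
The notes above can be sketched in one small comparison. This is a minimal illustration, not a recommended recipe: the choice of `DummyClassifier` as the baseline, the 5-fold setting, `max_iter=1000`, and the names `candidates` and `cv_scores` are all assumptions made for this example. On a small dataset like Iris, where every feature is measured in centimetres, the gain from scaling may be small or absent.

```python
from sklearn.datasets import load_iris
from sklearn.dummy import DummyClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)

# Hypothetical set of candidates: a trivial baseline, then the same
# classifier without and with feature scaling.
candidates = {
    "baseline (most frequent class)": DummyClassifier(strategy="most_frequent"),
    "logistic regression, unscaled": LogisticRegression(max_iter=1000),
    "logistic regression, scaled": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
}

# 5-fold cross-validation; for classifiers the default score is accuracy.
cv_scores = {}
for name, model in candidates.items():
    scores = cross_val_score(model, X, y, cv=5)
    cv_scores[name] = scores
    print(f"{name}: {scores.mean():.3f} +/- {scores.std():.3f}")
```

Putting the scaler inside the pipeline means it is refit on the training portion of each fold, so no information from the validation rows leaks into preprocessing. Any real model should clearly beat the baseline row before it is worth tuning further.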