This repository is a practice space for tabular data workflows in Python, applied to a few datasets. Topics covered:
- Exploratory Data Analysis (EDA)
- Data cleaning (duplicates, invalid values, and placeholder values that stand in for missing data)
- Data preprocessing and transformation
- Categorical encoding (label mapping, one-hot encoding)
- Feature scaling (StandardScaler)
- Basic feature selection/extraction (correlation and chi-square)
- Preparing final model-ready datasets
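
The cleaning, encoding, and scaling steps listed above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the toy DataFrame, its column names, the placeholder strings (`"?"`, `"N/A"`), the rule that negative ages are invalid, the median imputation, and the `size_order` mapping are all assumptions made for the example.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Hypothetical toy data: column names and placeholder strings are assumptions.
raw = pd.DataFrame({
    "age": [25, 25, 40, -1, 31],
    "income": ["50000", "50000", "?", "62000", "N/A"],
    "size": ["small", "small", "large", "medium", "small"],
    "city": ["Paris", "Paris", "Rome", "Oslo", "Rome"],
})

# 1. Cleaning: drop exact duplicate rows, then mark placeholders as missing.
df = raw.drop_duplicates().reset_index(drop=True)
df = df.replace(["?", "N/A"], np.nan)
df["income"] = pd.to_numeric(df["income"])
# Assumption: a negative age is invalid, so it becomes NaN.
df["age"] = df["age"].where(df["age"] >= 0)

# 2. Simple imputation (one possible choice): fill numeric gaps with the median.
for col in ["age", "income"]:
    df[col] = df[col].fillna(df[col].median())

# 3. Encoding: label mapping for an ordered category, one-hot for a nominal one.
size_order = {"small": 0, "medium": 1, "large": 2}  # hypothetical ordering
df["size"] = df["size"].map(size_order)
df = pd.get_dummies(df, columns=["city"], dtype=int)

# 4. Scaling: standardize numeric columns to zero mean and unit variance.
num_cols = ["age", "income"]
df[num_cols] = StandardScaler().fit_transform(df[num_cols])

print(df)
```

In a real modelling setup the scaler would normally be fitted on the training split only and then applied to the test split, to avoid leaking information between them.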

Tools and libraries:
- Python
- NumPy
- pandas
- matplotlib
- seaborn
- scikit-learn
- SciPy
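
As an illustration of how these libraries might fit together for the correlation and chi-square feature selection mentioned earlier, here is a small sketch on synthetic data. The feature names, the random data, and both thresholds (`0.3` for absolute correlation, `0.05` for the chi-square p-value) are assumptions chosen for the example, not values taken from this repository.

```python
import numpy as np
import pandas as pd
from scipy.stats import chi2_contingency

# Synthetic data (assumption): two numeric and two categorical features.
rng = np.random.default_rng(0)
n = 200
target = rng.integers(0, 2, size=n)
flip = rng.random(n) < 0.1                 # 10% label noise for "group"
noisy = np.where(flip, 1 - target, target)

df = pd.DataFrame({
    "signal": target * 2.0 + rng.normal(0, 0.5, size=n),  # related to target
    "noise": rng.normal(0, 1, size=n),                    # unrelated
    "group": np.where(noisy == 1, "a", "b"),              # related to target
    "coin": rng.choice(["h", "t"], size=n),               # unrelated
    "target": target,
})

# Numeric features: absolute Pearson correlation with the target.
corr = df[["signal", "noise"]].corrwith(df["target"]).abs()
selected_numeric = corr[corr >= 0.3].index.tolist()  # hypothetical threshold

# Categorical features: chi-square test of independence against the target.
p_values = {}
for col in ["group", "coin"]:
    table = pd.crosstab(df[col], df["target"])
    chi2_stat, p, dof, expected = chi2_contingency(table)
    p_values[col] = p
selected_categorical = [c for c, p in p_values.items() if p < 0.05]

print("correlation with target:")
print(corr)
print("chi-square p-values:", p_values)
print("selected:", selected_numeric + selected_categorical)
```

Pearson correlation only captures linear relationships, and the chi-square test assumes reasonably large expected counts in each cell, so both serve here as quick screening tools rather than a complete selection method.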