In this repository, there is a module that I have compiled that allows us to adapt the dataset to the machine learning model before creating the machine learning model.
- Variables with categorical data
- Variables with numeric data
- Variables with categorical but high cardinality data
- Variables that contain numeric but categorical data
- General information about the dataset
- Categorical variables analysis
- Numerical variables analysis
- Target variable analysis
- Analysis of target variables with categorical variables
- Analysis of target variables with numerical variables
- Correlation analysis
- Outliers analysis
- Missing values analysis
- Encoding(Label Encoding, One-Hot Encoding, Rare Encoding)
- Feature scaling, extraction