GitHub - zl376/class_imb: Classification with imbalanced classes

Comparison of Decision Tree, Random Foreset and Random Forest with undersampled bootstrap for unbalanced data

Note:

This example shows class imbalance of ~200 ( dominant y=0 ).

Precision is always low
Using 'class_weight' in Random Forest seems to perform worse with more trees
Instead, explicit under-sampling dominant class (y=0) before bootstrap works for Random Forest with more trees

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
model		model
.gitignore		.gitignore
Readme.md		Readme.md
UnderRandomForestClassifier.py		UnderRandomForestClassifier.py
main.ipynb		main.ipynb
utils.py		utils.py