A Simple Decision Tree Classifier Implementation

Manual Python implementation of a decision tree classifier

See tree.py for the source code and example.py for a sample run on the iris dataset.

This simple decision tree classifier class is built only for numeric or ranked (ordinal) data. It accepts a list of lists as its data input, where each inner list is one row and the classification (string or numeric) is the last element of that row. It was built without pandas or sklearn and can be used without either. It is hard-coded to use Gini impurity as its split criterion.
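To make the input format and the split criterion concrete, here is a minimal sketch of Gini impurity computed on list-of-lists rows, with the class label as the last element of each row. The function names (`gini_impurity`, `split_rows`, `weighted_gini`, `best_split`) and the four sample rows are illustrative assumptions, not the names or data used in tree.py.

```python
from collections import Counter

def gini_impurity(rows):
    """Gini impurity of a group of rows; the class label is the last element."""
    if not rows:
        return 0.0
    counts = Counter(row[-1] for row in rows)
    total = len(rows)
    return 1.0 - sum((n / total) ** 2 for n in counts.values())

def split_rows(rows, feature_index, threshold):
    """Split rows into (left, right): values below the threshold go left."""
    left = [r for r in rows if r[feature_index] < threshold]
    right = [r for r in rows if r[feature_index] >= threshold]
    return left, right

def weighted_gini(left, right):
    """Impurity of a split: each side's Gini impurity weighted by its size."""
    total = len(left) + len(right)
    return (len(left) / total) * gini_impurity(left) \
        + (len(right) / total) * gini_impurity(right)

def best_split(rows):
    """Try every observed value of every feature as a threshold and return
    (feature_index, threshold, score) for the lowest weighted impurity."""
    best = None
    n_features = len(rows[0]) - 1  # last column is the label
    for i in range(n_features):
        for row in rows:
            left, right = split_rows(rows, i, row[i])
            score = weighted_gini(left, right)
            if best is None or score < best[2]:
                best = (i, row[i], score)
    return best

# Hypothetical sample rows: two numeric features, then the class label.
rows = [
    [5.1, 3.5, "setosa"],
    [4.9, 3.0, "setosa"],
    [6.7, 3.1, "versicolor"],
    [6.3, 2.5, "versicolor"],
]

print(gini_impurity(rows))  # 0.5 for two equally sized classes
print(best_split(rows))     # (0, 6.3, 0.0): feature 0 separates the classes
```

An exhaustive search over observed values is the simplest way to pick a threshold. It is quadratic in the number of rows per feature, which is fine for a dataset the size of iris.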

The performance of this decision tree on the iris dataset is comparable to sklearn's implementation: both score around 95% test accuracy with a max_depth of 5 and a min_samples_split of 4.
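A test accuracy figure like this can be measured without pandas or sklearn. The sketch below shows one way to hold out a test set and score predictions on list-of-lists rows. The helpers `split_train_test` and `accuracy` are hypothetical names chosen for illustration, and the sketch does not call the classifier in tree.py, whose interface is not shown here.

```python
import random

def split_train_test(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and return (train, test).
    The seed makes the split reproducible."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

def accuracy(actual, predicted):
    """Fraction of predictions that match the true labels."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    if not actual:
        return 0.0
    correct = sum(1 for a, p in zip(actual, predicted) if a == p)
    return correct / len(actual)

# Hypothetical labels, standing in for a model's output on a test set.
actual = ["setosa", "versicolor", "setosa", "versicolor"]
predicted = ["setosa", "versicolor", "versicolor", "versicolor"]
print(accuracy(actual, predicted))  # 0.75
```

With a dataset as small as iris, the score depends noticeably on which rows land in the test set, so a single split gives only a rough estimate.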

For additional information about the implementation, see my Medium blog post.

The code for this decision tree was adapted and expanded, as an exercise, from Jason Brownlee's Machine Learning Mastery.


lorischl-otter/decision_tree_by_hand
