How it works

Both classifiers take numerical input features and predict a numerical label. Working, self-explanatory examples can be found in the test folders.

RandomForest

Java API

The 'RandomForest' class contains methods to train an ensemble of decision trees according to Breiman.
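
As a rough illustration of how an ensemble of trees arrives at one label, the sketch below combines per-tree predictions by majority vote, which is the usual aggregation step in Breiman's random forests. It is not taken from this repository: the class name 'MajorityVote' and the tie-breaking rule are assumptions made for the example.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of this project's API.
public class MajorityVote {

    /**
     * Returns the label predicted by the most trees.
     * On a tie, the label that reached the winning count first is kept
     * (an arbitrary choice for this sketch). Returns NaN for empty input.
     */
    public static double vote(double[] treePredictions) {
        Map<Double, Integer> counts = new HashMap<>();
        double best = Double.NaN;
        int bestCount = 0;
        for (double prediction : treePredictions) {
            int count = counts.merge(prediction, 1, Integer::sum);
            if (count > bestCount) {
                bestCount = count;
                best = prediction;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Three trees vote; two of them predict 1.0.
        System.out.println(vote(new double[] {1.0, -1.0, 1.0}));
    }
}
```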

The 'RandomForestBuilder' class offers a more convenient way to set the parameters.

fit

The fit method takes a Java 'DataSet<Tuple2<Double, Vector>>', where the Double is the label and the Vector holds the features, and trains the classifier.

predict

Once the classifier has been trained successfully, the predict method predicts labels for the 'DataSet<Vector>' passed to it.

evaluate

Once the classifier has been trained successfully, you can evaluate its performance with the evaluate method. It requires a 'DataSet<Tuple2<Double, Vector>>'. The feature vectors are used to predict labels, which are then compared to the given labels. The method returns a 'DataSet<Tuple2<Double, Double>>' where the first Double is the predicted label and the second Double is the real label. You can process this result further to compute whatever evaluation you need.
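
One way to process the (predicted, real) pairs further is to count how often each combination occurs. The sketch below does this on a plain Java array rather than a 'DataSet', so it runs without any cluster setup. The class 'ConfusionCounts' is hypothetical and not part of this project.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of this project's API.
public class ConfusionCounts {

    /**
     * Counts how often each (real, predicted) label combination occurs.
     * Each input row is {predicted, real}, mirroring the tuple order
     * described for evaluate. Keys of the result are [real, predicted].
     */
    public static Map<List<Double>, Integer> count(double[][] predictedAndReal) {
        Map<List<Double>, Integer> counts = new HashMap<>();
        for (double[] pair : predictedAndReal) {
            double predicted = pair[0];
            double real = pair[1];
            counts.merge(Arrays.asList(real, predicted), 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        double[][] pairs = { {1.0, 1.0}, {2.0, 1.0}, {2.0, 2.0}, {2.0, 2.0} };
        count(pairs).forEach((key, n) ->
            System.out.println("real=" + key.get(0) + " predicted=" + key.get(1) + " count=" + n));
    }
}
```

With real data, the same counting could be applied after collecting the evaluation result, or expressed as a grouping step on the 'DataSet' itself.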

evaluateBinaryClassification

This method takes the same parameters as evaluate but assumes a binary classification with true encoded as '1.0' and false as '-1.0'. Based on this encoding it calculates accuracy, precision and recall.
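
To make the three metrics concrete, here is a minimal sketch that computes them from (predicted, real) pairs with the '1.0' / '-1.0' encoding. It uses the standard definitions and is not the project's implementation. The class 'BinaryMetrics' and the choice to return 0.0 when a denominator is zero are assumptions.

```java
// Hypothetical helper, not part of this project's API.
public class BinaryMetrics {

    /**
     * Returns {accuracy, precision, recall} for rows of {predicted, real},
     * where 1.0 encodes true and -1.0 encodes false.
     * A metric whose denominator is zero is reported as 0.0 (sketch convention).
     */
    public static double[] compute(double[][] predictedAndReal) {
        int tp = 0, fp = 0, tn = 0, fn = 0;
        for (double[] pair : predictedAndReal) {
            boolean predictedTrue = pair[0] == 1.0;
            boolean realTrue = pair[1] == 1.0;
            if (predictedTrue && realTrue) tp++;
            else if (predictedTrue) fp++;
            else if (realTrue) fn++;
            else tn++;
        }
        int total = tp + fp + tn + fn;
        double accuracy = total == 0 ? 0.0 : (tp + tn) / (double) total;
        double precision = tp + fp == 0 ? 0.0 : tp / (double) (tp + fp);
        double recall = tp + fn == 0 ? 0.0 : tp / (double) (tp + fn);
        return new double[] {accuracy, precision, recall};
    }

    public static void main(String[] args) {
        double[][] pairs = { {1.0, 1.0}, {1.0, -1.0}, {-1.0, -1.0}, {-1.0, 1.0}, {1.0, 1.0} };
        double[] m = compute(pairs);
        System.out.println("accuracy=" + m[0] + " precision=" + m[1] + " recall=" + m[2]);
    }
}
```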

Scala API

There is no dedicated Scala API yet, but the methods described above are available in the 'RandomForestModel' class in the 'randomforest' package. They behave the same way but use native Scala data structures.

DecisionTree

The decision tree uses information gain to find the best split. It supports the same methods as the random forest and behaves the same way, but uses a single distributed decision tree as the underlying classifier.
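
The sketch below shows how information gain can be computed for one candidate split. It assumes the common entropy-based definition and a threshold split on a single numerical feature. The README does not spell out either detail, so treat both as assumptions. The class 'InformationGain' is hypothetical.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of this project's API.
public class InformationGain {

    /** Shannon entropy (base 2) of a list of labels; 0.0 for an empty list. */
    public static double entropy(List<Double> labels) {
        if (labels.isEmpty()) return 0.0;
        Map<Double, Integer> counts = new HashMap<>();
        for (double label : labels) counts.merge(label, 1, Integer::sum);
        double h = 0.0;
        for (int count : counts.values()) {
            double p = count / (double) labels.size();
            h -= p * Math.log(p) / Math.log(2);
        }
        return h;
    }

    /** Gain of splitting on feature <= threshold versus feature > threshold. */
    public static double gain(List<Double> feature, List<Double> labels, double threshold) {
        List<Double> left = new ArrayList<>();
        List<Double> right = new ArrayList<>();
        for (int i = 0; i < feature.size(); i++) {
            (feature.get(i) <= threshold ? left : right).add(labels.get(i));
        }
        double n = labels.size();
        // Parent entropy minus the size-weighted entropy of both children.
        return entropy(labels)
            - (left.size() / n) * entropy(left)
            - (right.size() / n) * entropy(right);
    }

    public static void main(String[] args) {
        List<Double> feature = List.of(1.0, 2.0, 3.0, 4.0);
        List<Double> labels = List.of(-1.0, -1.0, 1.0, 1.0);
        System.out.println("gain at 2.5: " + gain(feature, labels, 2.5));
        System.out.println("gain at 1.5: " + gain(feature, labels, 1.5));
    }
}
```

A tree builder would evaluate such a gain for many candidate thresholds and features and keep the split with the highest value. In the example data, the threshold 2.5 separates the two labels perfectly and therefore scores higher than 1.5.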

Java API

The 'DecisionTreeBuilder' class offers a more convenient way to set the parameters.

Scala API

There is no dedicated Scala API yet, but the methods described above are available in the 'DecisionTreeModel' class.
