Positive-Unlabeled Learning using Random Forests via Recursive Greedy Risk Minimization

We propose new random forest algorithms for PU-learning that recursively and greedily minimise PU-data based estimators of the expected risk. Unbiased (uPU) and nonnegative (nnPU) risk estimators are both supported with either one of the quadratic or logistic loss. Using the quadratic loss and logistic loss are equivalent to using the Gini and entropy impurities in traditional (PN) random forests.

Paper: https://arxiv.org/pdf/2210.08461

How to use PU ET

A minimal working example usage of PU ET is found in run_puet_simple.py. Alternatively, run_puet.py demonstrates how to make use of more functionality. The implementation also supports PN learning, with example given in run_pnet.py.

Requirements

The implementation was created with these packages available. Correct functionality may be achieved with previous versions of packages but this is not tested.

numpy '1.21.2'
scipy '1.7.1'
joblib '1.1.0'
tree.py
trees.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

run_pnet.py

run_pnet.py

run_puet.py

run_puet.py

run_puet_simple.py

run_puet_simple.py

tree.py

tree.py

trees.py

trees.py

Repository files navigation

Positive-Unlabeled Learning using Random Forests via Recursive Greedy Risk Minimization

How to use PU ET

Requirements

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
run_pnet.py		run_pnet.py
run_puet.py		run_puet.py
run_puet_simple.py		run_puet_simple.py
tree.py		tree.py
trees.py		trees.py

jonathanwilton/PUExtraTrees

Folders and files

Latest commit

History

Repository files navigation

Positive-Unlabeled Learning using Random Forests via Recursive Greedy Risk Minimization

How to use PU ET

Requirements

About

Topics

Resources

Stars

Watchers

Forks

Languages