One of the library's major tasks is evaluating model quality and the AutoML objectives.
To that end, metrics are needed for every supported problem type.
Classification is one of these problem types. The library should offer an API for computing any of the metrics below by comparing the predicted values against the ground truth.
Important metrics to cover here:
aucroc: the Area Under the Receiver Operating Characteristic Curve (ROC AUC), computed from prediction scores.
aucprc: the average precision, which summarizes a precision-recall curve as the weighted mean of the precisions achieved at each threshold, with the increase in recall from the previous threshold used as the weight.
accuracy: the accuracy classification score, i.e. the fraction of predictions that match the ground-truth labels.
f1_score (micro, macro, weighted): the F1 score is the harmonic mean of precision and recall. The micro average computes the metric globally by counting the total true positives, false negatives and false positives; the macro average is the unweighted mean of the per-class scores; the weighted average weights each per-class score by the class support.
kappa: Cohen’s kappa, a score that expresses the level of agreement between two annotators on a classification problem.
precision (micro, macro, weighted): precision is the number of true positives divided by the number of true positives plus the number of false positives. The micro variant computes the metric globally by counting the total true positives and false positives.
recall (micro, macro, weighted): recall is the number of true positives divided by the number of true positives plus the number of false negatives. The micro variant computes the metric globally by counting the total true positives and false negatives.
mcc: The Matthews correlation coefficient is used in machine learning as a measure of the quality of binary and multiclass classifications. It takes into account true and false positives and negatives and is generally regarded as a balanced measure which can be used even if the classes are of very different sizes.
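As a rough illustration of what such an API could look like, the sketch below computes the metrics listed above with scikit-learn. The helper name `evaluate_classifier`, its signature, the result keys such as `f1_score_micro`, and the 0.5 decision threshold are assumptions made for this example, not an existing interface of the library. It also assumes a binary problem where `y_proba` holds the positive-class score.

```python
import numpy as np
from sklearn.metrics import (
    accuracy_score,
    average_precision_score,
    cohen_kappa_score,
    f1_score,
    matthews_corrcoef,
    precision_score,
    recall_score,
    roc_auc_score,
)


def evaluate_classifier(y_true, y_proba, threshold=0.5):
    """Hypothetical helper: score binary predictions against the ground truth.

    y_true  : ground-truth labels (0 or 1)
    y_proba : predicted score for the positive class
    """
    y_true = np.asarray(y_true)
    y_proba = np.asarray(y_proba)
    # Hard labels are needed for the threshold-dependent metrics.
    y_pred = (y_proba >= threshold).astype(int)

    results = {
        # Score-based metrics use the raw prediction scores.
        "aucroc": roc_auc_score(y_true, y_proba),
        "aucprc": average_precision_score(y_true, y_proba),
        # Label-based metrics use the thresholded predictions.
        "accuracy": accuracy_score(y_true, y_pred),
        "kappa": cohen_kappa_score(y_true, y_pred),
        "mcc": matthews_corrcoef(y_true, y_pred),
    }
    # One entry per averaging strategy for F1, precision and recall.
    for avg in ("micro", "macro", "weighted"):
        results[f"f1_score_{avg}"] = f1_score(y_true, y_pred, average=avg)
        results[f"precision_{avg}"] = precision_score(y_true, y_pred, average=avg)
        results[f"recall_{avg}"] = recall_score(y_true, y_pred, average=avg)
    return results


y_true = [0, 0, 1, 1]
y_proba = [0.1, 0.4, 0.35, 0.8]
scores = evaluate_classifier(y_true, y_proba)
for name, value in scores.items():
    print(f"{name}: {value:.4f}")
```

A multiclass version would need per-class probabilities and the `multi_class` argument of `roc_auc_score`, which this sketch leaves out.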
AP reference: https://github.com/vanderschaarlab/autoprognosis/blob/main/src/autoprognosis/utils/tester.py