
Add in-training tooling to find a more optimal threshold for binary classification. #2181

Open
justinxzhao opened this issue Jun 22, 2022 · 2 comments
Labels
feature New feature or request release-0.6 Feature to be implemented in v0.6

@justinxzhao
Collaborator

Ludwig uses a default threshold of 0.5 to calculate accuracy for binary classification problems. However, it's quite possible, especially for imbalanced datasets, that a threshold of 0.5 is not the best threshold to use.

The AUC measures the performance of a binary classifier averaged across all possible decision thresholds; the underlying ROC (or precision-recall) curve is commonly used to select a threshold that strikes a better balance between precision and recall.
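For contrast with threshold-dependent metrics like accuracy, AUC itself is threshold-free. A minimal sketch of its rank-statistic formulation in plain numpy (an illustrative implementation, not Ludwig's; ties between scores are ignored for brevity):

```python
import numpy as np

def roc_auc(targets, probabilities):
    # Mann-Whitney formulation of ROC AUC: the probability that a randomly
    # chosen positive example is scored above a randomly chosen negative one.
    order = np.argsort(probabilities)
    ranks = np.empty(len(probabilities))
    ranks[order] = np.arange(1, len(probabilities) + 1)
    n_pos = np.sum(targets == 1)
    n_neg = np.sum(targets == 0)
    return (ranks[targets == 1].sum() - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)

targets = np.array([0, 0, 1, 1])
probabilities = np.array([0.1, 0.4, 0.35, 0.8])
score = roc_auc(targets, probabilities)  # 3 of 4 positive/negative pairs ranked correctly -> 0.75
```

No single threshold appears anywhere in this computation, which is exactly why a separate sweep is needed to pick one.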

One possible algorithmic outline, proposed by @geoffreyangus and @w4nderlust:

import numpy as np

def find_best_threshold(model, output_feature_name, dataset, metric, thresholds=np.arange(0.05, 1.0, 0.05)):
  probabilities = model.predict(dataset)[output_feature_name]['probabilities']
  scores = []
  for threshold in thresholds:
    preds = probabilities[:, 1] > threshold
    metric_score = metric(preds, targets)  # TODO: extract `targets` from `dataset`
    scores.append(metric_score)
  return thresholds[np.argmax(scores)]
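The same sweep can be exercised end-to-end without a Ludwig model. A self-contained sketch on synthetic data (the toy dataset and the plain-numpy F1 implementation are illustrative assumptions, not part of the proposal):

```python
import numpy as np

def f1(preds, targets):
    # Plain-numpy F1; any metric with this (preds, targets) signature works.
    tp = np.sum((preds == 1) & (targets == 1))
    fp = np.sum((preds == 1) & (targets == 0))
    fn = np.sum((preds == 0) & (targets == 1))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def sweep_thresholds(probabilities, targets, metric, thresholds=np.arange(0.05, 1.0, 0.05)):
    # `probabilities` holds positive-class probabilities, shape (n,).
    scores = [metric(probabilities > t, targets) for t in thresholds]
    best = int(np.argmax(scores))
    return thresholds[best], scores[best]

# Imbalanced toy data (~10% positives) whose class scores separate around 0.4,
# so the F1-optimal threshold lands below the 0.5 default.
rng = np.random.default_rng(0)
targets = (rng.random(1000) < 0.1).astype(int)
probabilities = 0.35 * targets + 0.4 * rng.random(1000)
best_threshold, best_score = sweep_thresholds(probabilities, targets, f1)
```

Running the sweep on a validation split rather than the training data would avoid overfitting the chosen threshold.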

By default, the optimal threshold should be calculated at the end of the training phase.

It would also be useful to expose this as a standalone API.

@justinxzhao justinxzhao added feature New feature or request release-0.6 Feature to be implemented in v0.6 labels Jun 22, 2022
@justinxzhao justinxzhao added this to To do in AutoML Jun 23, 2022
@amholler
Collaborator

amholler commented Jun 23, 2022

An example that works on the current code is here:
https://github.com/ludwig-ai/experiments/blob/main/automl/heuristics/santander_customer_satisfaction/eval_util.py
with an example invocation here:
https://github.com/ludwig-ai/experiments/blob/main/automl/heuristics/santander_customer_satisfaction/train_tabnet_imbalance_ros.py

@justinxzhao
Collaborator Author

Largely a duplicate of #2158
