
Document eval_metric in XGBoost #6887

Closed
exalate-issue-sync bot opened this issue May 11, 2023 · 4 comments
@exalate-issue-sync
In https://github.com//pull/6399 we added a notebook describing how to properly use `eval_metric` to speed up XGBoost scoring in an early stopping scenario with frequent scoring.

A reviewer pointed out that we also need to document it clearly in the H2O documentation (https://github.com//pull/6399#pullrequestreview-1160569522), since the early stopping use case is an important factor.

@exalate-issue-sync
Author

hannah.tillman commented: also add `score_eval_metric_only` to params:

Score only the evaluation metric when enabled. This can make model training faster when scoring is frequent (e.g. every iteration). Defaults to `False`.

@exalate-issue-sync
Author

hannah.tillman commented: Quick notes:

Put in the FAQ:

  • How do I use `eval_metric`?

`eval_metric` is calculated on both the training and validation datasets after each iteration.

By default, H2O calculates all metrics appropriate for the given problem. For a binary classification model, H2O will report at least logloss, AUC, and AUCPR.

When early stopping is used, you need to choose one of the built-in early stopping metrics. For consistency across model types and algorithm implementations, these are always calculated by H2O itself and are independent of XGBoost's `eval_metric` implementation.

You don't always need to specify `eval_metric`, but it is beneficial both for frequent scoring and when H2O doesn't provide a suitable built-in metric.

  • eval_metric: Specify the evaluation metric that will be passed to the native XGBoost backend. Must be one of: rmse, rmsle, mae, mape, mphe, logloss, error, error@t, merror, mlogloss, auc, aucpr, ndcg, map, ndcg@n/map@n, ndcg-/map-/ndcg@n-/map@n-, poisson-nloglik, gamma-nloglik, cox-nloglik, tweedie-nloglik, aft-nloglik, interval-regression-accuracy
    See: https://xgboost.readthedocs.io/en/latest/parameter.html#learning-task-parameters

@h2o-ops
Collaborator

h2o-ops commented May 14, 2023

JIRA Issue Details

Jira Issue: PUBDEV-8889
Assignee: hannah.tillman
Reporter: Michal Kurka
State: Resolved
Fix Version: 3.40.0.2
Attachments: N/A
Development PRs: Available

@h2o-ops
Collaborator

h2o-ops commented May 14, 2023

Linked PRs from JIRA

#6492

@h2o-ops h2o-ops closed this as completed May 14, 2023