
fix behavior for default objective and metric #4660

Merged
merged 1 commit into master on Oct 13, 2021

Conversation

StrikerRUS
Collaborator

@StrikerRUS StrikerRUS commented Oct 9, 2021

Fixed #4655.

Configure the metric after the objective, and use the configured objective as the default metric if the user doesn't provide any.
By default, the objective is regression and can be omitted in params:

std::string objective = "regression";
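The idea of the fix can be sketched as follows. This is a minimal illustration, not LightGBM's actual implementation, and `GetMetricTypeSketch` is an invented name: because the metric is now configured *after* the objective, the already-resolved objective can serve as the default metric when the user supplies none.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>
#include <vector>

// Minimal sketch of the fix's idea (not LightGBM's real code):
// configure the metric after the objective, so that when the user
// supplies no metric, the configured objective is used as the default.
void GetMetricTypeSketch(const std::unordered_map<std::string, std::string>& params,
                         const std::string& objective,
                         std::vector<std::string>* metric) {
  metric->clear();
  auto it = params.find("metric");
  if (it != params.end() && !it->second.empty()) {
    metric->push_back(it->second);  // a user-provided metric wins
  } else {
    metric->push_back(objective);   // fall back to the configured objective
  }
}
```

With empty `params` and the default objective `"regression"`, the resolved metric list becomes `{"regression"}` instead of staying empty, which is the behavior change this PR makes.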

@StrikerRUS StrikerRUS changed the title [WIP] fix all default objective and metric fix behavior for default objective and metric Oct 10, 2021
@StrikerRUS StrikerRUS marked this pull request as ready for review October 10, 2021 02:10
Collaborator

@jameslamb jameslamb left a comment

I'm supportive of the core change here, thanks for the quick response to #4655 !

I just left a few small questions.

Also...would you consider adding (fixes #4655) in the PR title? I think it's useful for that information to show up in the entry on release notes, so it's easy for someone reading the release notes to hover over the issue and see a preview of the problem a PR was trying to solve. I think this is useful because PR titles / release note entries are often written to answer the question "what changed?", but as a user reviewing the changelog you also care about "why did this change?".

For example, from https://github.com/microsoft/LightGBM/releases/tag/v3.3.0

[screenshot: release notes entry from v3.3.0]

dumped_model <- jsonlite::fromJSON(bst$dump_model())
expect_identical(bst_from_ds$eval_train(), list())
expect_equal(bst_from_ds$current_iter(), nrounds)
Collaborator

I understand why the eval_train() part of this test changed, but how is this current_iter() change related to the rest of the PR?

I think the line

expect_equal(bst_from_ds$current_iter(), nrounds)

is testing that a Booster created from a Dataset with a Predictor is already considered "fitted", and that the current_iter() attribute reflects the specific training that was done to produce that Predictor.

I think just testing that bst_from_ds is a Booster isn't enough to test that the merging here worked:

private$init_predictor <- train_set$.__enclos_env__$private$predictor
# Check if predictor is existing
if (!is.null(private$init_predictor)) {
# Merge booster
.Call(
LGBM_BoosterMerge_R
, handle
, private$init_predictor$.__enclos_env__$private$handle
)
}
# Check current iteration
private$is_predicted_cur_iter <- c(private$is_predicted_cur_iter, FALSE)
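The behavior the R test asserts can be sketched with a toy model. `PredictorSketch` and `BoosterSketch` are invented names for illustration, not LightGBM classes: a booster built from a dataset that carries a predictor merges that predictor's already-trained iterations, so `current_iter()` starts at `nrounds` rather than 0.

```cpp
#include <cassert>

// Toy sketch (invented names, not LightGBM classes) of the property the
// R test checks: a booster created from a dataset with a predictor is
// already "fitted" and reflects the training that produced the predictor.
struct PredictorSketch {
  int num_iterations;  // rounds already trained to produce this predictor
};

struct BoosterSketch {
  int current_iter = 0;
  void Merge(const PredictorSketch& p) {
    current_iter += p.num_iterations;  // inherit the previous training
  }
};
```

Under this sketch, a freshly merged booster reports the predictor's iteration count, which is why testing only "is a Booster" would be too weak.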

Collaborator

Is line 421 a duplicate (compared with line 419)?

Collaborator Author

That line expect_equal(bst_from_ds$current_iter(), nrounds) is still in the test:

expect_equal(bst_from_ds$current_iter(), nrounds)

As @shiyu1994 correctly noticed, that line was duplicated, so I removed one duplicate.

Collaborator

ohhhh I see! My mistake, it wasn't obvious to me. Thank you.

@@ -85,18 +85,15 @@ void GetObjectiveType(const std::unordered_map<std::string, std::string>& params
}
}

-void GetMetricType(const std::unordered_map<std::string, std::string>& params, std::vector<std::string>* metric) {
+void GetMetricType(const std::unordered_map<std::string, std::string>& params, const std::string& objective, std::vector<std::string>* metric) {
Collaborator

As a general rule, I like to avoid inserting new arguments in the middle of a function signature. In my experience, it leads to fewer mistakes when updating the places that call that function.

I think this is especially important in C/C++ code, where all arguments are provided positionally.

Would you consider adding objective at the end of this signature? Or is there a specific reason that objective needs to come before metric?
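The concern about positional arguments can be illustrated with a hypothetical example. `Configure` and its parameters are invented for illustration, not LightGBM code: when adjacent parameters share a type and a default value exists, a call site written against the old signature can keep compiling with values silently shifted into the wrong slots.

```cpp
#include <cassert>
#include <string>

// Hypothetical example (not LightGBM code). Suppose the old signature was
// Configure(objective, metric), and a new parameter was inserted in the
// middle, with a default supplied for the now-last parameter:
std::string Configure(const std::string& objective,
                      const std::string& boosting,
                      const std::string& metric = "auto") {
  return objective + "|" + boosting + "|" + metric;
}
// A stale two-argument call still compiles, but the value intended as the
// metric now lands in the boosting slot:
//   Configure("regression", "l2")  ->  "regression|l2|auto"
```

This is the kind of silent shift that appending new parameters at the end avoids; here the compiler catches nothing because the types all match.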

Collaborator Author

Yeah, there was a semantic reason. In all functions of this kind (GetXXXType()) in the config code, the "returned" value is the last argument. It is common practice in C++ to specify out-parameters at the very end of the parameter list. Also, semantically the call can be read as "based on params and objective, set metric". Appending objective to the end of the parameters would be more confusing than beneficial, I believe. Given that this API is an internal one, I think there is no problem with inserting objective in the middle of the parameters to keep their semantic order.

Collaborator

ok makes sense to me, thanks very much for the thorough explanation

@jameslamb jameslamb self-requested a review October 12, 2021 21:37
@StrikerRUS
Collaborator Author

would you consider adding (fixes #4655) in the PR title?

Honestly, I'm against inserting (fixes #XXX) in PR titles.

From the functionality perspective, GitHub's auto-closing feature works fine when a triggering word is used in the PR description or commit message.

As for PR titles themselves, I believe they should be as brief and descriptive as possible. Any additional info, such as mentions of the issue a PR resolves, can distract the reader's attention. One can click through to the PR and read the whole description whenever it is really needed. Quite often one PR resolves multiple issues (for instance, dmlc/xgboost#7297). Writing all the issue numbers in a title would be ugly, given that GitHub forces prepending each issue number with the word fixes, wouldn't it? In addition, GitHub recently started to expand issue links into their titles in comments, which can be very long:

[screenshot: an issue link expanded into its long title]

And who knows, maybe they will do the same in titles, not only in comments...

@shiyu1994 shiyu1994 merged commit d130bb1 into master Oct 13, 2021
@StrikerRUS StrikerRUS deleted the fix_default_metric branch October 13, 2021 23:58
@github-actions

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Aug 23, 2023

Successfully merging this pull request may close these issues.

Evaluation results are not added to callbackEnv when metric parameter is left as default