[R] Remove parameters and attributes related to `ntree` and rebase `iterationrange` #9935

david-cortes · 2023-12-29T10:24:33Z

After the introduction of newer tree modalities such as multi-quantile regression, the code for determining the number of trees in a model is no longer correct, and other sections that rely on it might produce incorrect results. I see that parameters referring to number of trees have been deprecated in favor of parameters referring to number of iterations, so I'm making the switch here.

This PR:

- Removes the deprecated argument `ntree_limit` in `predict`.
- Removes the `ntree` attribute and internal accessor, which are no longer needed.
- Fixes incorrect usage of `num_class` to determine prediction shapes, as there are now more ways in which `predict` can output multi-dimensional results.
- Changes the format of `iterationrange` to match R's sequences/ranges.

I've based it on the latest commit of the previous PR #9924, which is a prerequisite for the changes introduced here.

I'm not sure how to make the PR show the diff w.r.t. that other PR, as it's not a branch of this repository, and I cannot open a PR here to merge into a branch on my own repository. This PR will therefore need to be rebased later on to make it mergeable (I think the kind of commits that GitHub generates will also mess up git merge later on, so a rebase will be needed anyway).

hcho3 · 2024-01-03T20:01:20Z

> I'm not sure how to make the PR show the diff w.r.t. that other PR, as it's not a branch of this repository, and I cannot open a PR here to merge into a branch on my own repository, so this PR will need to be rebased later on in order to make it mergeable

It's fine. It's common for us to open pull requests that include the commits from another pull request. Let's try to merge #9924 soon.

david-cortes · 2024-01-11T18:21:32Z

@trivialfis Would be ideal if you could review this PR next.

trivialfis

I'm looking into #9948 as well. It would be great if there were a common reindexing function, either in C or in R, that handles all the translation and can be tested independently.

I find it difficult to verify that all places that need indexing perform the translation consistently.

R-package/R/xgb.Booster.R

david-cortes · 2024-01-14T19:12:38Z

> I'm looking into #9948 as well. It would be great if there were a common reindexing function, either in C or in R, that handles all the translation and can be tested independently.
>
> I find it difficult to verify that all places that need indexing perform the translation consistently.

Do you have a list of places where such reindexing should be applied?

As I see it, currently there are:

- `best_iteration`: this one is base-0 in the C attribute and base-1 in the R attribute, so handling would be quite different from the others.
- `iterationrange`: not too hard to convert, since it always amounts to just subtracting from the initial index.
- Booster slicing: this one requires a very different approach that the others do not share; and if it were re-implemented to take an integer array instead of start/end/step, the handling would also be completely different if we want it to follow R's idioms.
- Class index in the linear coefficients callback (currently base-0): I think it would make more sense for this one to match the numbers in `label`, which are base-0.
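The `best_iteration` case is the simplest one to illustrate. Below is a minimal sketch, written in Python purely for illustration, of the kind of scalar translation that could be tested on its own. The helper names `r_to_c_index` and `c_to_r_index` are hypothetical and not part of xgboost.

```python
def r_to_c_index(r_index: int) -> int:
    """Translate a base-1 (R-style) index into a base-0 (C-style) index.

    Hypothetical helper for illustration only; not an xgboost function.
    """
    if r_index < 1:
        raise ValueError("base-1 indices start at 1")
    return r_index - 1


def c_to_r_index(c_index: int) -> int:
    """Translate a base-0 (C-style) index into a base-1 (R-style) index."""
    if c_index < 0:
        raise ValueError("base-0 indices start at 0")
    return c_index + 1


# The first iteration is 1 on the R side and 0 on the C side.
print(r_to_c_index(1))  # 0
print(c_to_r_index(0))  # 1
```

Keeping both directions in one place means a round-trip test (`c_to_r_index(r_to_c_index(i)) == i`) can catch an inconsistent translation.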

trivialfis · 2024-01-15T09:28:11Z

Thank you for sharing, it might be difficult to gather everything into one place. An additional concern is the categorical data.

It's not just 0-based vs. 1-based indexing; whether the end of a range is exclusive or inclusive is also problematic. It would be great if we could at least find a place to document all the differences so that we can look them up in the future.
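To make the end-of-range issue concrete, here is a minimal sketch in Python (for illustration only). It assumes that R-style ranges are base-1 and inclusive of both ends, like `seq`, and that the C-style side is base-0 and excludes the end. Both function names are hypothetical.

```python
from typing import Tuple


def r_range_to_half_open(start: int, end: int) -> Tuple[int, int]:
    """Convert a base-1 inclusive range [start, end] to base-0 half-open [begin, end).

    Only the start shifts by one: the inclusive base-1 end and the
    exclusive base-0 end are the same number.
    """
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return (start - 1, end)


def half_open_to_r_range(begin: int, end: int) -> Tuple[int, int]:
    """Convert a base-0 half-open range [begin, end) to base-1 inclusive [start, end]."""
    if begin < 0 or end <= begin:
        raise ValueError("expected 0 <= begin < end")
    return (begin + 1, end)


# The first twenty iterations in each convention.
print(r_range_to_half_open(1, 20))  # (0, 20)
# Only the first iteration.
print(r_range_to_half_open(1, 1))   # (0, 1)
```

Under these assumptions both conventions describe the same number of iterations: `end - start + 1` on the inclusive side equals `end - begin` on the half-open side.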

david-cortes · 2024-01-15T18:23:08Z

> Thank you for sharing, it might be difficult to gather everything into one place. An additional concern is the categorical data.
>
> It's not just 0-based vs. 1-based indexing; whether the end of a range is exclusive or inclusive is also problematic. It would be great if we could at least find a place to document all the differences so that we can look them up in the future.

You mean in one of those .rst files from the readthedocs page for developers?

trivialfis · 2024-01-19T02:34:05Z

> You mean in one of those .rst files from the readthedocs page for developers?

Sounds good!

david-cortes mentioned this pull request Jan 7, 2024

[R] Refactor callback structure and attributes #9957

Merged

rebase

4852216

david-cortes force-pushed the remove_ntree branch from b34e0f5 to 4852216 Compare January 11, 2024 18:20

update roxygen

077cb93

trivialfis reviewed Jan 14, 2024

simplify check for iterationrange='all'

bc7251f

trivialfis approved these changes Jan 20, 2024

trivialfis mentioned this pull request Jan 20, 2024

Roadmap for new R interface #9810

Open

27 tasks

trivialfis merged commit c5d0608 into dmlc:master Jan 20, 2024
25 of 29 checks passed

david-cortes mentioned this pull request Jan 30, 2024

[R] Document handling of indexes #10019

Merged
