Fix performance summary bugs for subsampled PSIS-LOO CV #475

fweber144 · 2023-11-09T20:49:15Z

This fixes several bugs (mainly in get_stat()) that were sometimes causing incorrect predictive performance results (i.e., point estimate, standard error, confidence interval) in case of subsampled PSIS-LOO CV. For details, see the commit messages.

…cated.

the original `mu.bs` has no `NA`s.

…s = TRUE`: If `mu.bs` has `NA`s (which is the case for subsampled PSIS-LOO CV if `baseline = "best"`), then `mu` gets modified by line `mu[is.na(mu.bs)] <- NA` and hence `auc.data` needs to be updated as well.

Previously, `NA` was returned as the AUC for the submodels. This was due to `NA`s not being handled correctly in `auc()`.

…rocessing it.

…pled PSIS-LOO CV and `deltas = FALSE`.

subsampled PSIS-LOO CV (`nloo`) with `deltas = TRUE` and `baseline = "best"`. For `baseline = "ref"`, this is only a refactor improving the safety and readability of `get_stat()`'s handling of `NA`s because for `baseline = "ref"`, we should always have no `NA`s in `lppd.bs` and `mu.bs`, so in that case, `n_notna` did not require an adjustment and also because the math operations connecting `mu` and `mu.bs` (analogously for `lppd` and `lppd.bs`) ensured that only the "inner join" of non-`NA` elements (i.e., the set of observations for which both `mu` and `mu.bs` (analogously for `lppd` and `lppd.bs`) are not `NA`) is used. This addresses question 1 from <stan-dev#94 (comment)>.

Previously, ```r weighted.sd(numeric(), numeric()) weighted.sd(NA, NA) weighted.sd(NA, NA, na.rm = TRUE) weighted.sd(0.42, 42) ``` returned `0`, `NA_real_`, `0`, `NaN`, respectively. Now, they return `NA_real_`, `NA_real_`, `NA_real_`, `NA_real_`, respectively, just like ```r sd(numeric()) sd(NA) sd(NA, na.rm = TRUE) sd(0.42) ``` .

`stat %in% c("acc", "pctcorr", "auc")` and `!is.null(y_wobs_test$y_prop)`: `n_notna` was not adapted correctly in that case (because `y_wobs_test$wobs` usually has non-`NA`s at those places where `mu` has `NA`s).

<stan-dev#94 (comment)>.

from PR stan-dev#475: Since `.tabulate_stats()` gets arguments from `summary.vsel()` and friends via `...` and passes them over to `get_stat()`, omitting argument `wcv` would have made it possible for users to modify it.

fweber144 added 15 commits November 8, 2023 11:32

Avoid "end-of-line" comments as they make version control more compli…

14f748a

…cated.

Minor efficiency improvement: Typically (i.e., if baseline = "ref"),

f2f697d

the original `mu.bs` has no `NA`s.

Fix a bug for the AUC with subsampled PSIS-LOO CV (nloo) and `delta…

d109c37

…s = TRUE`: If `mu.bs` has `NA`s (which is the case for subsampled PSIS-LOO CV if `baseline = "best"`), then `mu` gets modified by line `mu[is.na(mu.bs)] <- NA` and hence `auc.data` needs to be updated as well.

Fix a bug for the AUC with subsampled PSIS-LOO CV (nloo):

af50c5b

Previously, `NA` was returned as the AUC for the submodels. This was due to `NA`s not being handled correctly in `auc()`.

auc(): Make it explicit that x should not be used anymore after p…

df32353

…rocessing it.

Enhance comments for subsampled PSIS-LOO CV.

63ea69f

Fix a warning message for the submodel RMSE and AUC in case of subsam…

a976fd2

…pled PSIS-LOO CV and `deltas = FALSE`.

Minor enhancements in get_stat().

8fc0782

Fix a bug for subsampled PSIS-LOO CV with

d4a1a32

`stat %in% c("acc", "pctcorr", "auc")` and `!is.null(y_wobs_test$y_prop)`: `n_notna` was not adapted correctly in that case (because `y_wobs_test$wobs` usually has non-`NA`s at those places where `mu` has `NA`s).

Add a NEWS.md entry for the collection of bug fixes.

d8c9019

Check d_test$y and d_test$y_oscale for NAs.

13def37

Add a TODO comment for subsampled PSIS-LOO CV.

f6e9288

Minor clarification for subsampled PSIS-LOO CV: Resolve

2bd6b4e

<stan-dev#94 (comment)>.

fweber144 mentioned this pull request Nov 9, 2023

Subsampled LOO #94

Open

fweber144 merged commit ca46327 into stan-dev:master Nov 9, 2023

fweber144 deleted the fix_stats_nloo branch November 9, 2023 21:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix performance summary bugs for subsampled PSIS-LOO CV #475

Fix performance summary bugs for subsampled PSIS-LOO CV #475

fweber144 commented Nov 9, 2023

Fix performance summary bugs for subsampled PSIS-LOO CV #475

Fix performance summary bugs for subsampled PSIS-LOO CV #475

Conversation

fweber144 commented Nov 9, 2023