Add more tests for `score()` #502

nikosbosse · 2023-11-24T09:12:38Z

It would be good to add a few more tests for score().

Sample case:
What happens if you only have a single sample?

Quantile case:
what happens if you only

submit the median
a single quantile that is not the median
an asymmetric interval

The text was updated successfully, but these errors were encountered:

seabbs · 2024-02-28T15:46:14Z

Do we have any progress on this? Maybe we could update to a checklist to make this more modular

seabbs · 2024-03-18T10:41:43Z

Anyone keen to chip in here on some infra (maybe @jamesmbaazam :))

nikosbosse · 2024-04-22T06:46:18Z

In the process of adding some tests.

The following leads to a lot of warnings which need to be cleaned up eventually. Some of them are fine and expected, others like the "Column 'dispersion' does not exist to remove" or "! Function execution failed, returning NULL. Error: argument is of length zero." are a bit opaque.

# only one quantile that is not the median 
  onlyonequantile <- example_quantile[quantile_level == 0.3] %>%
    as_forecast()

  expect_error(
    score(onlyonequantile, metrics = metrics_quantile(
      exclude = c("interval_coverage_50", "interval_coverage_90")
    )),
    ""
  )
  expect_warning(
    score(onlyonequantile, metrics = metrics_quantile(
      exclude = c("interval_coverage_50", "interval_coverage_90")
    )),
    "Function execution failed, returning NULL. Error: object 'upper' not found."
  )

1: ! Function execution failed, returning NULL. Error: object 'upper' not found.
2: In `[.data.table`(forecast, , `:=`((metric_name), do.call(run_safely,  ... :
  Column 'wis' does not exist to remove
3: ! Function execution failed, returning NULL. Error: object 'upper' not found.
4: In `[.data.table`(forecast, , `:=`((metric_name), do.call(run_safely,  ... :
  Column 'overprediction' does not exist to remove
5: ! Function execution failed, returning NULL. Error: object 'upper' not found.
6: In `[.data.table`(forecast, , `:=`((metric_name), do.call(run_safely,  ... :
  Column 'underprediction' does not exist to remove
7: ! Function execution failed, returning NULL. Error: object 'upper' not found.
8: In `[.data.table`(forecast, , `:=`((metric_name), do.call(run_safely,  ... :
  Column 'dispersion' does not exist to remove
9: In min(quantile_level[quantile_level > 0.5]) :
  no non-missing arguments to min; returning Inf
10: ! Function execution failed, returning NULL. Error: argument is of length zero.
11: In `[.data.table`(forecast, , `:=`((metric_name), do.call(run_safely,  ... :
  Column 'bias' does not exist to remove
12: ✖ To compute interval coverage deviation, all quantiles must form central symmetric prediction intervals.
ℹ Missing quantiles: 0.7. Returning NA.
13: ✖ In order to compute the absolute error of the median, "0.5" must be among the quantiles given.
ℹ Returning "NA".

Similar for the case of one asymmetric quantile:

 # one asymmetric interval
  oneasymmetric <- example_quantile[quantile_level %in% c(0.2, 0.6)] %>%
    as_forecast()

  expect_warning(
    expect_warning(
      suppressMessages(
        score(onlyonequantile, metrics = metrics_quantile(
          exclude = c("interval_coverage_50", "interval_coverage_90")
        ))
      ),
      "To compute interval coverage deviation, all quantiles must form central symmetric prediction intervals."
    ),
    'In order to compute the absolute error of the median'
  )

nikosbosse · 2024-04-22T08:10:33Z

I spun up new issues, #800 and #801, that came up when trying to address this. Once these are addressed I think it makes sense to circle back and include the two tests I mentioned in the last comment.

nikosbosse · 2024-05-04T16:01:06Z

The "Column 'dispersion' does not exist to remove" errors are addressed in #801.

jamesmbaazam · 2024-05-07T09:24:07Z

Anyone keen to chip in here on some infra (maybe @jamesmbaazam :))

Sorry, I missed this notification.

Issues #502 and #587- Add additional tests

nikosbosse · 2024-07-15T07:01:52Z

Update: some things disappeared, but this has still a few errors/warnings:

onlyonequantile <- example_quantile[quantile_level == 0.3] |>
  as_forecast()

score(onlyonequantile, metrics = metrics_quantile(
  exclude = c("interval_coverage_50", "interval_coverage_90")
))

Working on them :)

* improve error messages, replace warnings with errors * fix tests * add more tests for `score()` * fix failing test --------- Co-authored-by: Sam Abbott <contact@samabbott.co.uk>

nikosbosse changed the title ~~Add more tests for score() for the quantile case~~ Add more tests for score() Nov 24, 2023

nikosbosse mentioned this issue Nov 29, 2023

scoringutils development plan #493

Closed

30 tasks

nikosbosse added the package improvement label Nov 29, 2023

nikosbosse added this to the scoringutils 2.0.0 milestone Nov 29, 2023

nikosbosse added this to scoringutils 2.0 Nov 29, 2023

nikosbosse added the implementation-ready This is ready for implementation label Dec 5, 2023

nikosbosse mentioned this issue Apr 22, 2024

Improve run_safely and apply_metrics() #801

Closed

nikosbosse mentioned this issue Apr 22, 2024

Issues #502 and #587- Add additional tests #802

Merged

9 tasks

nikosbosse added a commit that referenced this issue May 18, 2024

Merge pull request #802 from epiforecasts/add-tests-score

39ac584

Issues #502 and #587- Add additional tests

nikosbosse mentioned this issue Sep 25, 2024

Issue #502 - Add tests and Improve error handling #924

Merged

9 tasks

seabbs closed this as completed in #924 Sep 30, 2024

github-project-automation bot moved this to Done in scoringutils 2.0 Sep 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add more tests for `score()` #502

Add more tests for `score()` #502

nikosbosse commented Nov 24, 2023 •

edited

Loading

seabbs commented Feb 28, 2024

seabbs commented Mar 18, 2024

nikosbosse commented Apr 22, 2024

nikosbosse commented Apr 22, 2024

nikosbosse commented May 4, 2024

jamesmbaazam commented May 7, 2024

nikosbosse commented Jul 15, 2024

Add more tests for score() #502

Add more tests for score() #502

Comments

nikosbosse commented Nov 24, 2023 • edited Loading

seabbs commented Feb 28, 2024

seabbs commented Mar 18, 2024

nikosbosse commented Apr 22, 2024

nikosbosse commented Apr 22, 2024

nikosbosse commented May 4, 2024

jamesmbaazam commented May 7, 2024

nikosbosse commented Jul 15, 2024

Add more tests for `score()` #502

Add more tests for `score()` #502

nikosbosse commented Nov 24, 2023 •

edited

Loading