Implement ways to immediately warn when a forecast object becomes invalid #816

nikosbosse · 2024-05-19T07:52:08Z

I see it has been discussed a bit in #507 but I am uncomfortable with the fact we re-validate the class in the print() method. Not just because it’s inefficient and non-standard, but mostly because it may be the symptom of a deeper issue. Once it has been created, we should ensure (to the best of our ability) that it remains valid no matter what. And the moment it stops being valid, we should throw an error immediately, not wait for the print() method to be called.

This is an inherent weakness of S3 classes, which are loosely defined, but we can add safeguards. This could take the form of a custom [ method which ensures crucial columns are never dropped.

Originally posted by @Bisaloo in #791 (review)


nikosbosse · 2024-05-19T08:04:27Z

Some ways in which a forecast object can become invalid:

you delete one of the protected columns (model, observed, predicted, quantile_level, ...)
you delete a column that is part of the forecast unit, so that a single forecast is no longer uniquely identified (meaning that get_duplicate_forecasts() would fail)
you add additional columns (e.g. you end up with both a quantile_level and a sample_id column)
you rbind additional rows, leading to the same error
you delete rows such that none remain, or such that the only remaining rows have an NA value somewhere (meaning they get filtered out)
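To make the cases above concrete, here is a rough sketch of a check that looks for each of them on a plain data.frame. The helper name find_forecast_problems(), its arguments, and the default set of protected columns are assumptions made for illustration; this is not the package's actual validation code, which works on data.table objects and infers the forecast unit itself.

```r
# Hypothetical helper (not part of scoringutils): collect the reasons why `x`
# would no longer be a valid forecast. `forecast_unit` names the columns that
# identify a single forecast and is supplied by the caller in this sketch.
find_forecast_problems <- function(x, forecast_unit,
                                   protected = c("observed", "predicted")) {
  problems <- character(0)

  # Case 1 and 2a: a protected or forecast-unit column was deleted
  missing_cols <- setdiff(c(protected, forecast_unit), names(x))
  if (length(missing_cols) > 0) {
    problems <- c(problems, paste("missing columns:", toString(missing_cols)))
  }

  # Case 3: columns from two different forecast types are both present
  if (all(c("quantile_level", "sample_id") %in% names(x))) {
    problems <- c(problems, "both quantile_level and sample_id are present")
  }

  # Case 5: no rows remain, or every remaining row contains an NA
  if (nrow(x) == 0 || !any(stats::complete.cases(x))) {
    problems <- c(problems, "no rows without NA values remain")
  }

  # Case 2b and 4: the remaining identifying columns contain duplicate rows
  id_cols <- intersect(c(forecast_unit, "quantile_level", "sample_id"), names(x))
  if (length(id_cols) > 0 && anyDuplicated(x[, id_cols, drop = FALSE]) > 0) {
    problems <- c(problems, "forecasts are not uniquely identified")
  }

  problems
}

# Toy quantile forecast with `model` as the only forecast-unit column
fc <- data.frame(
  model = rep(c("a", "b"), each = 2),
  quantile_level = rep(c(0.25, 0.75), times = 2),
  predicted = c(1, 3, 2, 4),
  observed = 2
)

find_forecast_problems(fc, forecast_unit = "model")            # no problems
find_forecast_problems(fc[, -1], forecast_unit = "model")      # model dropped
find_forecast_problems(rbind(fc, fc), forecast_unit = "model") # rows duplicated
```

Returning a character vector of problems rather than stopping at the first one is just a choice made for this sketch, so that several failure modes can be reported at once.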

I'm not sure we'd be able to catch all of these immediately.
What do you think @Bisaloo @seabbs @sbfnk?

Also is this something we should do before the CRAN release, or possibly afterwards?

seabbs · 2024-05-20T21:24:47Z

I am also not sure how we should do this. Suggestions @Bisaloo?

Bisaloo · 2024-05-21T10:54:00Z

As far as I can tell, all the changes mentioned here are done via [, or a function that internally uses it (to be confirmed for rbind()).

So with a custom [.forecast() method, you should be able to immediately error when the object becomes invalid:

Run NextMethod() ([.data.frame() in most/all cases)
Validate the object
Error if invalid, or return the object if valid
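The three steps above could look roughly like the following. This is a minimal sketch on a base data.frame; assert_forecast_sketch() is a hypothetical stand-in for the package's real validation, and it only checks that the protected columns are still present.

```r
# Hypothetical minimal validator: only checks that protected columns exist.
# It deliberately avoids `[` on the object so it cannot re-trigger the method.
assert_forecast_sketch <- function(x, protected = c("observed", "predicted")) {
  missing_cols <- setdiff(protected, names(x))
  if (length(missing_cols) > 0) {
    stop("Invalid forecast object, missing: ", toString(missing_cols),
         call. = FALSE)
  }
  invisible(x)
}

`[.forecast` <- function(x, ...) {
  # Step 1: let the next method (`[.data.frame` here) do the subsetting
  out <- NextMethod()
  # Step 2 and 3: validate only if the result is still a forecast object.
  # Extracting a single column with drop = TRUE returns a plain vector,
  # which is left alone.
  if (inherits(out, "forecast")) {
    assert_forecast_sketch(out)
  }
  out
}

# Toy object with the class set by hand, for illustration only
fc <- structure(
  data.frame(model = c("a", "b"), predicted = c(1, 2), observed = c(1.5, 1.5)),
  class = c("forecast", "data.frame")
)

kept <- fc[1, ]                            # still valid, returned as usual
try(fc[, c("model", "predicted")])         # errors: `observed` was dropped
```

Note that only `[` is guarded here. Assignment forms such as `$<-` and `[<-` dispatch to separate methods, so something like fc$observed <- NULL would still go unnoticed unless those get their own guards.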

Also is this something we should do before the CRAN release, or possibly afterwards?

I'd say it's strictly speaking a breaking change but it doesn't really change anything in the API so it can possibly wait.

nikosbosse · 2024-06-02T15:25:54Z

@Bisaloo would you possibly be willing to take on this PR? I imagine you'd by far be best placed to tackle it.

Bisaloo · 2024-06-03T06:33:27Z

I'm fully booked until mid-July but could have a go after. Would this work with your timeline?

nikosbosse · 2024-06-13T18:27:23Z

Yes, that would be perfect, merci beaucoup!

seabbs mentioned this issue Jun 11, 2024

Use as.class type functionality to replace epidist_prepare epinowcast/epidist#73

Open
