Adding time-varying methods, under-ascertainment methods and vignettes for all of the above #23

thimotei · 2023-03-14T13:29:34Z

I added a vignette for the static CFR calculation, including two examples. One for Ebola in 1976 and one for the first year of the COVID-19 outbreak in the U.K. While building the vignette, I encounted a couple of issues:

The structure of the output of the ccfr_calculation() function was a numeric vector with names. However, this is difficult to automatically build a table out of using kable(). I came up with a workaround in format_cfr_neatly() here, but it could be improved
I reverted back to the original known_outcomes() calculation, to the version @adamkucharski originally implemented. I played around a lot with various convolution implementations. Specifically the convolve() function in base R, the implementation myself and then @pratikunterwegs didi together and the original one by @adamkucharski. They all produced similar but not exactly the same results (even though I'm fairly confident that mathematically they are equivalent!). I couldn't figure out why and this was the easiest version to understand and work with. I think getting together and discussing the exact implementation we should use would be helpful
I changed the Poisson threshold in the likelihood calculation to 100 total cases. This was to ensure stability when calculating the CFR within the middle of the Ebola outbreak example, where the adjusted CFR exceeds 100%. The CFR being so high is a rare case, so returning Inf may not be an issue and could well be informative. But I thought it would be good to flag it here

adamkucharski

Added some edits to vignette, mostly to improve clarity of illustrations. Also noticed a potential bug with vectors in known_outcomes function.

vignettes/calculate_known_outcomes.Rmd

R/known_outcomes.R

vignettes/calculate_known_outcomes.Rmd

pratikunterwegs · 2023-03-14T17:29:54Z

Looks like a lot of comments already @thimotei, so I'll leave off for now - happy to help making fixes.

R/plot_raw_data.R

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

adamkucharski · 2023-05-02T15:46:25Z

Looks like these are the main test-coverage issues:
format_cfr_neatly is missing an expected type argument in test-format_cfr.R and test-known_outcomes.R

plot_data_and_cfr isn't defined as a function in test-plotting.R

Version issue with expect_snapshot() in test-rolling_cfr.R and test-static_cfr.R (but may just be my local setup)

pratikunterwegs · 2023-05-02T15:47:40Z

Hi @thimotei, just a few comments on the failing tests:

Some imported packages are missing; see the first error that recommends adding

importFrom("graphics", "grid", "legend", "lines", "par", "polygon")
importFrom("grDevices", "adjustcolor")

to the NAMESPACE (or import the {graphics} and other necessary packages in DESCRIPTION).

In known_outcomes(), a test is failing because the cumulative case count is not always increasing - this could be because you have a day or more of 0 case count increases - in this case the test should be corrected so the expectation is >= 0. Another test failing is the one checking for column names, I think, so check what's being returned.
The plot_data_and_cfr() function appears to be missing from the package exports, have you added it to the NAMESPACE using devtools::document()?
The function format_cfr_neatly(scfr_naive) needs an extra argument to be provided, or to have a default value defined - that should fix the error.

If you could fix those first we could take a look at the warnings and notes next - I can also help with this if needed. :)

pratikunterwegs

I've tried to fix the issues here as much as possible to ensure checks pass for now; this includes:

Removed the vignette calculate_known_outcomes.Rmd which uses a number of functions that have either been removed like static_cfr(), or which don't appear to be in the repo like plot_raw_data() - this file is still present on the branch dev/bad-vignette (which is behind the head of this branch, so do not rebase!).
Added the file estimate_severity() which was missing,
Refactored estimate_time_varying() to reduce cyclomatic complexity; the burn-in period is now set to be the mean of the delay distribution by default, with other values possible,
^ This above has mean making some changes to estimate_reporting() as well,
Removed outdated tests, overall test coverage is very low.

It's not super clear to me whether this branch should see a lot more work before being merged into main - there are arguments for doing so and then fixing test coverage and the vignette, as well as fixing those issues first. Hopefully my edits leave this in a better place from which to tackle those issues.

pratikunterwegs · 2023-05-11T13:36:02Z

R/estimate_time_varying.R

+    df_in$cases <- round(
+      zoo::rollmean(df_in$cases, k = smoothing_window, fill = NA)
+    )
+
+    df_in$deaths <- round(
+      zoo::rollmean(df_in$deaths, k = smoothing_window, fill = NA)
+    )


Suggest replacing zoo::rollmean() with stats::runmed() - the median is less sensitive to outliers that might result from things like weekend effects, and this also avoids the dependency on {zoo}.

Now converted to issue #27

pratikunterwegs · 2023-05-11T13:38:20Z

R/plot_epiparameter_distribution.R

+plot_epiparameter_distribution <- function(epidist,
+                                           from = 0,
+                                           to = 30,
+                                           by = 0.1) {


I think {epiparameter} has a plot() method for epidist objects - I wonder whether this function can then be removed. If this is a distinct method that achieves something quite different, it might be worth formally making this an S3 method for epidists.

pratikunterwegs · 2023-05-11T13:44:01Z

R/format_output.R

+format_output <- function(df_in,
+                          estimate_type,
+                          type = NULL) {


Is this function really necessary? Could we not have stuck to the earlier implementation of the severity estimate as a named vector with three values? That handles the pretty printing issue to some extent.

adamkucharski

I've made some tweaks to vignettes for readibility, but new functions seem to be running OK now. Would suggest double-checking the early time-varying UK estimates @thimotei, as CFR=100% – it may be down to patchy early reporting, but a quick sense-check plotting estimated known outcomes vs deaths should help clarify.

vignettes/estimate_time_varying_severity.Rmd

vignettes/estimate_ascertainment.Rmd

adamkucharski · 2023-05-12T22:14:13Z

vignettes/estimate_ascertainment.Rmd

+  severity_baseline = 0.014,
+  correct_for_delays = TRUE
+) |>
+  format_output(estimate_type = "reporting", type = "Under-reporting")


"Percent ascertained" might be clearer than "under-reporting", if type is easily changeable.

Now converted to issue #26

vignettes/estimate_ascertainment.Rmd

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

pratikunterwegs · 2023-05-15T08:04:10Z

Thanks @adamkucharski for taking a look - I have converted some open comments to issues #29 #28 #27 #26 #25, so hopefully those can be fixed in future PRs.

thimotei added 6 commits March 14, 2023 12:30

Removing old versions of functions

0ec0431

Reverting to original CFR method for stability

3af8f4c

Updating formatting functions for use with kable in vignette

f756a09

Reverting to base R plotting removing ggplot2 dependency

f155f89

Adding static CFR vignette

d08239e

Updating package structure

942fbad

thimotei added documentation Improvements or additions to documentation good first issue Good for newcomers labels Mar 14, 2023

thimotei requested a review from pratikunterwegs March 14, 2023 13:29

thimotei assigned sbfnk, adamkucharski, thimotei and pratikunterwegs Mar 14, 2023

adamkucharski reviewed Mar 14, 2023

View reviewed changes

adamkucharski reviewed Mar 22, 2023

View reviewed changes

R/plot_raw_data.R Outdated Show resolved Hide resolved

thimotei and others added 11 commits May 2, 2023 14:52

Update vignettes/calculate_known_outcomes.Rmd

469ca46

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

Update vignettes/calculate_known_outcomes.Rmd

78d779e

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

Update vignettes/calculate_known_outcomes.Rmd

9f0d852

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

Update vignettes/calculate_known_outcomes.Rmd

0c26306

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

Update vignettes/calculate_known_outcomes.Rmd

4fcfe3e

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

Update vignettes/calculate_known_outcomes.Rmd

c2b696a

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

Update vignettes/calculate_known_outcomes.Rmd

6506259

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

Update vignettes/calculate_known_outcomes.Rmd

d62e1e3

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

Update vignettes/calculate_known_outcomes.Rmd

b003ffd

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

Update vignettes/calculate_known_outcomes.Rmd

c052b59

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

Edits to text and examples in the first vignette

a1b625a

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

update to pass lintr and build checks

fc59a37

pratikunterwegs added 12 commits May 11, 2023 10:27

New snapshot test static CFR

bfdd9b9

Re-add data documentation

5a3e24e

Improve documentation

715a795

Explicit namespacing in plot_ fns

3fbeae6

Update pkg infrastructure

ab6966b

Update Rd files

64c3c64

Remove build vignettes

4beea02

Remove extra figures

c264ec9

Add gitignore to vignettes dir

dc95267

Fixes to vignettes without major issues

8565e85

Remove bad vignette

bf3abc2

Remove extra options incl cairo-png use for MacOS

02d673c

pratikunterwegs requested a review from jamesmbaazam May 11, 2023 13:33

pratikunterwegs reviewed May 11, 2023

View reviewed changes

pratikunterwegs requested a review from adamkucharski May 11, 2023 13:55

adamkucharski approved these changes May 12, 2023

View reviewed changes

Vignette text edits

c4dca3e

Co-authored-by: Adam Kucharski <adam.kucharski@lshtm.ac.uk>

This was referenced May 15, 2023

Test estimate_time_varying() #25

Closed

Rethink "type" argument in format_output() #26

Closed

Suggest replacing zoo::rollmean() with stats::runmed() #27

Closed

Rethink plot_epiparameter_distribution() #28

Closed

Rethink format_output() #29

Closed

pratikunterwegs merged commit 592c297 into main May 15, 2023

pratikunterwegs deleted the development branch May 15, 2023 08:04

This was referenced Jun 8, 2023

Pass epidist object instead of delay_pmf #12

Closed

Smoothing option for CFR #1

Closed

This was referenced Jul 28, 2023

Estimation of under-ascertained cases from deaths #9

Closed

Uncertainty in time-varying CFR estimate #6

Closed

pratikunterwegs mentioned this pull request Nov 16, 2023

Second full review cfr v0.1.0 #103

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding time-varying methods, under-ascertainment methods and vignettes for all of the above #23

Adding time-varying methods, under-ascertainment methods and vignettes for all of the above #23

thimotei commented Mar 14, 2023

adamkucharski left a comment

pratikunterwegs commented Mar 14, 2023

adamkucharski commented May 2, 2023

pratikunterwegs commented May 2, 2023

pratikunterwegs left a comment

pratikunterwegs May 11, 2023

pratikunterwegs May 15, 2023

pratikunterwegs May 11, 2023

pratikunterwegs May 11, 2023

adamkucharski left a comment

adamkucharski May 12, 2023

pratikunterwegs May 15, 2023

pratikunterwegs commented May 15, 2023

Adding time-varying methods, under-ascertainment methods and vignettes for all of the above #23

Adding time-varying methods, under-ascertainment methods and vignettes for all of the above #23

Conversation

thimotei commented Mar 14, 2023

adamkucharski left a comment

Choose a reason for hiding this comment

pratikunterwegs commented Mar 14, 2023

adamkucharski commented May 2, 2023

pratikunterwegs commented May 2, 2023

pratikunterwegs left a comment

Choose a reason for hiding this comment

pratikunterwegs May 11, 2023

Choose a reason for hiding this comment

pratikunterwegs May 15, 2023

Choose a reason for hiding this comment

pratikunterwegs May 11, 2023

Choose a reason for hiding this comment

pratikunterwegs May 11, 2023

Choose a reason for hiding this comment

adamkucharski left a comment

Choose a reason for hiding this comment

adamkucharski May 12, 2023

Choose a reason for hiding this comment

pratikunterwegs May 15, 2023

Choose a reason for hiding this comment

pratikunterwegs commented May 15, 2023