Feature missing reference support code #138

seabbs · 2022-07-25T19:58:05Z

This PR adds support code for modelling missing reference dates (#43). It updates epinowcast() to take enw_missing() as an argument (this cannot be optional, as stan complains about missing data even when it has zero dimensions), updates enw_missing() to have a mode in which the missingness model is not on, adds support for missing effects and priors, adds a new grouped version of the observation likelihood to which we can add the likelihood changes from #107, and adds a prototype function for simulating missing data. We should probably look at adding this and enw_incidence_to_cumulative() to the package, but I am aware we are getting lots of exported functions and it may be a bit overwhelming/hard to support in the long term.

Alongside #107 this leaves post-processing as a final support step in terms of supporting missing dates to the same level as non-missing dates.

Note the grouped observation likelihood is likely not quite there yet, as it is tricky keeping track of the indexing. It would be easier if both the grouped and snapshot versions could be used without missing data (as they should return the same thing), but this might be a bit annoying to support internally.
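The expectation that both aggregation schemes agree without missing data can be checked on a toy example. What follows is a minimal Python sketch, not the package's stan code. It assumes a Poisson observation model and a flat vector of observations in which each snapshot occupies one contiguous block. All names (`block_loglik`, `snapshot_loglik`, `grouped_loglik`, `snap_length`, `snap_group`) are hypothetical and chosen for illustration only.

```python
import math


def poisson_lpmf(y, mu):
    """Log probability mass of count y under a Poisson with mean mu."""
    return y * math.log(mu) - mu - math.lgamma(y + 1)


def block_loglik(obs, exp_obs, start, length):
    """Log-likelihood of one contiguous block of the flat vectors."""
    stop = start + length
    return sum(
        poisson_lpmf(y, mu) for y, mu in zip(obs[start:stop], exp_obs[start:stop])
    )


def snapshot_loglik(obs, exp_obs, snap_length):
    """One log-likelihood term per snapshot."""
    out, start = [], 0
    for length in snap_length:
        out.append(block_loglik(obs, exp_obs, start, length))
        start += length
    return out


def grouped_loglik(obs, exp_obs, snap_length, snap_group, n_groups):
    """One log-likelihood term per group, accumulated over its snapshots."""
    out, start = [0.0] * n_groups, 0
    for length, group in zip(snap_length, snap_group):
        out[group] += block_loglik(obs, exp_obs, start, length)
        start += length
    return out


# Toy data (made up): three snapshots, the first two belonging to group 0.
obs = [5, 3, 1, 4, 2, 6, 2]
exp_obs = [4.5, 3.2, 1.1, 4.0, 2.5, 5.5, 2.2]
snap_length = [3, 2, 2]
snap_group = [0, 0, 1]

by_snapshot = snapshot_loglik(obs, exp_obs, snap_length)
by_group = grouped_loglik(obs, exp_obs, snap_length, snap_group, n_groups=2)
```

In this sketch the two versions only differ in how the terms are partitioned, so their totals match up to floating point error; a test of the real functions could compare totals in the same way.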

adrian-lison

Again, great work, really close now to getting the missingness model running.

If I understand correctly, for the missingness likelihood, we

- still need to integrate the ref_miss_prop
- add a matrix storing the without-reference date obs while going through the snapshots
- do the broadcasting as proposed in #107 (Add stan code changes for missing data model) for the without-reference date matrix
- transform the matrix into a vector by log-sum-exping over the columns and
- add another call to obs_lmpfs for this second vector

Anything else?
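The matrix and log-sum-exp steps above can be sketched as follows. This is a Python illustration under stated assumptions, not the stan implementation: it assumes expected observations are held on the log scale by reference date and delay, that the missing proportion varies by reference date, and that the matrix has one row per report date so that the sum runs across the entries of each row. The function names and the orientation of the matrix are hypothetical.

```python
import math


def log_sum_exp(xs):
    """Numerically stable log(sum(exp(x))) for a list of log-scale values."""
    m = max(xs)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(x - m) for x in xs))


def log_exp_obs_missing_ref(log_exp_obs, log_miss_prop):
    """Log expected counts with missing reference date, one per report date.

    log_exp_obs[r][d] is the log expected count for reference date r and
    delay d; log_miss_prop[r] is the log proportion of those counts whose
    reference date is missing. Only report dates t = r + d that fall inside
    the reference window are returned (an assumption of this sketch).
    """
    n_ref = len(log_exp_obs)
    n_delay = len(log_exp_obs[0])
    # Row t collects every (reference date, delay) pair reported on date t.
    mat = [[-math.inf] * n_delay for _ in range(n_ref)]
    for r in range(n_ref):
        for d in range(n_delay):
            t = r + d
            if t < n_ref:
                mat[t][d] = log_exp_obs[r][d] + log_miss_prop[r]
    return [log_sum_exp(row) for row in mat]


# Toy inputs (made up): 3 reference dates, maximum delay of 1.
exp_obs = [[10.0, 5.0], [20.0, 10.0], [30.0, 15.0]]
miss_prop = [0.1, 0.2, 0.5]
log_exp_obs = [[math.log(v) for v in row] for row in exp_obs]
log_miss_prop = [math.log(p) for p in miss_prop]

log_missing = log_exp_obs_missing_ref(log_exp_obs, log_miss_prop)
missing = [math.exp(v) for v in log_missing]
```

The resulting vector would then be passed to a second observation likelihood call alongside the observed counts with missing reference dates.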

inst/examples/germany_missing.R

inst/stan/epinowcast.stan

seabbs · 2022-07-27T09:18:35Z

Yes, I think you have got all the changes we need. We also need some postprocessing work to make plotting/summary etc. part of the tooling. The only other things I want to add to this PR are:

- Some tests that the two delay_ functions return the same thing.
- Exposing the ability to switch between them to the user.
- A warning in enw_missing() indicating that it is not currently supported in the available model.

seabbs · 2022-07-29T21:00:09Z

Note that rather than adding tests, I instead added more lower-level functions. I also used these to update the generated quantities to use the flat data structure.

seabbs · 2022-07-30T17:41:16Z

> still need to integrate the ref_miss_prop

Added in 706388c

codecov · 2022-07-30T18:39:42Z

Codecov Report

Merging #138 (1b054a2) into develop (57d2673) will decrease coverage by 0.06%.
The diff coverage is 91.76%.

@@             Coverage Diff             @@
##           develop     #138      +/-   ##
===========================================
- Coverage    86.20%   86.14%   -0.07%     
===========================================
  Files           12       12              
  Lines         1196     1234      +38     
===========================================
+ Hits          1031     1063      +32     
- Misses         165      171       +6

Impacted Files	Coverage Δ
R/check.R	`73.46% <22.22%> (-11.54%)`	⬇️
R/epinowcast.R	`92.10% <100.00%> (+2.10%)`	⬆️
R/model-modules.R	`99.31% <100.00%> (+0.05%)`	⬆️
R/model-tools.R	`94.21% <100.00%> (+0.46%)`	⬆️

seabbs · 2022-07-30T18:56:17Z

I think everything planned for this PR is now in place, so I have pinged for another review. As there are quite a few png updates in this PR, I think we should go for a squash merge.

A quick summary of what is here (sorry, it got a little out of hand):

- Passes the missing model input into the stan code.
- Adds model machinery for the missingness model effects to stan.
- Updates handling of enw_missing() to make it work with stan.
- Reorganises the stan code slightly to be easier to read, including dropping repeated comments.
- Splits delay_lmpf into two functions, one that is aggregated over snapshots and one over groups. This is exposed to the user as a choice, and the grouped option is forced when the missing model is present.
- Both of these functions now depend on a vectorised version of expected_obs_from_index called expected_obs_from_snaps.
- This has also been used to refactor the generated quantities. I have also switched all parts of this to use the flattened data storage structure.
- Adds regression tests for posterior predictions, nowcasts and key parameters. These are rough and only a first pass, but better than nothing.
- Adds a new example for missing data along with some prototype functions for making missing data (these can likely be refined and added to the package in another PR).

I think that, as long as they are correct, most of these changes are fairly straightforward. There is an argument that we liked the gq being verbose and different from other code as a check on our model, but to be honest I prefer unit tests etc., and hopefully sharing more code will make future additions easier.

Note: Going with a flat structure here poses some potential problems for allocating observations to report dates. One solution would be to pass in a lookup for report dates and groups and iterate over it. I think the proposed matrix approach will still work, but I may have made it harder.
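To make the lookup idea concrete, here is a minimal Python sketch, assuming the flat vector is ordered snapshot by snapshot and that each snapshot is described by its group, its reference date, and its number of delays. None of these names come from the package; `build_report_lookup`, `snap_ref_date` and the other variables are hypothetical.

```python
from collections import defaultdict


def build_report_lookup(snap_group, snap_ref_date, snap_length):
    """Map (group, report date) to the flat indices reported on that date.

    Each snapshot contributes snap_length consecutive entries to the flat
    vector, one per delay d = 0, 1, ..., with report date = ref date + d.
    """
    lookup = defaultdict(list)
    flat_index = 0
    for group, ref_date, length in zip(snap_group, snap_ref_date, snap_length):
        for delay in range(length):
            lookup[(group, ref_date + delay)].append(flat_index)
            flat_index += 1
    return dict(lookup)


def sum_by_report_date(flat_values, lookup):
    """Iterate over the lookup to total flat values per (group, report date)."""
    return {key: sum(flat_values[i] for i in idx) for key, idx in lookup.items()}


# Toy layout (made up): two snapshots in group 0 and one in group 1.
snap_group = [0, 0, 1]
snap_ref_date = [0, 1, 0]
snap_length = [3, 2, 2]
flat_obs = [5, 3, 1, 4, 2, 6, 2]

lookup = build_report_lookup(snap_group, snap_ref_date, snap_length)
totals = sum_by_report_date(flat_obs, lookup)
```

The lookup only needs to be built once from the data, so in a stan setting it could plausibly be passed in as data rather than recomputed, though that design choice is an assumption here.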

seabbs · 2022-08-04T19:14:58Z

Woops!

adrian-lison

Looks all good to me. I added one proposal commit.

inst/stan/epinowcast.stan

seabbs added 11 commits July 25, 2022 11:04

add missing model precursor support

a6974b0

add partial support for missing beta parameters

4b9bd67

add missing effects

a1755b1

model with missing input support compiles

05bfaf1

add simple simulator

688e2c7

build out prototype simulation tooling for missingness

742a329

fit example with missing data

730a0e3

model with missingness turned on compiles and fits

4c3836a

update missing model to be an argument

6f83f33

add test for new no model enw_missing behaviour

691262a

update tests

75244d8

seabbs added enhancement New feature or request high-priority labels Jul 25, 2022

seabbs self-assigned this Jul 25, 2022

seabbs added 4 commits July 25, 2022 20:59

work towards a grouped version of the observation likelihood

c7e56c2

add grouping variable

496d642

typo in example

65e676f

update snapshot tests

b222bcb

seabbs requested a review from adrian-lison July 25, 2022 22:05

seabbs and others added 4 commits July 25, 2022 22:21

update example data for new ordering of variables

6bc9965

fewer more useful stan code comments

789f293

fix enw_missing test

bdff337

Use linspaced_int_array

9404cf8

adrian-lison approved these changes Jul 27, 2022

Minor formatting

6947241

merge changes from develop and fix

28c3b5f

seabbs mentioned this pull request Jul 27, 2022

Add data format converters #144

Closed

4 tasks

seabbs added 2 commits July 29, 2022 10:07

setup testing and more code modularisation

30609f2

reorg newly functional code

17cdcde

debugging from readme update

3acf452

seabbs added 3 commits July 30, 2022 16:40

fix test snapshots and nowcast gq

163b3ac

add some regression tests for gq and parameter estimates of reporting…

40bf8c5

… distributions

pass missing prop to observation likelihood and keep on the log scale

706388c

seabbs mentioned this pull request Jul 30, 2022

Modularise observation model #133

Closed

6 tasks

expose ll aggregation to the user, add tests, update snapshots

ce55817

epinowcast deleted a comment from codecov bot Jul 30, 2022

use percentage difference expectation

e0047aa

update news and readme

45253c2

add missing expect fn

314313a

seabbs requested a review from adrian-lison July 30, 2022 18:59

seabbs added 4 commits July 30, 2022 19:00

add al to news items

828a15d

make regression test rougher

fbc900d

add a tolerance arg to expect_diff_abs_lt_per

77707bc

streamline expectation

f542d1f

This was referenced Jul 31, 2022

Add missing reference model components to delay_group_lmpf and generated quantities #147

Merged

Highlight sparse specification / dense specification #153

Open

adrian-lison added 3 commits August 6, 2022 12:25

Add modules compatibility check

17bde5b

Make missingness warning immediate

b005627

Fix typo

1b054a2

adrian-lison approved these changes Aug 6, 2022

seabbs merged commit ab48d8a into develop Aug 6, 2022

seabbs deleted the feature-missing-support-code branch August 6, 2022 12:04

seabbs mentioned this pull request Aug 11, 2022

Add stan code changes for missing data model #107

Closed

seabbs mentioned this pull request Dec 5, 2022

Support for missing reference dates MVP #43

Closed
