Add initial support for pathfinder #464

seabbs · 2024-05-16T16:55:10Z

Description

This PR adds initial support for Pathfinder. It appears to work fairly well for some models and less well for others (generally based on complexity). Definitely more to do here but I think having this functionality in now is useful.

Some likely contributors who may be interested to review @adrian-lison, @medewitt, @sbfnk.

Something this does not add is pathfinder initialisation for NUTs which I am waiting on support in cmdstanr for.

Try it out with:

library(epinowcast)

pobs <- enw_example("preprocessed")
latest_obs <- enw_example("observations")

path_nowcast <- epinowcast(pobs,
  expectation = enw_expectation(~1, data = pobs),
  fit = enw_fit_opts(enw_pathfinder, pp = TRUE, num_threads = 4, num_paths = 16),
  obs = enw_obs(family = "poisson", data = pobs)
)

plot(path_nowcast, latest_obs)

nuts_nowcast <- epinowcast(pobs,
  expectation = enw_expectation(~1, data = pobs),
  fit = enw_fit_opts(save_warmup = FALSE, pp = TRUE,
	chains = 2, iter_warmup = 1000, iter_sampling = 1000,
  ),
  obs = enw_obs(family = "poisson", data = pobs)
)

plot(nuts_nowcast, latest_obs)

@athowes we should check brms to see if Pathfinder is supported as for the same reasons as its useful here it might be handy to surface in epidist.

Checklist

My PR is based on a package issue and I have explicitly linked it.
I have included the target issue or issues in the PR title in the for Issue(s) issue-numbers: PR title
I have read the contribution guidelines.
I have tested my changes locally.
I have added or updated unit tests where necessary.
I have updated the documentation if required.
My code follows the established coding standards.
I have added a news item linked to this PR.
I have reviewed CI checks for this PR and addressed them as far as I am able.

github-actions · 2024-05-16T17:32:07Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if e75618b is merged into main:

✔️day_of_week_model: 20.5s -> 19.8s [-7.77%, +1.38%]
✔️latent_renewal_model: 23s -> 24.8s [-8.31%, +24.15%]
✔️missingness_model: 1.3m -> 1.31m [-2.22%, +3.77%]
✔️multi_group_latent_renewal_model: 5.78s -> 6.11s [-12.81%, +24.08%]
✔️preprocessing: 508ms -> 512ms [-4.98%, +6.75%]
✔️simple_model: 4.54s -> 4.86s [-10.89%, +24.67%]
✔️simple_negbin_model_with_pp: 5.09s -> 4.9s [-58.97%, +51.61%]
These benchmarks are based on package examples which are available here. Further explanation regarding interpretation and methodology can be found in the documentation of touchstone.

medewitt

One typo, one philosophical. How do we detect the version of CmdStanR installed such that we can alert the user that pathfinder (and eventually laplace) isn't available? Or do we let CmdStan do it for us?

R/model-tools.R

DESCRIPTION

…nder

…tests for enw_sample)

github-actions · 2024-05-16T21:39:17Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if 96444ce is merged into main:

✔️day_of_week_model: 19.7s -> 19.7s [-2.44%, +3.09%]
✔️latent_renewal_model: 23.2s -> 24.7s [-4.86%, +18.2%]
✔️missingness_model: 1.3m -> 1.32m [-4.61%, +7.69%]
✔️multi_group_latent_renewal_model: 5.84s -> 5.74s [-14.69%, +11.13%]
🚀preprocessing: 507ms -> 498ms [-3.18%, -0.36%]
✔️simple_model: 4.78s -> 4.75s [-26.37%, +25.4%]
✔️simple_negbin_model_with_pp: 3.75s -> 4.45s [-10.71%, +48.08%]
These benchmarks are based on package examples which are available here. Further explanation regarding interpretation and methodology can be found in the documentation of touchstone.

github-actions · 2024-05-17T09:50:44Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if dc6c75e is merged into main:

✔️day_of_week_model: 19.5s -> 19.8s [-2.79%, +6.05%]
✔️latent_renewal_model: 25.1s -> 23.6s [-22.19%, +10.42%]
✔️missingness_model: 1.29m -> 1.3m [-2.86%, +3.97%]
✔️multi_group_latent_renewal_model: 5.62s -> 5.72s [-8.83%, +12.47%]
✔️preprocessing: 497ms -> 495ms [-1.58%, +0.66%]
✔️simple_model: 4.44s -> 4.36s [-13.25%, +9.72%]
✔️simple_negbin_model_with_pp: 3.86s -> 4.34s [-25.47%, +50.17%]
These benchmarks are based on package examples which are available here. Further explanation regarding interpretation and methodology can be found in the documentation of touchstone.

codecov · 2024-05-19T17:44:00Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.46%. Comparing base (8d0da65) to head (0fa2690).
Report is 4 commits behind head on main.

❗ Current head 0fa2690 differs from pull request most recent head e19e5f8

Please upload reports for the commit e19e5f8 to get more accurate results.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #464      +/-   ##
==========================================
+ Coverage   97.45%   97.46%   +0.01%     
==========================================
  Files          15       15              
  Lines        2160     2172      +12     
==========================================
+ Hits         2105     2117      +12     
  Misses         55       55

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

github-actions · 2024-05-19T18:07:05Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if 0fa2690 is merged into main:

✔️day_of_week_model: 19.7s -> 19.4s [-7.21%, +4.27%]
✔️latent_renewal_model: 24.3s -> 23s [-12.39%, +1.54%]
✔️missingness_model: 1.28m -> 1.29m [-1.54%, +4.05%]
✔️multi_group_latent_renewal_model: 5.6s -> 5.49s [-13.54%, +9.6%]
✔️preprocessing: 493ms -> 493ms [-1.41%, +1.54%]
✔️simple_model: 4.55s -> 6.13s [-65.14%, +134.39%]
✔️simple_negbin_model_with_pp: 3.81s -> 3.76s [-19.73%, +16.93%]
These benchmarks are based on package examples which are available here. Further explanation regarding interpretation and methodology can be found in the documentation of touchstone.

medewitt

This looks good. I think the method detection is the only think I would recommend. Maybe a cli::cli_abort rather than cli::warn?

R/model-tools.R

DESCRIPTION

github-actions · 2024-05-20T21:08:17Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if e19e5f8 is merged into main:

✔️day_of_week_model: 19.5s -> 19.7s [-4.83%, +6.81%]
✔️latent_renewal_model: 25.1s -> 23.4s [-33.16%, +19.4%]
✔️missingness_model: 1.31m -> 1.31m [-4.5%, +4.53%]
🚀multi_group_latent_renewal_model: 6.37s -> 6.01s [-10.45%, -0.83%]
✔️preprocessing: 500ms -> 500ms [-1.19%, +1.24%]
✔️simple_model: 4.99s -> 4.51s [-32%, +12.79%]
✔️simple_negbin_model_with_pp: 4.78s -> 5.33s [-49.84%, +72.93%]
These benchmarks are based on package examples which are available here. Further explanation regarding interpretation and methodology can be found in the documentation of touchstone.

medewitt

LGTM--can't wait to play with this!

adrian-lison

Looking good! @seabbs, how much did you already play around with it? I just tested it on your example but with a daily random walk model and generation time, it wasn't too bad in terms of expectation, only the uncertainty quantification got worse (intervals very narrow).

When I tested pathfinder a while ago for EpiSewer, I wasn't impressed. It didn't seem to like the renewal model at all... Trends were often considerably off, and there was basically no uncertainty. Interestingly, this got worse when running more paths, results were best when running just one path with a limited number of iterations...

Would be great to get a better understanding of pathfinder's behaviour on time series models and possible tweaks, as this is obviously very useful.

seabbs · 2024-06-03T08:29:50Z

on your example but with a daily random walk model and generation time, it wasn't too bad in terms of expectation, only the uncertainty quantification got worse (intervals very narrow).

Yes, I also only did some limited exploration and found similar. I concluded that its likely useful for prototyping but I wouldn't trust it (on the examples I tested) for anything where I wanted to use the output.

@athowes is interested in potentially doing a more in depth look. I think your point about potential tweaks is a good one as it might be something simple that improves performance.

Interestingly, this got worse when running more paths, results were best when running just one path with a limited number of iterations...

I didn't see this and that is interesting. Suggests instability based on initialisation?

medewitt reviewed May 16, 2024

View reviewed changes

R/model-tools.R Outdated Show resolved Hide resolved

DESCRIPTION Show resolved Hide resolved

seabbs added 12 commits May 16, 2024 21:15

first pass playing at pathfinder

1575c06

doc and add simple enw_pathfinder option

b98ceff

rever simple example and flesh out docs for enw_sample and enw_pathfi…

abd501e

…nder

add end to end test for pathfinder to epinowcast() testing (matching …

07c41b6

…tests for enw_sample)

fix linting

ca7b78e

update wordlist and make sure threads are being passed

2e083fd

fix missing examples

6c8e145

fix indent

0a00097

add missing args back in

109f82b

get rid of empty test and depend on integration test

1fc4400

make stan examples interactive() only

6c53fce

clean up roxygen

6873623

seabbs force-pushed the playing-with-pathfinder branch from 5b351da to 6873623 Compare May 16, 2024 20:15

seabbs added 2 commits May 16, 2024 21:34

revert vignette test

cc57d77

make it possible for pathfinder to be reallllyyy approximate

c9bb41c

make testing of pathfinder even more permissive

dc6c75e

seabbs mentioned this pull request May 19, 2024

Add/Check support for using a Cmdstan fit as initial conditions for a new fit + support passing from pathfinder to NUTS #465

Open

seabbs requested a review from medewitt May 19, 2024 17:29

seabbs enabled auto-merge May 19, 2024 17:29

seabbs added 2 commits May 19, 2024 18:30

Update NEWS.md - add @medewitt as the reviewer.

569ecf3

Update NEWS.md - add PR number and update contributions.

0fa2690

medewitt reviewed May 20, 2024

View reviewed changes

R/model-tools.R Show resolved Hide resolved

DESCRIPTION Show resolved Hide resolved

Update model-tools.R - add in check for pathfinder method

68d0330

seabbs requested a review from medewitt May 20, 2024 20:30

Update model-tools.R - fix line length

e19e5f8

medewitt approved these changes May 21, 2024

View reviewed changes

seabbs added this pull request to the merge queue May 21, 2024

Merged via the queue into main with commit 886b45c May 21, 2024
10 checks passed

seabbs deleted the playing-with-pathfinder branch May 21, 2024 12:44

adrian-lison reviewed May 31, 2024

View reviewed changes

seabbs mentioned this pull request Jun 11, 2024

Continuously benchmark performance CDCgov/Rt-without-renewal#274

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add initial support for pathfinder #464

Add initial support for pathfinder #464

seabbs commented May 16, 2024 •

edited

github-actions bot commented May 16, 2024

medewitt left a comment

github-actions bot commented May 16, 2024

github-actions bot commented May 17, 2024

codecov bot commented May 19, 2024 •

edited

github-actions bot commented May 19, 2024

medewitt left a comment

github-actions bot commented May 20, 2024

medewitt left a comment

adrian-lison left a comment

seabbs commented Jun 3, 2024

Add initial support for pathfinder #464

Add initial support for pathfinder #464

Conversation

seabbs commented May 16, 2024 • edited

Description

Checklist

github-actions bot commented May 16, 2024

medewitt left a comment

Choose a reason for hiding this comment

github-actions bot commented May 16, 2024

github-actions bot commented May 17, 2024

codecov bot commented May 19, 2024 • edited

Codecov Report

github-actions bot commented May 19, 2024

medewitt left a comment

Choose a reason for hiding this comment

github-actions bot commented May 20, 2024

medewitt left a comment

Choose a reason for hiding this comment

adrian-lison left a comment

Choose a reason for hiding this comment

seabbs commented Jun 3, 2024

seabbs commented May 16, 2024 •

edited

codecov bot commented May 19, 2024 •

edited