Implement options for pseudo-time stepping #243

adelinehillier · 2022-04-09T17:04:58Z

This branch builds on glw/pseudotime and adds several options for pseudo-time stepping schemes in the PseudoSteppingSchemes module.

To do:

Add tests

…timocean.jl into ah/adaptive

glwagner · 2022-04-11T18:35:11Z

src/EnsembleKalmanInversions.jl

    iteration_summaries :: S
    resampler :: R
    unconstrained_parameters :: X
    forward_map_output :: G
-    convergence_rate :: C
+    pseudo_stepping :: C
+    precomputed_matrices :: P


Recommdnation: generalize noise_covariance to include storage of precomputed matrice, then define an interface with a few lines:

noise_covariance(eki::EnsembleKalmanInversion) = noise_covariance(eki.noise_covariance) inverse_noise_covariance(eki::EnsembleKalmanInversion) = inverse_noise_covariance(eki.noise_covariance) sqrt_inverse_noise_covariance(eki::EnsembleKalmanInversion) = sqrt_inverse_noise_covariance(eki.noise_covariance) noise_covariance(Γy) = Γy inverse_noise_covariance(Γy) = inv(Γy) sqrt_inv_noise_covariance(Γy) = sqrt(inv(Γy)) struct PrecomputedInverseNoiseCovariance covariance inverse_covariance sqrt_inverse_covariance end PrecomputedInverseNoiseCovariance(Γy) = PrecomputedInverseNoiseCovariance(Γy, inv(Γy), sqrt(inv(Γy))) noise_covariance(pinc::PrecomputedInverseNoiseCovariance) = pinc.covariance inverse_noise_covariance(pinc::PrecomputedInverseNoiseCovariance) = pinc.inverse_covariance sqrt_inv_noise_covariance(pinc::PrecomputedInverseNoiseCovariance) = pinc.sqrt_inverse_covariance

Thanks for the suggestion! I had implemented precomputed_arrays before seeing your reply to #245. My concern is that it might confuse users who probably have no use for inverse_covariance or sqrt_inverse_covariance. Also, I have since decided to add Γθ, μθ, and inv_sqrt_Γθ, which all correspond to EKI prior, to the list of precomputed arrays. We could add a PrecomputedPriors object to store this information, but I don't want to drown users or developers in a jungle of made-up names and abstractions.

Certainly, there's no way around using words and/or symbols to describe things. I prefer names to symbols both as a developer and user, but if others prefer Γθ to noise_covariance then we should go with that.

I agree that we often won't have use for inverse_covariance. So I suggest that when we don't need it, noise_covariance is just Matrix, UniformScaling, etc. When we do need it, noise_covariance is the more complicated object PrecomputedInverseNoiseCovariance. This will solve the problems you've brought up:

Users aren't drowned in excess names, because the objects they build only contain the information they need.

Developers aren't drowned in excess names, because each "noise covariance" has its own implementation. Thus you only need to read the part of the code that's specific to your problem.

…ocean.jl into ah/adaptive

…pping`

Project.toml

glwagner · 2022-06-23T15:01:54Z

src/EnsembleKalmanInversions.jl

@@ -41,6 +42,8 @@ mutable struct EnsembleKalmanInversion{E, I, M, O, S, R, X, G, C, F}
    unconstrained_parameters :: X
    forward_map_output :: G
    pseudo_stepping :: C
+    precomputed_arrays :: P


What are "precomputed_arrays"?

I agree that we should move the fixes to a new PR.

It's all developer stuff; it doesn't affect any of the user scripts.

src/Transformations.jl

src/iteration_summary.jl

src/resampling.jl

…daptive

glwagner · 2022-07-19T15:41:54Z

src/EnsembleKalmanInversions.jl

+- `process`: The Ensemble Kalman process. Default: `Inversion().
+
+- `tikhonov`: Whether to incorporate prior information in the EKI objective via Tikhonov regularization.
+    See Chada et al. "Tikhonov Regularization Within Ensemble Kalman Inversion." SIAM J. Numer. Anal. 2020.


Can you clarify when this is used? Does this pertain to adaptive stepping (eg to a particular scheme?) or does this pertain to information stored in the iteration summaries?

tikhonov determines whether Tikhonov regularization (corresponding to the prior misfit term $\frac{1}{2}{\left| {\Gamma_\theta}^{-\frac{1}{2}}(\theta - \mu_{\theta}) \right|}^2$ in the EKI objective function $\frac{1}{2}{\left| {\Gamma_y}^{-\frac{1}{2}}(y - G(\theta)) \right|}^2 + \frac{1}{2}{\left| {\Gamma_\theta}^{-\frac{1}{2}}(\theta - \mu_{\theta}) \right|}^2$) happens. If it is false then EKI only considers the data misfit $\frac{1}{2}{\left| {\Gamma_y}^{-\frac{1}{2}}(y - G(\theta)) \right|}^2$. Certain arrays have to be augmented if one wishes to incorporate the Tikhonov term, hence these lines in PseudoSteppingSchemes.jl:

observations(eki) = eki.tikhonov ? eki.precomputed_arrays[:y_augmented] : eki.mapped_observations obs_noise_covariance(eki) = eki.tikhonov ? eki.precomputed_arrays[:Σ] : eki.noise_covariance inv_obs_noise_covariance(eki) = eki.tikhonov ? eki.precomputed_arrays[:inv_Σ] : eki.precomputed_arrays[:inv_Γy]

Ideally these augmented arrays should only be computed once, hence the need to store some of the precomputed_arrays.

glwagner · 2022-07-19T15:42:44Z

src/EnsembleKalmanInversions.jl

+                                        :η_mean_augmented => vcat(zeros(length(y)), -μθ),
+                                        :Σ => Σ, 
+                                        :inv_Σ => inv(Σ),
+                                        :inv_sqrt_Σ => inv(sqrt(Σ)))


Can you add a comment that documents what each of these properties mean? When we have a type that represents this data, we can add that to the docstring.

If I were to implement an object with non-symbol attributes to store all the information in precomputed_arrays, it would look something like this:

struct PrecomputedArrays{M, T} inverse_observation_noise_covariance :: M inverse_sqrt_observation_noise_covariance :: M prior_covariance :: M inverse_sqrt_prior_covariance :: M prior_means :: T augmented_mapped_observations :: T augmented_observation_noise_mean :: M augmented_observation_noise_covariance :: M inverse_augmented_observation_noise_covariance :: M inverse_sqrt_augmented_observation_noise_covariance :: M end

Can you also report which pseudo-time-stepping methods require which data?

glwagner · 2022-07-19T15:43:14Z

src/EnsembleKalmanInversions.jl

+                          pseudo_Δt = nothing,
+                          pseudo_stepping = eki.pseudo_stepping,
+                          covariance_inflation = 0.0,
+                          momentum_parameter = 0.0)


Alignment is off here

glwagner · 2022-07-19T15:43:24Z

src/EnsembleKalmanInversions.jl

+"""
+    pseudo_step!(eki::EnsembleKalmanInversion;
+                pseudo_Δt = eki.pseudo_Δt,
+                pseudo_stepping = eki.pseudo_stepping)


Alignment is off in this docstring

src/Transformations.jl

glwagner

We just need a bit more documentation so that other developers can contribute, modify, and refactor this code in the future. Ideally, we don't use dictionaries for important properties. Something that isn't clear to me is whether precomputed_arrays and precomputed_augmented_arrays are needed for all pseudostepping schemes, or just some. It would seem that they can't always be needed, since they didn't exist previously.

adelinehillier and others added 25 commits April 8, 2022 19:34

Add PseudoSteppingSchemes options

eca7f4f

Add inverse_normalize! convenience for use in GPR

98b7f1f

Update adaptive_step_parameters arguments

9828d46

Merge branch 'glw/pseudotime' of https://github.com/CliMA/ParameterEs…

377ea5f

…timocean.jl into ah/adaptive

Merge branch 'glw/pseudotime' of https://github.com/CliMA/ParameterEs…

e567d38

…timocean.jl into ah/adaptive

Add LineSearches to dependencies

856aca0

Use outer constructors instead

cb05b56

Merge branch 'glw/pseudotime' of https://github.com/CliMA/ParameterEs…

0a9b121

…timocean.jl into ah/adaptive

Refactor function arguments

3117be1

Merge branch 'glw/pseudotime' of https://github.com/CliMA/ParameterEs…

0d42ee5

…timocean.jl into ah/adaptive

Typo

0b792cf

Fix cholesky factorization bug

0441cd4

Updates

dbf32fd

Typos

91690dd

Add tests

e110e69

Cleanup

4d357ec

Add precomputed_matrices attribute to EnsembleKalmanInversion

580505c

Use precomputed matrices

7ac7afb

Return mean and variance information of GP

25529ac

Reduce redundant computations

f32447d

Standardize GP inputs

18c84ca

Typo

9ca29e9

Typo

3de8554

update manifest

36a99eb

Update Project.toml

06f84da

glwagner reviewed Apr 11, 2022

View reviewed changes

adelinehillier added 4 commits April 11, 2022 16:37

Typo

378c9cc

Rebuild manifest

21c5c79

Merge branch 'ah/adaptive' of https://github.com/CliMA/ParameterEstim…

b4c8652

…ocean.jl into ah/adaptive

Add inv_sqrt_Γθ and μθ to precomputed_arrays

be7f52a

adelinehillier added 9 commits June 16, 2022 16:32

Uncomment tests

ac4a244

Default pseudo_Δt to nothing; else Number overrides `pseudo_ste…

6d590ea

…pping`

Replace Default with ThresholdedConvergenceRatio

4043151

Replace Default with ThresholdedConvergenceRatio

b355f15

Add test for pseudo_Δt == nothing

077bc72

Cleanup

3075861

Default pseudo_stepping to nothing

899f257

Update tests

351727d

Fix test

41e4679

adelinehillier requested a review from glwagner June 19, 2022 20:17

glwagner reviewed Jun 23, 2022

View reviewed changes

Project.toml Show resolved Hide resolved

glwagner reviewed Jun 23, 2022

View reviewed changes

src/Transformations.jl Outdated Show resolved Hide resolved

glwagner reviewed Jun 23, 2022

View reviewed changes

src/iteration_summary.jl Outdated Show resolved Hide resolved

glwagner reviewed Jun 23, 2022

View reviewed changes

src/iteration_summary.jl Show resolved Hide resolved

glwagner reviewed Jun 23, 2022

View reviewed changes

src/resampling.jl Outdated Show resolved Hide resolved

adelinehillier and others added 6 commits July 17, 2022 23:10

Fix RescaledZScore

3fcb3a2

Merge remote-tracking branch 'origin/ah/fix-rescaledzscore' into ah/a…

9500ee9

…daptive

inverse_normalize! -> denormalize!

24dd883

parameters_unconstrained -> unconstrained_parameters

653fc16

Remove debugging @Assert

f9b714e

Accidental autoreplace

ffae5fd

adelinehillier requested a review from glwagner July 18, 2022 23:31

glwagner reviewed Jul 19, 2022

View reviewed changes

src/Transformations.jl Show resolved Hide resolved

glwagner approved these changes Jul 19, 2022

View reviewed changes

adelinehillier merged commit a5ecff2 into main Aug 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement options for pseudo-time stepping #243

Implement options for pseudo-time stepping #243

adelinehillier commented Apr 9, 2022 •

edited

glwagner Apr 11, 2022

adelinehillier Apr 12, 2022

glwagner May 2, 2022

glwagner Jun 23, 2022

adelinehillier Jun 23, 2022

adelinehillier Jun 23, 2022

glwagner Jul 19, 2022

adelinehillier Jul 20, 2022 •

edited

glwagner Jul 19, 2022

adelinehillier Jul 20, 2022 •

edited

glwagner Aug 4, 2022

glwagner Jul 19, 2022

glwagner Jul 19, 2022

glwagner left a comment

Implement options for pseudo-time stepping #243

Implement options for pseudo-time stepping #243

Conversation

adelinehillier commented Apr 9, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

adelinehillier Jul 20, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

adelinehillier Jul 20, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

glwagner left a comment

Choose a reason for hiding this comment

adelinehillier commented Apr 9, 2022 •

edited

adelinehillier Jul 20, 2022 •

edited

adelinehillier Jul 20, 2022 •

edited