Add natural gradient variational inference algorithms #211

Conversation
```julia
    subsampling::Sub = nothing
end

"""
```
This portion has been moved to a separate file algorithms/gauss_expected_grad_hess.jl.
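(Judging purely from the filename, such a file would presumably compute Gaussian expectations of gradients and Hessians. For a Gaussian $q = \mathcal{N}(m, \Sigma)$, these are classically related through the Bonnet and Price identities; this is an inference from the name, not a description of the actual file contents:)

```latex
\nabla_m \, \mathbb{E}_{q}\left[f(z)\right] = \mathbb{E}_{q}\left[\nabla_z f(z)\right],
\qquad
\nabla_\Sigma \, \mathbb{E}_{q}\left[f(z)\right] = \tfrac{1}{2}\, \mathbb{E}_{q}\left[\nabla_z^2 f(z)\right].
```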
AdvancedVI.jl documentation for PR #211 is available at:
Benchmark Results

| Benchmark suite | Current: 49236af | Previous: f7f965a | Ratio |
|---|---|---|---|
| normal/RepGradELBO + STL/meanfield/Zygote | 3891957202 ns | 3981218216.5 ns | 0.98 |
| normal/RepGradELBO + STL/meanfield/ReverseDiff | 1135166344 ns | 1149188698 ns | 0.99 |
| normal/RepGradELBO + STL/meanfield/Mooncake | 1182439886 ns | 1191740601 ns | 0.99 |
| normal/RepGradELBO + STL/fullrank/Zygote | 3887266131.5 ns | 3944970212.5 ns | 0.99 |
| normal/RepGradELBO + STL/fullrank/ReverseDiff | 1638470865.5 ns | 1690940595 ns | 0.97 |
| normal/RepGradELBO + STL/fullrank/Mooncake | 1233128801 ns | 1237776514 ns | 1.00 |
| normal/RepGradELBO/meanfield/Zygote | 2757037062.5 ns | 2762774402 ns | 1.00 |
| normal/RepGradELBO/meanfield/ReverseDiff | 783215434 ns | 791093181 ns | 0.99 |
| normal/RepGradELBO/meanfield/Mooncake | 1070456904 ns | 1084249199 ns | 0.99 |
| normal/RepGradELBO/fullrank/Zygote | 2801742993.5 ns | 2822211650 ns | 0.99 |
| normal/RepGradELBO/fullrank/ReverseDiff | 969649118 ns | 991218524 ns | 0.98 |
| normal/RepGradELBO/fullrank/Mooncake | 1105595092 ns | 1087435619 ns | 1.02 |
| normal + bijector/RepGradELBO + STL/meanfield/Zygote | 5552914513 ns | 5523158001 ns | 1.01 |
| normal + bijector/RepGradELBO + STL/meanfield/ReverseDiff | 2361965592 ns | 2456796214 ns | 0.96 |
| normal + bijector/RepGradELBO + STL/meanfield/Mooncake | 4005359864.5 ns | 3998343804 ns | 1.00 |
| normal + bijector/RepGradELBO + STL/fullrank/Zygote | 5553126335 ns | 5543012671 ns | 1.00 |
| normal + bijector/RepGradELBO + STL/fullrank/ReverseDiff | 3060106239.5 ns | 3121884284 ns | 0.98 |
| normal + bijector/RepGradELBO + STL/fullrank/Mooncake | 4124451867.5 ns | 4204003867.5 ns | 0.98 |
| normal + bijector/RepGradELBO/meanfield/Zygote | 4200797898.5 ns | 4283445922 ns | 0.98 |
| normal + bijector/RepGradELBO/meanfield/ReverseDiff | 2008641069 ns | 2093976830 ns | 0.96 |
| normal + bijector/RepGradELBO/meanfield/Mooncake | 3833597618.5 ns | 3895280534.5 ns | 0.98 |
| normal + bijector/RepGradELBO/fullrank/Zygote | 4279512628.5 ns | 4390742008.5 ns | 0.97 |
| normal + bijector/RepGradELBO/fullrank/ReverseDiff | 2272013350 ns | 2342918998 ns | 0.97 |
| normal + bijector/RepGradELBO/fullrank/Mooncake | 4002567444.5 ns | 4036735807.5 ns | 0.99 |
This comment was automatically generated by workflow using github-action-benchmark.
sunxd3 left a comment
not sure all of these are correct, but worth taking a look?
sunxd3 left a comment
tiny things, looking really good
Thanks, looks good! (Still wait till tests pass?)
@sunxd3 Thank you for thoroughly reviewing and spotting various mistakes!
This adds the following natural gradient VI algorithms:
Natural gradient VI (NGVI) is a family of algorithms that correspond to mirror descent under a Bregman divergence. Since this pseudo-metric is a divergence between distributions, NGVI can largely be viewed as a measure-space algorithm, and, empirically, it tends to converge faster than BBVI/ADVI. However, its update rules also involve quantities defined in terms of the variational parameters, so it is not a fully measure-space algorithm. As a result, design decisions about parametrizations and update rules lead to genuinely different implementations (hence the two algorithms in this PR). Furthermore, NGVI is restricted to (mixtures of) exponential-family variational families; this PR implements only the Gaussian variant. Another downside is that the update rules tend to involve operations that are both costly ($\mathrm{O}(d^3)$ for a $d$-dimensional target) and sensitive to numerical error.
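As a concrete sketch of where the cubic cost comes from: for a full-rank Gaussian family $q_t = \mathcal{N}(m_t, S_t^{-1})$ parametrized by its mean and precision, one common NGVI update (the Bayesian-learning-rule form; the symbols here are illustrative and not necessarily the PR's actual notation) reads

```latex
S_{t+1} = (1 - \rho)\, S_t + \rho\, \mathbb{E}_{q_t}\!\left[-\nabla_z^2 \log p(z)\right],
\qquad
m_{t+1} = m_t + \rho\, S_{t+1}^{-1}\, \mathbb{E}_{q_t}\!\left[\nabla_z \log p(z)\right],
```

where $\rho$ is the step size. The linear solve against $S_{t+1}$ (and any factorization needed to sample from $q_{t+1}$) is $\mathrm{O}(d^3)$, and the convex-combination update only preserves positive definiteness of $S$ for sufficiently small $\rho$, which is one motivation for the square-root and Cholesky-factor parametrizations cited in the footnotes.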
This addresses #1
Footnotes

1. Khan, M., & Lin, W. (2017). Conjugate-computation variational inference: Converting variational inference in non-conjugate models to inferences in conjugate models. AISTATS.
2. Khan, M. E., & Rue, H. (2023). The Bayesian learning rule. Journal of Machine Learning Research, 24(281), 1–46.
3. Kumar, N., Möllenhoff, T., Khan, M. E., & Lucchi, A. (2025). Optimization guarantees for square-root natural-gradient variational inference. TMLR.
4. Lin, W., Dangel, F., Eschenhagen, R., Bae, J., Turner, R. E., & Makhzani, A. (2024). Can we remove the square-root in adaptive gradient methods? A second-order perspective. ICML.
5. Lin, W., Duruisseaux, V., Leok, M., Nielsen, F., Khan, M. E., & Schmidt, M. (2023). Simplifying momentum-based positive-definite submanifold optimization with applications to deep learning. ICML.
6. Tan, L. S. (2025). Analytic natural gradient updates for Cholesky factor in Gaussian variational approximation. JRSS:B.