
Add a new intro tutorial #2991

Merged: 23 commits merged into dev from new-intro-tutorial on Dec 14, 2021
Conversation

@eb8680 (Member) commented Dec 12, 2021

open in nbviewer

Derived from Bayesian regression tutorials

Tasks:

  • Draft tutorial
  • Mention pyro.condition in sample section
  • Add better introduction and conclusion/next steps
  • Support CI smoke testing
  • Table of contents
  • Fix seaborn warnings
  • Add images separately if necessary
  • Fix model rendering
  • Check all links in notebook and add any missing ones e.g. to SVI tutorials
  • Update examples homepage sphinx
  • Deprecate existing intro tutorials
  • Update links to intro tutorial(s) in other documentation
  • Rename file to intro_long.ipynb
  • Fix Sphinx rendering of links around code markdown
  • Check and fix HTML rendering locally

@martinjankowiak (Collaborator)

cool, looks good! will take a closer look tomorrow, but two comments from a quick perusal:

  • would be nice to have a little more intro at the top. what is the purpose of the tutorial etc
  • given the length probably nice to add a table of contents linking to various sections

@martinjankowiak (Collaborator) commented Dec 13, 2021

more detailed comments/questions/etc:

  • is there any material from the previous intro tutorials worth keeping (for elsewhere)? importance sampling? geometric distribution? pyro.condition? something else? obviously no need to block this PR but something we should think about...
  • smoke_test = ('CI' in os.environ) etc. (see the sketch after this list)
  • typo 'observed variabels'
  • 'we can efficiently compute the pointwise log probability density' => add log to eqn
  • put in a plated graphic after you introduce "plate notation" in bold?
  • is it best to avoid lambdas in params in an intro tutorial? or at least explain why you're using them?
  • "would produce calibrated" well only if the model is well-specified...
  • some caveats may be more confusing than helpful for an intro tutorial? "Inference algorithms in Pyro change its behavior,"
  • what does this mean? "when any random variable is observed, the meaning of every other pyro.sample statement in a model changes"
  • "it stores its argument" -> "it stores init"
  • nit: "to perform them in parallel in a single" them = ?
  • "random scale parameter σy for the output" => "that controls the observation noise" or something?
  • you're using both \sigma and \sigma_y
  • the part around "A theoretically appealing choice..." flows a bit strangely. maybe proceed with "If we're going to somehow convert Bayesian inference into an optimization problem, we need an objective function or loss function." or some other glue. on the same note the first KL figure is just sort of plopped in there without discussion. tie into discussion? walk through figure?
  • discuss the rendered model for custom_guide i.e. independence of nodes
  • "The guide alone does not fully specify an inference algorithm, however. " connect back to graphic? something like "We've defined an initial distribution and a variational family of distributions but now we have to move the initial distribution towards the posterior by solving an optimization problem"
  • i would argue we should avoid too large learning rates (e.g. 0.03) in intros even if it's fine for a particular problem because it may suggest bad habits to users
  • interpret the locs and scales printout for the reader. loc is mean of normal etc
  • discuss use of plate for sampling from autoguide?
  • "this is done by first generating samples for the unobserved sites in " maybe avoid "sites" or explain what you mean by site? this appears to be your only usage
  • comment on log_gdp=None in svi_samples?
  • section heading: redoing => revisiting?
  • does the rendered model for the mvn require more explanation? i.e. discuss the auxiliary latent that's introduced by the autoguide?
  • "via the posterior_samples keyword argument instead of passing the guide as above" => "as in the previous example"?
  • close by pointing to a few "next step" tutorials?
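
For reference, the CI smoke-testing pattern mentioned in the second bullet above is roughly the following (a minimal sketch; the step counts are illustrative, not the notebook's actual values):

```python
import os

# Detect whether we are running under CI: if so, shrink the workload so the
# notebook acts as a quick smoke test rather than a full training run.
smoke_test = "CI" in os.environ
num_steps = 2 if smoke_test else 2000  # illustrative step counts
```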

@eb8680 (Member, Author) commented Dec 13, 2021

@martinjankowiak thanks, addressed most of your comments and added the rest to the PR todos.

@eb8680 (Member, Author) commented Dec 13, 2021

This is ready for more content review, the other tasks are mostly boilerplate.

@martinjankowiak (Collaborator)

looks great!!! a bit long, but such is the price of thoroughness...

  • what's the new file name going to be? intro?
  • remove pip install before merging?
  • nit: "Probabilistic models in Pyro are given" => "specified as", "encoded as", ...
  • nit: "strongly suggested" => suggests
  • "when any pyro.sample statement is observed, the meaning of every other pyro.sample statement in a model changes" i still think this is confusing. as i see it it's not so much that that every other statement in the model changes but rather that queries on the model (e.g. computing the posterior) change
  • nit: "that does not depend on q" -> q_phi
  • nit: "... return a dictionary of values for each pyro.sample site they contain...." maybe add "(which are themselves functions)" or the like (to make it clear you're talking about a vanilla python return and not some mysterious pyro internal thing; see the sketch after this list)
  • "...which repeats and vectorizes the sampling operations..." i guess the caveat as always is that that only works if the model is vectorized correctly. guess no need to mention here...
  • nit: add jax link
  • supernit: remove the dead cell at the end : )
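
On the bullet about guides returning a dictionary of values for each pyro.sample site, a minimal sketch of the pattern (a toy guide, not the tutorial's custom_guide):

```python
import torch
import pyro
import pyro.distributions as dist
from pyro.distributions import constraints

def toy_guide(data):
    # Variational parameters registered in Pyro's parameter store.
    loc = pyro.param("loc", torch.tensor(0.0))
    scale = pyro.param("scale", torch.tensor(1.0), constraint=constraints.positive)
    # A guide is an ordinary Python function: returning the sampled values as a
    # plain dict keyed by sample-site name is a vanilla Python return, not some
    # mysterious Pyro internal.
    weight = pyro.sample("weight", dist.Normal(loc, scale))
    return {"weight": weight}
```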

@fritzo mentioned this pull request Dec 14, 2021
@eb8680 (Member, Author) commented Dec 14, 2021

> what's the new file name going to be? intro?

How about intro_long?

@eb8680 (Member, Author) commented Dec 14, 2021

Ok, I think I've finished everything, and this should be OK to merge pending final reviews. I'd prefer to merge an imperfect version and cut a release to update the website and docs, rather than iterate on this PR too much more, so that the old intro tutorials are moved out of sight of new users.

@eb8680 (Member, Author) commented Dec 14, 2021

"when any pyro.sample statement is observed, the meaning of every other pyro.sample statement in a model changes" i still think this is confusing. as i see it it's not so much that that every other statement in the model changes but rather that queries on the model (e.g. computing the posterior) change

This is the one thing that I don't have an obvious idea for improving, suggestions and/or direct edits are welcome.

:name: deprecated

intro_part_i
intro_part_ii
@eb8680 (Member, Author) commented on the diff:

I kept these and stuck them under a "Deprecated" header because there are links on the forum to the old intro tutorials that I don't want to break (yet).

@martinjankowiak (Collaborator)

@eb8680 looks great. i'd suggest this last change but we can merge as is if you prefer:

"the meaning of every other sample statement in a model changes following Bayes' rule" bayes rule is good. how about "...the cumulative effect of all the sample statements in a model changes following Bayes' rule"

@fritzo (Member) commented Dec 14, 2021

Looks great, this is a big improvement!

> A note on performance: this optimization process is surprisingly slow

Whoa, no need to recommend switching to JAX; we just need to tune the optimizer and use multiple particles. This is a tiny model, and inference should easily run in 1000 steps and under 30 seconds.

nits:

  • pyro.enable_validation(True) is no longer necessary; I recommend removing it

@fritzo (Member) commented Dec 14, 2021

Can I tune the optimizers and add %%time to the inference cells or something? I'd really like to give users a good first impression.

@eb8680 (Member, Author) commented Dec 14, 2021

@fritzo sure, although I think it's still going to be pretty slow. I had also turned down the learning rates at @martinjankowiak's suggestion but you could turn them up again.

@martinjankowiak (Collaborator)

the issue is that users will copy-paste code, change models, and then generate lots of forum questions wondering why their ELBOs don't converge with a giant LR. i'd recommend not going above lr = 0.005, to discourage high LRs as a default choice

@eb8680 (Member, Author) commented Dec 14, 2021

@martinjankowiak could you run make html on the tutorials locally and check the rendered HTML for glaring errors? I did this and I think I fixed everything, but a second pair of eyes couldn't hurt.

@fehiepsi (Member) left a comment

Looks great to me!! Some minor nits:

  • Add plt.show() at the end of cell 17 to remove the text <matplotlib.legend.Legend at 0x7f93ab767090>
  • The first formula in section "Example: revisiting Bayesian regression with a full-rank guide" is not split into two lines
  • Probably add links to docs for each referenced class (rather than only the first one that appears), just in case some users jump to a random section and can still click through to the docs.

@fritzo (Member) commented Dec 14, 2021

It's really important to give users a good first impression. Inference easily converges with a fast learning rate: I see lower loss using Adam({"lr": 0.02}) and 1000 learning steps, in under 5 seconds.
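
A sketch of the setup being described, using a stand-in model in place of the tutorial's regression example (the model, guide, and data here are illustrative assumptions):

```python
import torch
import pyro
import pyro.distributions as dist
from pyro.infer import SVI, Trace_ELBO
from pyro.infer.autoguide import AutoNormal
from pyro.optim import Adam

# Stand-in model and data; the tutorial's regression model plays this role.
def model(data):
    loc = pyro.sample("loc", dist.Normal(0.0, 1.0))
    with pyro.plate("data", len(data)):
        pyro.sample("obs", dist.Normal(loc, 1.0), obs=data)

guide = AutoNormal(model)
data = torch.randn(100) + 3.0

# A faster learning rate and fewer steps, per the discussion above; multiple
# ELBO particles reduce gradient variance at a modest per-step cost.
svi = SVI(model, guide, Adam({"lr": 0.02}), loss=Trace_ELBO(num_particles=4))
losses = [svi.step(data) for _ in range(1000)]  # svi.step returns an ELBO loss estimate
```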

@eb8680 (Member, Author) commented Dec 14, 2021

@fritzo I am inclined to agree, given how slow 20000 steps is. How about adding @martinjankowiak's caveat about lowering learning rates in other models, or a link to SVI part 4? We can always change it to a safer value later if it becomes an issue.

@fritzo (Member) commented Dec 14, 2021

If you're worried about too-high learning rates, then the right solution is to add an example with a too-high learning rate and say "oops, the learning rate was too high and the loss didn't decrease, see this loss plot. let's try decreasing the learning rate".
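
That suggestion might look something like the following in the notebook (a sketch; the model, data, and exact learning rates are illustrative assumptions):

```python
import torch
import matplotlib.pyplot as plt
import pyro
import pyro.distributions as dist
from pyro.infer import SVI, Trace_ELBO
from pyro.infer.autoguide import AutoNormal
from pyro.optim import Adam

# Stand-in model and data; the tutorial's regression model plays this role.
def model(data):
    loc = pyro.sample("loc", dist.Normal(0.0, 1.0))
    with pyro.plate("data", len(data)):
        pyro.sample("obs", dist.Normal(loc, 1.0), obs=data)

data = torch.randn(100) + 3.0

def run_svi(lr, num_steps=1000):
    pyro.clear_param_store()  # start each run from scratch
    svi = SVI(model, AutoNormal(model), Adam({"lr": lr}), loss=Trace_ELBO())
    return [svi.step(data) for _ in range(num_steps)]

# Oops: with a too-high learning rate the loss fails to decrease...
plt.plot(run_svi(10.0), label="lr=10.0 (too high)")
# ...so let's try decreasing the learning rate.
plt.plot(run_svi(0.02), label="lr=0.02")
plt.xlabel("SVI step")
plt.ylabel("ELBO loss")
plt.legend()
plt.show()
```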

@eb8680 (Member, Author) commented Dec 14, 2021

> Probably add links to docs for each referenced class (rather than only the first one that appears), just in case some users jump to a random section and can still click through to the docs.

Agreed, this would be an improvement. I'll do this in a followup PR that doesn't block the release, because there are quite a lot of references and I need to figure out how to get Sphinx to render links whose names are formatted as code.

@fritzo (Member) commented Dec 14, 2021

Can we please remove the "Use JAX instead" statement before merging though? This is a silly thing to say in a first tutorial with mis-used optimization.

@eb8680 (Member, Author) commented Dec 14, 2021

> Can we please remove the "Use JAX instead" statement before merging though?

Sure, do you want to do this along with your fixes or should I?

@fritzo (Member) commented Dec 14, 2021

Sure, I can make some minor edits and push.

@fritzo (Member) commented Dec 14, 2021

@eb8680 I added a comment about convergence, tuned the optimizers, and encouraged users to look at the loss curve over time. However, I haven't been able to regenerate plots due to a missing seaborn.histplot, even after pip install -U seaborn; not sure what's wrong. Could you regenerate the plots?

@eb8680 (Member, Author) commented Dec 14, 2021

> However, I haven't been able to regenerate plots due to a missing seaborn.histplot, even after pip install -U seaborn; not sure what's wrong.

Hmm, this is concerning. FYI I have seaborn==0.11.2 and matplotlib==3.5.0 locally.

> Could you regenerate the plots?

Will do

@martinjankowiak (Collaborator)

can address later, but typo: 'espcially'

@eb8680 (Member, Author) commented Dec 14, 2021

I'll address the seaborn compatibility issues, documentation linking, and other typos in non-blocking followup PRs. Should be good to merge unless there are any obvious rendering issues.

@martinjankowiak merged commit 5a1d74e into dev on Dec 14, 2021.
@eb8680 deleted the new-intro-tutorial branch on December 14, 2021 at 17:15.