Compare exp and softplus transform using synthetic data (cell2location model) #14

vitkl · 2021-04-12T09:04:27Z

This PR adds experiments comparing the effect of the exp and softplus transforms on cell2location model estimation (https://github.com/BayraktarLab/cell2location). ELBO stability and accuracy against ground truth (R^2, PR curves) are compared on synthetic data. The cell2location model is ported from pymc3 to both numpyro and pyro:

Pyro, 2021-03-softplus_scales/cell2location_model.py, 2021-03-softplus_scales/cell2location_synthetic_data.ipynb
Numpyro, 2021-03-softplus_scales/cell2location_model_numpyro.py, 2021-03-softplus_scales/cell2location_synthetic_data_numpyro.ipynb

The model is slightly different to the original pymc3 implementation:

Gamma(mu, sigma) had to be re-parameterised to Gamma(alpha, beta) because PyTorch and numpyro do not support Gamma(mu, sigma).
The Negative Binomial distribution is parameterised differently in all 3 cases: pymc3 uses NB(mu, alpha), where alpha is also called theta or total count; pyro uses NB(logits=log(mu) - log(alpha), total count); and numpyro uses GammaPoisson(alpha, beta=alpha / mu).
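
As a minimal sketch of the re-parameterisations listed above (not the code from this PR), the conversions can be checked with the standard library alone. The function names are hypothetical, and the formulas assume that Gamma(alpha, beta) means shape/rate and that the pyro total count equals the pymc3 alpha, as described above.

```python
import math


def gamma_mu_sigma_to_alpha_beta(mu, sigma):
    """Convert a Gamma mean/sd pair to shape (alpha) and rate (beta)."""
    alpha = mu ** 2 / sigma ** 2
    beta = mu / sigma ** 2
    return alpha, beta


def nb_pyro_params(mu, alpha):
    """pymc3 NB(mu, alpha) -> (total_count, logits) in the pyro convention."""
    return alpha, math.log(mu) - math.log(alpha)


def nb_numpyro_params(mu, alpha):
    """pymc3 NB(mu, alpha) -> (concentration, rate) for GammaPoisson."""
    return alpha, alpha / mu


# Sanity checks: every parameterisation should recover the same mean.
mu, sigma, alpha_nb = 4.0, 2.0, 5.0

shape, rate = gamma_mu_sigma_to_alpha_beta(mu, sigma)
gamma_mean = shape / rate        # mean of Gamma(shape, rate)
gamma_var = shape / rate ** 2    # variance of Gamma(shape, rate)

total_count, logits = nb_pyro_params(mu, alpha_nb)
# With p = sigmoid(logits), the NB mean is total_count * p / (1 - p),
# and p / (1 - p) = exp(logits).
nb_mean_pyro = total_count * math.exp(logits)

concentration, nb_rate = nb_numpyro_params(mu, alpha_nb)
# The GammaPoisson mean equals the mean of its Gamma mixing distribution.
nb_mean_numpyro = concentration / nb_rate
```

All three Negative Binomial forms share the variance mu + mu^2 / alpha, so the difference between the libraries is only in how the same distribution is specified.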

Three conditions are compared:

Exp used for all positive transformations
Softplus used for transforming AutoNormal scales.
Softplus used for all positive transformations

For numpyro, the results are as follows:

Exp leads to exploding ELBO (see plot below).
Softplus for scales improves the stability of ELBO (consistent with the findings reported in "Softplus transform as a more numerically stable way to enforce positive constraint", numpyro#855) but, surprisingly, has lower accuracy compared to pymc3. I am not sure what might be driving this. Note that the original experiments used numpyro version 0.4.1 without plates, and did not use pyro.
Softplus for all gives accuracy similar to the original.
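
One plausible intuition for the stability difference, offered as a sketch rather than as the analysis in the notebooks: the same additive step in unconstrained space multiplies an exp-transformed value by a constant factor, while a softplus-transformed value grows roughly linearly once it is large. The helper names below are hypothetical.

```python
import math


def exp_transform(x):
    """Map an unconstrained value to a positive one via exp."""
    return math.exp(x)


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def growth_ratio(transform, x, step):
    """Factor by which the positive value grows after an unconstrained step."""
    return transform(x + step) / transform(x)


# An optimiser step of +2 in unconstrained space, starting from x = 10.
ratio_exp = growth_ratio(exp_transform, 10.0, 2.0)   # always e**2, about 7.39
ratio_softplus = growth_ratio(softplus, 10.0, 2.0)   # close to 12 / 10

# Softplus stays finite for large inputs, where math.exp would overflow.
large_value = softplus(1000.0)
```

Under exp, a single overly large gradient step can inflate a scale by orders of magnitude, whereas softplus limits the change to roughly the size of the step itself.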

For pyro, for some reason, this implementation gives NaN in all three comparisons after just a few iterations. Any thoughts about potential solutions would be appreciated. Maybe I am using the plates interface incorrectly?


vitkl · 2021-04-13T16:35:53Z

The issue with pyro was overlapping plate dimensions; the plates are now replaced with expand/to_event (excluding obs_axis).

The analysis essentially confirms the same observation: only setting 3, which uses softplus for all positive transformations, retains the original accuracy (see the 2D histogram below for pymc3 and the notebooks for pyro and numpyro):

Exp leads to exploding ELBO (see plot below).
Softplus for scales improves the stability of ELBO (consistent with the findings reported in "Softplus transform as a more numerically stable way to enforce positive constraint", numpyro#855) but, surprisingly, has lower accuracy compared to pymc3 (seen especially clearly in the 2D histograms). Note that the original experiments used numpyro version 0.4.1 without plates, and did not use pyro.
Softplus for all positive transformations gives accuracy similar to the original.

@fritzo what do you think? Please let me know if you have any questions or if I need to add more explanation to the notebooks.

This model can be described as a GLM/non-negative factor analysis where factor loadings for variables are provided as fixed and the goal is to learn loadings for observations. I can also analyse ELBO stability using a model where factor loadings for variables also need to be learned. I expect the trend to be stronger there because the model is less constrained, but I do not have the same ground truth for evaluating accuracy.

vitkl · 2021-04-13T18:15:21Z

The same pattern (ELBO stability) and accuracy are also reproduced with 5x larger data.

fehiepsi · 2021-04-13T22:10:50Z

@vitkl Thanks for setting up the experiments! This is so cool. I would like to walk through this to better understand the behavior. I am a bit busy this week so will get back to you sometime early next week. :)

fritzo

Hi @vitkl, these experiments look great! @fehiepsi and I discussed and agreed that:

Your experiments are sufficient evidence to change autoguide scale parameters to use softplus transforms by default.
While it would be too big a change to alter the default transform for constraints.positive, we would be happy to add documentation on how to change the default, e.g. by adding it to the new Tips & Tricks tutorial.

fritzo · 2021-04-25T18:09:32Z

@vitkl is this ready to merge?

vitkl · 2021-04-26T00:58:34Z

Let me quickly check that the text in the notebooks makes sense.


fritzo · 2021-05-04T17:50:33Z

@vitkl mind if I merge this?

vitkl · 2021-05-10T02:57:02Z

Thanks for your patience! I just cleaned up the notebooks a bit. I think you can merge this.

vitkl added 11 commits March 24, 2021 10:39

draft analysis and model

4833d83

all transforms are unstable

14fa38f

tests using numpyro translation of cell2location

fbb15bb

pyro model, with all priors as tensors, but gives Inf ELBO in second …

c84156a

…iteration;

numpyro n_iter=50k + clear ELBO plot

8de86fd

support for modifying scales and loc init

ceed63c

added PR curves

a027b5d

numpyro model with init_scale exactly as in pymc3 (no effect)

e010455

added package versions

dbc2a46

error training for a few iterations

89bd0ec

added links to download data

99e8f4b

fritzo added the WIP label Apr 12, 2021

vitkl added 8 commits April 13, 2021 11:08

removed plates except obs plate

fe8ca2d

removed to_event in numpyro implementation

3b8e14e

Revert "removed to_event in numpyro implementation"

de2adf6

This reverts commit 3b8e14e.

added to_event statements + minor bug fix

2a913f6

replaced pyro plates with expand/to_event (excluding obs_axis)

e37a035

pyro reproduces numpyro results about stability of softplus

2a3f92f

black changes;

59a2974

add evaluation with PR curves;

d7b5a1b

deleted unnecessary text

20d8f07

fritzo approved these changes Apr 21, 2021


fritzo mentioned this pull request Apr 25, 2021

Switch to softplus transforms for autoguide scales pyro-ppl/pyro#2823

Merged

fritzo added awaiting review and removed WIP labels May 4, 2021

vitkl added 2 commits May 10, 2021 03:55

clean up notebooks

1812391

same test on 5x larger data

b8cb54b

fritzo changed the title ~~[WIP] Comparing exp and softplus transform using synthetic data (cell2location model)~~ Compare exp and softplus transform using synthetic data (cell2location model) May 10, 2021

fritzo merged commit 723ebd0 into pyro-ppl:master May 10, 2021
