Update case studies to pymc v5 #78

vandalt · 2024-03-06T14:00:00Z

Hi!

This PR is a first attempt to update all case studies to the PyMC v5 version of exoplanet.

Most changes went smoothly, but here are a few things worth noting:

- The estimate_inverse_gamma_parameters() function had been removed from pymc-ext. I added it back (see pymc-ext#46, "Add back estimate_inverse_gamma_parameters() in utils").
- In PyMC v5, the observed argument must be static data and cannot depend on other parts of the model.
  - This affected the eccentricity priors when eccentricity is a derived parameter. I modified the two priors to use a pm.Potential() so that they work when eccentricity is derived from other parameters. This change is in exoplanet#309 ("Support PyMC v5").
  - It also affected GPs where the mean function was subtracted directly from the data to fit the residuals, instead of being passed to the GP as the mean argument. I modified all GPs to be specified in the latter way, and updated the related sigma-clipping and plotting code as well.
- At least two notebooks were significantly slower than their PyMC3 versions (on my laptop, at least): the eclipsing binary one and the multi-instrument transit one.
  - For the eclipsing binary one, I noticed that even in PyMC3, the optimization gave NaN logps. In PyMC v5, this raised errors, so I had to modify the optimization a bit. The final sampling results are the same, but MCMC was slower to run on my machine by a factor of ~4.
  - For the multi-instrument transit one, I'm not sure what the issue is. The KeplerianOrbit issues a warning that ror should be specified (which I think would require the two instruments to each have their own orbit, with all parameters shared except ror). The MAP result is also not exactly the same as in PyMC3. The sampling took a couple of hours. Results are again the same as with PyMC3, just much slower.
  - A few things common to the two notebooks that could help further investigation: they both include light curves with fairly large datasets, they both use a GP on those light curves, and they both use sigma clipping with masked data.

I rarely have to fit light curves, and I've mostly been using JAX recently (though I do like to have PyMC models available as an option), so I don't think I'll have the time or interest to dive further into the speed issues. Still, I think this PR is a good first step towards getting all case studies working with PyMC v5.

Thanks!

The main change is that residuals depending on the light curve model can no longer be used as "obs" for the GP. The simplest fix is to use the data as "obs" and pass the light curve as the GP mean. This required minor changes to the model plots and the sigma-clipping code.


vandalt and others added 15 commits March 1, 2024 23:01

Update RV tutorial to PyMC v5

cf2eca1

Update transit tutorial to PyMC v5

a9d05c2

Update astrometric model to PyMC v5

1181a14

Update stellar variability tutorial to PyMC v5

46449fc

Remove leftover todo in astrometric notebook (and format)

13fbfca

Remove leftover todo in joint RV+Transit notebook

7274646

Update TESS tutorial to PyMC v5

d68adc5

Update quick TESS notebook to PyMC v5

9da14fc

Update TTV tutorial to PyMC v5

089f41a

Update multi-instrument RV tutorial to PyMC v5

4dd60f7

Update transit+activity tutorial to PyMC v5

75325fe

Update eclipsing binary tutorial to PyMC v5

e53fbd0

Optimization was hard to get to converge (NaN logps in PyMC3 as well). MCMC was much slower than the PyMC3 version.

Update multi-instrument light curve tutorial to PyMC v5

e076871

[pre-commit.ci] auto fixes from pre-commit.com hooks

6040ac6

for more information, see https://pre-commit.ci
