
Seed HSGP bootstrap tests #6679

Draft · wants to merge 2 commits into main

Conversation

@ricardoV94 (Member) commented Apr 18, 2023

Failed in #6678


📚 Documentation preview 📚: https://pymc--6679.org.readthedocs.build/en/6679/

@ricardoV94 added the labels tests, needs info (Additional information required), and GP (Gaussian Process) on Apr 18, 2023
@codecov bot commented Apr 18, 2023

Codecov Report

Merging #6679 (71e663e) into main (1ed4475) will decrease coverage by 2.47%.
The diff coverage is n/a.

Additional details and impacted files:

@@            Coverage Diff             @@
##             main    #6679      +/-   ##
==========================================
- Coverage   91.99%   89.52%   -2.47%     
==========================================
  Files          94       94              
  Lines       15944    15944              
==========================================
- Hits        14667    14274     -393     
- Misses       1277     1670     +393     

see 5 files with indirect coverage changes

@bwengals (Contributor) commented:

Weird that it still fails with a different seed. Maybe very unlucky...? I couldn't get it to fail in the original PR. Does it fail for, like, any seed?

@ricardoV94 (Member, Author) commented:

My last commit tries 5 seeds, and it fails on one of them. The failure rate seems higher than 0.01: even with 5 tests, it should only fail about 5% of the time.
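As a sanity check on that figure, assuming each seeded run is an independent test with a 0.01 false-positive rate (an assumption, not something verified here):

# If each seeded test fails spuriously with probability 0.01, the chance
# that at least one of 5 independent runs fails is 1 - 0.99**5.
alpha = 0.01
n_seeds = 5
p_at_least_one_failure = 1 - (1 - alpha) ** n_seeds
print(round(p_at_least_one_failure, 3))  # 0.049, i.e. roughly 5%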

@bwengals (Contributor) commented:

Yup, something's wrong, will pull and give it a try.

Comment on lines +200 to +202
        seeds = np.arange(5) + 197  # 5 possible seeds
        rng = np.random.default_rng(np.random.choice(seeds))

Contributor:

It seems cleaner to use @parametrize, like for test_prior?

Member (Author):

The @parametrize was just to show which seed caused the test to fail. The idea was to revert to choosing one of 5 possible seeds once the root cause was found.

I imagine we don't want to run each test 5 times, but instead allow some variation across tests.
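For illustration, a minimal sketch of the two approaches (pytest's decorator is spelled pytest.mark.parametrize; the test bodies and fixtures are elided):

import numpy as np
import pytest

# Option 1: parametrize over the seeds. Each seed becomes its own test,
# so a failing seed shows up directly in the test ID.
@pytest.mark.parametrize("seed", [197, 198, 199, 200, 201])
def test_prior_parametrized(seed):
    rng = np.random.default_rng(seed)
    ...

# Option 2: draw one of the 5 seeds at random per run. A single test,
# with some variation across CI runs.
def test_prior_random_seed():
    seeds = np.arange(5) + 197  # 5 possible seeds
    rng = np.random.default_rng(np.random.choice(seeds))
    ...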

"""Compare HSGP prior to unapproximated GP prior, pm.gp.Latent. Draw samples from the
prior and compare them using MMD two sample test. Tests both centered and non-centered
parameterizations.
"""
rng = np.random.default_rng(seed)

with model:
    hsgp = pm.gp.HSGP(m=[200], c=2.0, parameterization=parameterization, cov_func=cov_func)
Contributor:

Try increasing m here to 500.

with model:
    hsgp = pm.gp.HSGP(m=[200], c=2.0, parameterization=parameterization, cov_func=cov_func)
    f1 = hsgp.prior("f1", X=X1)

    gp = pm.gp.Latent(cov_func=cov_func)
    f2 = gp.prior("f2", X=X1)

-   idata = pm.sample_prior_predictive(samples=1000)
+   idata = pm.sample_prior_predictive(samples=1000, random_seed=rng)
Contributor:

I think it's OK to decrease samples here to 500; it will run faster, which will be nice for trying the different random seeds.
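For context, the "MMD two sample test" in the docstring compares these prior draws from the HSGP against draws from the exact pm.gp.Latent prior. Below is a minimal sketch of an unbiased RBF-kernel squared-MMD estimator, assuming x and y are (n_samples, n_points) arrays of draws; it is illustrative only and may differ from the helper the test suite actually uses:

import numpy as np

def rbf_mmd2(x, y, lengthscale=1.0):
    # Unbiased estimate of squared MMD between sample sets x and y
    # under an RBF kernel. Illustrative sketch, not the test's helper.
    def k(a, b):
        sq_dists = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
        return np.exp(-sq_dists / (2 * lengthscale**2))

    kxx, kyy, kxy = k(x, x), k(y, y), k(x, y)
    n, m = len(x), len(y)
    # The unbiased estimator drops the diagonal (self-similarity) terms.
    return (
        (kxx.sum() - np.trace(kxx)) / (n * (n - 1))
        + (kyy.sum() - np.trace(kyy)) / (m * (m - 1))
        - 2 * kxy.mean()
    )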

@@ -194,17 +197,20 @@ def test_conditional(self, model, cov_func, X1, parameterization):
prior and compare them using MMD two sample test. Tests both centered and non-centered
parameterizations. The conditional should match the prior when no data is observed.
"""
+   seeds = np.arange(5) + 197  # 5 possible seeds
+   rng = np.random.default_rng(np.random.choice(seeds))

with model:
    hsgp = pm.gp.HSGP(m=[100], c=2.0, parameterization=parameterization, cov_func=cov_func)
Contributor:

Try increasing m here to 500 too. This got rid of the random failures for me locally.

@bwengals (Contributor) left a comment:

Increasing m seems to have fixed things for me locally. Then, decreasing the number of samples in pm.sample_prior_predictive helps it go a bit faster. I think that should fix it.
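For context on why this helps: in pm.gp.HSGP, m sets the number of basis functions per input dimension, so a larger m gives a closer approximation to the exact pm.gp.Latent prior and leaves the MMD test less approximation error to flag. A sketch of the suggested settings, reusing the variable names from the diff above:

with model:
    # More basis functions -> tighter HSGP approximation to the exact GP.
    hsgp = pm.gp.HSGP(m=[500], c=2.0, parameterization=parameterization, cov_func=cov_func)
    f1 = hsgp.prior("f1", X=X1)

    gp = pm.gp.Latent(cov_func=cov_func)
    f2 = gp.prior("f2", X=X1)

    # Fewer prior draws -> faster runs when trying multiple seeds.
    idata = pm.sample_prior_predictive(samples=500, random_seed=rng)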

Labels: GP (Gaussian Process), needs info (Additional information required), tests
2 participants