Test for valid use of (S)NPE API #24

psteinb · 2021-11-05T15:51:28Z

This PR attempts a solution to #23 without (yet) introducing test categories a la @pytest.mark.slow (see https://github.com/mackelab/sbi/blob/86256e02c1080965795e65062c4ab9d3a19015d2/tests/linearGaussian_snpe_test.py#L196)

psteinb · 2021-11-05T15:54:28Z

Btw, when run, this unit test results in:

        # For torch < 1.8 log_abs_det_jacobian is returned for each dimension.
        if vals.ndim > 1 and vals.shape[1] == dim_parameters:
            vals = vals.sum(-1)
    
>       assert (
            vals.numel == batch_size
        ), "Mismatch in batch size, took sum over whole batch?"
E       AssertionError: Mismatch in batch size, took sum over whole batch?

../../sbibm/utils/torch.py:105: AssertionError

which is related to #15 I guess
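
A side note on the assertion printed above, not a diagnosis of #15: if the source line really reads `vals.numel == batch_size` (without parentheses), it compares a bound method to an integer, which is always `False` regardless of the tensor's size. The sketch below illustrates this with a hypothetical stand-in class `FakeTensor`, so it runs without torch; `torch.Tensor.numel()` itself is a real method.

```python
class FakeTensor:
    """Hypothetical stand-in for a tensor; it only mimics numel()."""

    def __init__(self, values):
        self._values = list(values)

    def numel(self):
        # Number of elements, analogous to torch.Tensor.numel().
        return len(self._values)


vals = FakeTensor([0.1, 0.2, 0.3])
batch_size = 3

# Comparing the bound method itself to an int is never equal.
method_comparison = vals.numel == batch_size

# Calling the method compares the element count, as intended.
call_comparison = vals.numel() == batch_size
```

Whether this is what #15 tracks is not established in this thread; the sketch only shows why the comparison as printed cannot pass.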

jan-matthis · 2021-11-05T17:42:07Z

> which is related to #15 I guess

Yes, this is due to #15. @janfb will fix this at the beginning of next week, which should iron it out.

jan-matthis · 2021-11-12T08:37:27Z

tests/algorithms/test_snpe_posterior.py

+    [
+        (task_name, num_observation)
+        for task_name in ["gaussian_linear", "slcp",]
+        for num_observation in random.sample(range(1, 11), 2)


I'd prefer keeping the tests deterministic, i.e., let's just run them on observation 1 and 2 only if 10 is too slow. How long does this test take on your machine?
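
A minimal sketch of the deterministic variant suggested here, assuming the same two task names as in the diff above. The names `TASK_NAMES`, `NUM_OBSERVATIONS`, `build_cases`, and `CASES` are hypothetical and not taken from the PR.

```python
import itertools

TASK_NAMES = ["gaussian_linear", "slcp"]
# Fixed observations instead of random.sample(range(1, 11), 2),
# so every run exercises exactly the same cases.
NUM_OBSERVATIONS = [1, 2]


def build_cases(task_names, num_observations):
    """Return all (task_name, num_observation) pairs in a stable order."""
    return list(itertools.product(task_names, num_observations))


CASES = build_cases(TASK_NAMES, NUM_OBSERVATIONS)
```

Such a list could then be handed to `pytest.mark.parametrize("task_name,num_observation", CASES)`, which keeps the reported test IDs identical from run to run.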

With 2 observations and 2,500 simulated samples per task, all 6 tests together run for 220s on my box. That is still quite long, mostly because of the epochs NPE trains for. This is a chicken-and-egg problem: we trade simulation budget against the expressiveness of the predictor.
I'd propose ditching the SLCP task or at least one Gaussian task. Limiting the number of epochs would be ideal, but I guess that is impossible at the moment.

I boiled it down to 2 tasks and about 140s to complete (9e91818). That is still quite long. I guess the central question is what we want to achieve with this test. If it is really only about the interface, then one task with one observation is enough.

Yes, I think we only want to do interface testing as part of sbibm. I have just merged a PR (#33) that adds an argument to be able to set max_num_epochs explicitly. How about running the test for a low maximum number of epochs and only checking that the shape of the return is correct?
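
A rough sketch of what such an interface-only check could look like. Everything here is an assumption for illustration: `fake_run` is a hypothetical stand-in that does no training and merely accepts a `max_num_epochs` argument to mirror the one mentioned above, and `has_expected_shape` is a hypothetical helper; neither is the sbibm API.

```python
def has_expected_shape(samples, num_samples, dim_parameters):
    """Return True if samples is a num_samples x dim_parameters nested list."""
    if len(samples) != num_samples:
        return False
    return all(len(row) == dim_parameters for row in samples)


def fake_run(num_samples, dim_parameters, max_num_epochs):
    """Hypothetical stand-in for a posterior-sampling run (no training)."""
    # max_num_epochs is accepted only to mirror the real argument;
    # this stand-in ignores it and returns placeholder samples.
    return [[0.0] * dim_parameters for _ in range(num_samples)]


# Arbitrary small sizes chosen for the illustration.
samples = fake_run(num_samples=50, dim_parameters=2, max_num_epochs=30)
shape_ok = has_expected_shape(samples, num_samples=50, dim_parameters=2)
```

With real tensors, the equivalent check would presumably compare `samples.shape` against `(num_samples, dim_parameters)` rather than walk a nested list.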

Cool, these tests now run in about 30s on my box (e642d77). That is still long, but at least c2st appears to return something meaningful.

tests/algorithms/test_snpe_posterior.py

- ditch slcp task (posterior would need more samples and epochs to train)
- 2000 sim samples and 100 observations appear to be valid choices
- runs in ~140s on my box

jan-matthis · 2021-11-12T14:26:11Z

Cheers!

jan-matthis requested review from jan-matthis and janfb November 5, 2021 17:42

jan-matthis reviewed Nov 12, 2021

jan-matthis changed the title ~~SNPE tests for sbi in order to check valid use of API~~ Test for valid use of (S)NPE API Nov 12, 2021

psteinb force-pushed the snpe-tests-for-sbi branch from 8b0196b to 667092b Compare November 12, 2021 10:41

psteinb added 6 commits November 12, 2021 14:33

choose an observation randomly, less tests at same coverage

83f37ab

added fast unit test for sbi compatiblity

5a7a42d

limit ourselves to 2 observations only

86bb682

check for the shape and roughly for accuracy

641c0d5

optimize runtime of npe tests

c93535a

- ditch slcp task (posterior would need more samples and epochs to train)
- 2000 sim samples and 100 observations appear to be valid choices
- runs in ~140s on my box

fixing the number of epochs to 30 brings this test to 30s runtime

e642d77

psteinb force-pushed the snpe-tests-for-sbi branch from 6688571 to e642d77 Compare November 12, 2021 13:38

jan-matthis merged commit a9c20c8 into sbi-benchmark:main Nov 12, 2021
