Add warning for varying simulator output sizes #370

@LarsKue

Description

Varying simulator output sizes commonly occur when the number of samples differs between calls to simulator.sample():

import numpy as np
import bayesflow as bf

def context(batch_size):
    n = np.random.randint(10, 101)
    return dict(n=n)

def prior():
    mu = np.random.normal()
    sigma = np.random.gamma(shape=2)
    return dict(mu=mu, sigma=sigma)

def likelihood(n, mu, sigma):
    y = np.random.normal(mu, sigma, size=n)
    return dict(y=y)

simulator = bf.make_simulator([prior, likelihood], meta_fn=context)

However, varying output sizes can trigger excessive compile times in JAX, where each distinct value of n triggers a recompilation. For a wide range of n, compilation can dominate the training time.
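A minimal sketch of the recompilation behavior (not from the issue, just an illustration): the Python body of a jitted function runs only while tracing, so a Python-side counter increments once per distinct input shape.

```python
import jax
import jax.numpy as jnp

trace_count = 0

@jax.jit
def summarize(y):
    # This Python code executes only during tracing,
    # which happens once per distinct input shape.
    global trace_count
    trace_count += 1
    return jnp.mean(y)

for n in (10, 50, 100, 50):  # 50 appears twice, but is only traced once
    summarize(jnp.zeros(n))

# three distinct shapes -> three traces/compilations
```

With many distinct values of n, nearly every batch would pay the compilation cost.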

The current best-practice fix for users is to use padded tensors:

def likelihood(n, mu, sigma):
    y = np.random.normal(mu, sigma, size=100)  # uses fixed maximum size
    y[n:] = 0  # set unused entries to zero, or some other placeholder value
    return dict(y=y)

When we detect that compile times dominate, we should emit a warning to the user with a suggested fix. We could also improve support for padded simulator output in general. Further, we could investigate whether there are better ways to mask out unused values than setting them to placeholder values as above.
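One possible mask-based variant (a sketch, not part of the issue; the max_n parameter and the returned mask key are assumptions): instead of only overwriting unused entries with a placeholder, the simulator could also return an explicit validity mask that downstream networks can use to ignore padded positions.

```python
import numpy as np

def likelihood(n, mu, sigma, max_n=100):
    # Always sample at the fixed maximum size so every output has the same shape.
    y = np.random.normal(mu, sigma, size=max_n)
    mask = np.arange(max_n) < n   # True for valid entries, False for padding
    y = np.where(mask, y, 0.0)    # zero out padded positions
    return dict(y=y, mask=mask.astype(np.float32))
```

A summary network could then, for example, multiply attention scores or per-sample embeddings by the mask, so the placeholder values never influence the result.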

Metadata
Assignees

No one assigned

    Labels

    efficiency (Some code needs to be optimized), user interface (Changes to the user interface and improvements in usability)
