Store PopulationSizeHistory as a json list #274

hyanwong · 2023-06-15T10:54:14Z

It would be nice to store the passed-in PopulationSizeHistory object in the provenance of a tsdated tree sequence. I don't think it is a very large object, right? It just consists of 2 arrays, neither of which is likely to be millions of numbers long. Any reason why not to store it?

hyanwong · 2023-06-15T11:01:57Z

Sorry - forgot to ping @nspope : mainly a question for him.

See tskit-dev#274

nspope · 2023-06-15T16:48:28Z

I think this is a good idea! Not too large, it should be two smallish 1D arrays of breakpoints and sizes respectively.

hyanwong · 2023-06-20T22:26:24Z

One problem is that we can't store a Python class in the JSON format required by the provenance spec.

One way around this, as long as we are content with piecewise population size changes (I'm not sure if this is valid for variational inference) would be to allow the population_size parameter to take either a number, a PopulationSizeHistory object, or a list of 2 lists, which are passed as the parameters to PopulationSizeHistory, e.g.

date(ts, population_size=[[1000, 2000, 3000], [500, 2500]])

which would be equivalent to

date(ts, population_size=tsinfer.PopulationSizeHistory(*[[1000, 2000, 3000], [500, 2500]]))

Then either invocation would result in a provenance like

{
  "parameters": {
    "command": "date",
    "population_size": [[1000.0, 2000.0, 3000.0], [500.0, 2500.0]],
  }
}

What do you think @nspope ? Is it too ugly allowing a list of lists as an additional population_size option? How would it fit with the variational inference. I assume that since we haven't actually released a new version, we're free to change the API etc as we wish.

hyanwong · 2023-06-20T23:08:39Z

Or actually, it might be clearer as a dictionary that is unpacked into the PopulationSizeHistory constructor:

date(ts, population_size={"population_size": [1000, 2000, 3000], "time_breaks":[500, 2500]})

saved in provenance as

{
  "parameters": {
    "command": "date",
    "population_size": {"population_size":[1000.0, 2000.0, 3000.0], "time_breaks":[500.0, 2500.0]}
  }
}

nspope · 2023-06-21T01:06:13Z

I think the dict option is cleaner. Wrt how variable population size interfaces with the variational algorithm-- it should work perfectly fine with the current implementation. This might change depending on how the prior calibration is fixed (still a work in progress).

Fixes tskit-dev#274

hyanwong added a commit to hyanwong/tsdate that referenced this issue Jun 15, 2023

Add TODO

89162ad

See tskit-dev#274

hyanwong added a commit to hyanwong/tsdate that referenced this issue Jun 15, 2023

Add TODO

ce7400c

See tskit-dev#274

hyanwong added a commit to hyanwong/tsdate that referenced this issue Jun 21, 2023

Save PopulationSizeHistory params

4b33864

Fixes tskit-dev#274

hyanwong mentioned this issue Jun 21, 2023

Save PopulationSizeHistory params #283

Merged

hyanwong added a commit to hyanwong/tsdate that referenced this issue Jun 24, 2023

Save PopulationSizeHistory params

868e177

Fixes tskit-dev#274

hyanwong closed this as completed in #283 Jun 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Store PopulationSizeHistory as a json list #274

Store PopulationSizeHistory as a json list #274

hyanwong commented Jun 15, 2023 •

edited

Loading

hyanwong commented Jun 15, 2023

nspope commented Jun 15, 2023 •

edited

Loading

hyanwong commented Jun 20, 2023 •

edited

Loading

hyanwong commented Jun 20, 2023

nspope commented Jun 21, 2023

Store PopulationSizeHistory as a json list #274

Store PopulationSizeHistory as a json list #274

Comments

hyanwong commented Jun 15, 2023 • edited Loading

hyanwong commented Jun 15, 2023

nspope commented Jun 15, 2023 • edited Loading

hyanwong commented Jun 20, 2023 • edited Loading

hyanwong commented Jun 20, 2023

nspope commented Jun 21, 2023

hyanwong commented Jun 15, 2023 •

edited

Loading

nspope commented Jun 15, 2023 •

edited

Loading

hyanwong commented Jun 20, 2023 •

edited

Loading