Combine `regenerate` and `update` into an update method taking an UpdateSpec object #282

georgematheos · 2020-07-08T14:49:19Z

This PR builds upon #279 to change the signature of update to the following:

(new_tr, weight, retdiff, reverse_update_spec) = update(trace, args, argdiffs, update_spec::UpdateSpec, externally_constrained_addresses::Selection)

update_spec is a specific type of AddressTree. It can select some addresses to regenerate using the internal proposal distribution and constrain some addresses to given choices, or include address tree leaves of type CustomUpdateSpec, which specify an update that a custom generative function knows how to perform. As I currently conceive it, a custom update spec must be equivalent to some combination of selecting and constraining addresses. At some point I think we could reformalize this to relax that requirement, and use custom update specs to effectively let generative functions implement multiple internal proposal distributions.

externally_constrained_addresses is a selection containing every address whose value will be constrained by an external proposal distribution when the reverse move of this update is applied. This selection determines which weight the update function calculates: the weight will include the term Q[old_tr | get_selected(reverse_update_spec, externally_constrained_addresses)]. To implement the old update weight, we set externally_constrained_addresses = AllSelection(); to implement the old regenerate weight, we set externally_constrained_addresses = EmptySelection().

I have implemented syntactic sugar so the old calls to update and regenerate still work, by being translated into the correct call to the new update method.
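
To make the translation concrete, here is a toy sketch of how sugar of this kind could forward old-style calls to a unified method. Everything in it (ToyAll, ToyEmpty, unified_update, old_update, old_regenerate) is a hypothetical stand-in invented for illustration, not code from this branch; the stub only records which weight convention the externally constrained selection implies.

```julia
# Toy stand-ins; none of these are Gen's real types or methods.
abstract type ToySelection end
struct ToyAll <: ToySelection end     # plays the role of AllSelection()
struct ToyEmpty <: ToySelection end   # plays the role of EmptySelection()

# Stand-in for the unified method. It does no inference; it only reports
# which old weight convention the externally constrained selection implies.
unified_update(spec, ::ToyAll) = (spec = spec, weight_kind = :update)
unified_update(spec, ::ToyEmpty) = (spec = spec, weight_kind = :regenerate)

# Sugar: the old-style entry points forward to the unified method.
old_update(constraints::Dict) = unified_update(constraints, ToyAll())
old_regenerate(selected::Set) = unified_update(selected, ToyEmpty())

r1 = old_update(Dict(:slope => 1.5))   # constrain :slope to a value
r2 = old_regenerate(Set([:slope]))     # select :slope for regeneration

println(r1.weight_kind)
println(r2.weight_kind)
```

The point of the sketch is only that the caller's choice of externally constrained selection, not a separate method name, decides which weight is computed.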

I have changed the dynamic DSL, the static DSL, and the CallAt, Map, and Unfold combinators to use this new update method. I have not changed Recurse yet.

This is still a WIP and I need to document these changes and do some performance engineering. I am putting this online since others may need access to this branch to use some of my open universe inference tools.

notes:

Recurse did not have an implementation for regenerate, and I have not yet implemented the fully general update function for it.
Looks like there is currently a 2x slowdown for the static DSL inference benchmark; I have a few ideas where this might be coming from and will try to fix it once I get a chance.

TODOS:

Add documentation (for addresstree and the new update interface)
Performance
Add testing for updates which select some addresses and constrain others
Add a new variant of metropolis_hastings which allows for simultaneous selection and constraints

Resolves #266, resolves #279, resolves #259, resolves #258, resolves #189, resolves #274, resolves #263

georgematheos · 2020-07-09T15:30:39Z

Made a couple of performance improvements; benchmarks are below. The static DSL looks slightly faster than on the master branch, while the dynamic DSL is noticeably slower (roughly 7.3 s versus 5.1 s per run). Asymptotic performance does not seem to be significantly affected in these simple tests.

The slowdown for the dynamic DSL appears to have been introduced by my initial PR, which introduced the concept of a "Value Choice Map". Performance on dynamic DSL inference did not degrade (if anything, it slightly improved) when I made distributions generative functions, implemented addresstrees, and changed the update signature.

Benchmarks:

This PR:

Simple static DSL (including CallAt nodes) MH on regression model:
  0.307637 seconds (4.13 M allocations: 302.513 MiB, 14.84% gc time)
  0.288067 seconds (4.13 M allocations: 302.513 MiB, 13.35% gc time)

Simple dynamic DSL MH on regression model:
  7.253488 seconds (87.12 M allocations: 4.507 GiB, 11.41% gc time)
  7.367530 seconds (87.12 M allocations: 4.507 GiB, 11.71% gc time)

georgematheos@Georges-MBP-3 benchmarks % julia run_benchmarks.jl
Simple static DSL (including CallAt nodes) MH on regression model:
  0.326150 seconds (4.13 M allocations: 302.513 MiB, 16.59% gc time)
  0.317942 seconds (4.13 M allocations: 302.513 MiB, 14.69% gc time)

Simple dynamic DSL MH on regression model:
  7.357534 seconds (87.12 M allocations: 4.507 GiB, 11.98% gc time)
  7.435310 seconds (87.12 M allocations: 4.507 GiB, 12.35% gc time)

# asymptotics check:
Simple static DSL (including CallAt nodes) MH on regression model - 5x as many data points:
  1.420621 seconds (20.91 M allocations: 1.895 GiB, 11.45% gc time)
  1.415851 seconds (20.91 M allocations: 1.895 GiB, 12.54% gc time)

Simple dynamic DSL MH on regression model - 1/5 as many data points:
  0.419291 seconds (4.92 M allocations: 253.056 MiB, 12.15% gc time)
  0.421394 seconds (4.92 M allocations: 253.056 MiB, 11.92% gc time)

Master branch:

Simple static DSL (including CallAt nodes) MH on regression model:
  0.359577 seconds (4.35 M allocations: 309.954 MiB, 13.78% gc time)
  0.329181 seconds (4.35 M allocations: 309.954 MiB, 12.14% gc time)

Simple dynamic DSL MH on regression model:
  5.208065 seconds (68.65 M allocations: 4.524 GiB, 16.93% gc time)
  5.033524 seconds (68.65 M allocations: 4.524 GiB, 16.42% gc time)

georgematheos@Georges-MBP-3 benchmarks % julia run_benchmarks.jl
Simple static DSL (including CallAt nodes) MH on regression model:
  0.339774 seconds (4.35 M allocations: 309.954 MiB, 11.95% gc time)
  0.331594 seconds (4.35 M allocations: 309.954 MiB, 10.95% gc time)

Simple dynamic DSL MH on regression model:
  5.200960 seconds (68.65 M allocations: 4.524 GiB, 16.24% gc time)
  5.014927 seconds (68.65 M allocations: 4.524 GiB, 15.83% gc time)

# asymptotics check:
Simple static DSL (including CallAt nodes) MH on regression model - 5x as many data points:
  1.621701 seconds (21.96 M allocations: 1.928 GiB, 13.03% gc time)
  1.598446 seconds (21.96 M allocations: 1.928 GiB, 12.40% gc time)

Simple dynamic DSL MH on regression model - 1/5 as many data points:
  0.315384 seconds (3.91 M allocations: 256.996 MiB, 17.18% gc time)
  0.300466 seconds (3.91 M allocations: 256.996 MiB, 14.59% gc time)
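
For a rough sense of scale, the timings above can be averaged and compared directly. The snippet below is a quick sketch, assuming the four quoted runs per configuration are comparable samples; the variable names are made up for illustration, and the numbers are copied from the output above.

```julia
# Wall-clock seconds copied from the benchmark output above
# (base-size regression model only, both invocations of run_benchmarks.jl).
pr_static      = [0.307637, 0.288067, 0.326150, 0.317942]
master_static  = [0.359577, 0.329181, 0.339774, 0.331594]
pr_dynamic     = [7.253488, 7.367530, 7.357534, 7.435310]
master_dynamic = [5.208065, 5.033524, 5.200960, 5.014927]

# Plain arithmetic mean; defined locally so no packages are needed.
avg(xs) = sum(xs) / length(xs)

# Ratio of mean run time on this PR to mean run time on master.
static_ratio  = avg(pr_static) / avg(master_static)
dynamic_ratio = avg(pr_dynamic) / avg(master_dynamic)

println("static DSL  PR/master: ", round(static_ratio, digits = 2))
println("dynamic DSL PR/master: ", round(dynamic_ratio, digits = 2))
```

A ratio above 1 means this PR is slower than master on that benchmark, and a ratio below 1 means it is faster.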

georgematheos · 2020-07-30T17:33:26Z

As a TODO: I realize that the current implementation of distributions as generative functions is incompatible with the distribution DSL, since it assumes distributions are fully defined by their type information. This should be a quick fix.

georgematheos added 30 commits May 17, 2020 12:24

first draft of core functionality

23f79a8

add support for address schemas

c9b1d49

update choicemap docs

1e0a589

refactoring and tests

623bc8f

performance improvements and benchmarking

83349c7

benchmark for dynamic choicemap lookups

b9b5312

inline dynamicchoicemap methods

bce5e77

remove old version benchmark file

a985f9b

minor testing cleanup

1f5029c

ensure valuechoicemap[] syntax works

eb6adf7

provide some examples in the documentation

eef9417

fix some typos

a83adfb

add phrase 'nesting level zero' to docs

1bd705f

distribution <: GenFn; dynamic DSL simplification

676828b

simplify static ir code

5bf4207

brief documentation for Dist <: GenFn

61673a4

short map over distribution test

298a333

default static_get_submap = EmptyChoiceMap

e34875a

default static_get_submap = EmptyChoiceMap

972d455

dist performance improvements

ee64d12

minor performance improvement

fd1991f

performance improvement related to zip bug

c3d5db0

Merge branch '20200516-georgematheos-valuechoicemaps' into 20200617-georgematheos-distributionsasgenfns

f652346

better static retdiff checking

8a43845

add static info for dist trace type

ffd9373

don't use static get_submap for staticchoicemap

67d5e12

some simple MH benchmarks

4966ea9

bug fix

0909a5b

remove ChoiceAt; bug fixes

47cca59

decrease iters on benchmark

10df952

georgematheos added 3 commits July 8, 2020 16:11

bug fixes with addresstrees

3645830

generate(, , ::Selection) --> generate(, , EmptyChoiceMap())

78819ff

bug fixes & cleanup

7c69f4d

georgematheos added 7 commits July 9, 2020 14:11

bug fixes and additional tests

7844fd4

Merge remote-tracking branch 'upstream/better-macro-support' into 20200706-georgematheos-updatespec

cdb0ce4

SelectionLeaf & InvertedSelection

a320e13

bug fix

cea1b66

has_value for any address tree; UnderlyingChoices; generate(,,::AddrTree{Union{val, sel}})

481d891

bug fixes

b15d6c1

invert selection schemas

ad93f20

georgematheos mentioned this pull request Jul 29, 2020

Core name and syntax changes #42

Open

georgematheos added 16 commits July 31, 2020 15:36

add update(disttr, _, _, Val, ::Empty)

8da07fe

bug fix

f339474

getproperty diffs; update fewer julia nodes in static DSL using diffs

aa24dba

error check on set_value

a80e9d4

fix deep_dynamic_copy bug

5354537

small improvements to diffs

f104677

bug fix

c0d864a

update custom_determ for new update

e820035

new diff tests

015f484

dist bug fix

dab37e6

bug fixes in equality

8a92051

change bug behavior on overwriting values

4cda43a

initial impls for setmap, multiset

1b3b824

fix method overwrite issues

24fa021

add setmap tests

05f974c

minor changes & bug fixes

d620d29

georgematheos mentioned this pull request Nov 23, 2020

(Ready for review): Switch combinator #334

Merged
