Merging of ComponentArrays #69

scheidan · 2021-03-18T08:54:46Z

Thanks for the really useful package! It has helped a lot to clean up our model code.

One feature I miss is the ability to merge ComponentArrays. With named tuples we can do:

t1 = (a=1, b=2, c=3)
t2 = (a=111, d=444)
merge(t1, t2)            # (a = 111, b = 2, c = 3, d = 444)

I can emulate this behavior, but I doubt my implementation is optimal:

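using ComponentArrays

# Extend Base.merge (rather than defining a new `merge`) so the NamedTuple methods stay reachable.
# The two arguments get independent types: ComponentVectors with different fields differ in type.
function Base.merge(a::ComponentVector, b::ComponentVector)
    ComponentVector(merge(NamedTuple(a), NamedTuple(b)))
end
function Base.merge(a::ComponentVector, b::NamedTuple)
    ComponentVector(merge(NamedTuple(a), b))
end
function Base.merge(a::NamedTuple, b::ComponentVector)
    ComponentVector(merge(a, NamedTuple(b)))
end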

ca1 = ComponentVector(a=1, b=2, c=3)
ca2 = ComponentVector(b=22, d=44)

merge(ca1, ca2)     # ComponentVector{Int64}(a = 1, b = 22, c = 3, d = 44)
# also useful to add a parameter
merge(ca1, (;new=222)) # ComponentVector{Int64}(a = 1, b = 2, c = 3, new = 222)

# it works with ForwardDiff but not with Zygote
f(x) = sum(merge(ca1, x))
f(ca2)
Zygote.gradient(f, ca2)

A good use case would be optimizing some parameters while keeping others fixed:

foo(ca) = ca.a + ca.b + ca.c + ca.d
ca_fix = ComponentVector(a=1, b=2)
# optimize only parameters 'c' and 'd'
optim(ca_opt -> foo(merge(ca_fix, ca_opt))
      ...
    )
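
The fixed/free split can be sketched with plain NamedTuples, which already support `merge` in Base. This is only an illustration of the pattern, not ComponentArrays code: `p_fix`, `objective`, and the parameter values are made up for the example, and no real optimizer is called.

```julia
# Toy model with four named parameters (mirrors `foo` above)
foo(p) = p.a + p.b + p.c + p.d

# Parameters held fixed (hypothetical values)
p_fix = (a = 1, b = 2)

# Objective over the free parameters only; an optimizer would call this
objective(p_opt) = foo(merge(p_fix, p_opt))

objective((c = 3, d = 4))   # 10
```

A `merge` method for ComponentVectors would allow the same closure while keeping the flat-array layout that optimizers expect.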

jonniedie · 2021-03-21T04:06:02Z

There is sort of a way to handle this already: pass an existing ComponentArray to the constructor, with keyword arguments for the fields you want to merge. But I think a merge method is probably better. Here is how that sort of thing is currently done:

julia> ca1 = ComponentVector(a=1, b=2, c=3);

julia> ComponentArray(ca1; new=222, a=20)
ComponentVector{Int64}(a = 20, b = 2, c = 3, new = 222)

One of the problems with doing it this way is there is no easy way to splat new fields from another ComponentArray. merge would fix that. It's a little tricky to get a performant version of this. The speed of construction from a NamedTuple has been an open issue for a while now and this is a similar problem to that. This one should be a little easier to tackle, though.

scheidan · 2021-03-22T08:47:09Z

Thanks a lot, that's good to know!
Should I make a PR to add this to the quick start section?

Does the problem with splatting of new fields you mentioned relate to this:

ca = ComponentVector(a=(a1=1, a2=2), b=(b1=33, b2=44), c=555)
ComponentArray(ca; c = 5, new=222, a=(a2=33, a1=99))  # doesn't work

jonniedie · 2021-03-22T13:33:05Z

Yes, a PR would be much appreciated.

Yeah, that's part of it. Having a merge(x::NamedTuple, y::ComponentArray) and the reverse should fix that and other splatting issues. For example, you should be able to splat a ComponentArray after a semicolon in function arguments and have the fields splat out as if you were doing it with a NamedTuple. merge should fix that, I think.
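
For reference, this is the NamedTuple splatting behavior being described, using only Base Julia. The function `g` and its keyword names are hypothetical. Whether a ComponentArray can be splatted the same way depends on the merge methods discussed here.

```julia
# Hypothetical keyword function
g(; a, b, c = 0) = a + b + c

nt = (a = 1, b = 2)

g(; nt...)            # 3: fields of `nt` become keyword arguments
g(; nt..., c = 10)    # 13: extra keywords can follow the splat
g(; nt..., b = 20)    # 21: a later keyword overrides a splatted one
```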

scheidan · 2021-03-29T12:57:56Z

Just for reference: PropDicts.jl implements this kind of merge for dicts.

scheidan mentioned this issue Mar 23, 2021

document construction from existing ComponentArrays #73

Merged

scheidan mentioned this issue Feb 21, 2023

Implementation of a merge function #186

Open
