better update! interface #76

denizyuret · 2017-02-12T08:37:52Z

I was thinking of the update! interface. The requirement of doing individual updates to each numeric weight array is counter-intuitive even if we better documented it. People will naturally want a single update! for the whole model:

(weights, params) = update!(weights, params, grads)

The problem is we do not know the structure of the weights, params, or grads (Array, Tuple, Dict, other Type etc.) What if we require the minimum common interface for all three to be iterators i.e. that they only support for ... in ...? Then, update! can use zip to iterate over them:

for (w,p,g) in zip(weights, params, grads)
  ...
end

We probably need to think about what, if anything, to return. If we just guarantee that the returned values are also iterators, would that be sufficient?

The text was updated successfully, but these errors were encountered:

denizyuret · 2017-02-17T19:51:12Z

The docs indicate constructors like Momentum(), in actual code you require Momentum(w). Should we go back to Momentum()?

denizyuret · 2017-02-17T19:53:43Z

I am not sure if the functional interface: (w,p)=update!(w,g,p) is worth supporting. The original motivation was scalar parameters. But you use axpy! so you don't support this anyway?

denizyuret · 2017-02-17T19:56:10Z

There should be a update!(w,g;lr=0.1) which defaults to Sgd. This will come in handy if w,g are iterators, and we just want to apply a simple Sgd update to all without having to construct separate optimization objects.

denizyuret · 2017-02-17T20:01:04Z

Since all current update! methods require a well typed prms argument, the untyped update!(w,g,p) can be used to implement the iterator method. In fact this will support iterators within iterators (embedded arrays). The only thing it won't support is w::Dict, which requires its own method. In each case we require w,g,p to have parallel structure.

denizyuret · 2017-02-17T20:35:04Z

Do we have the Nesterov version of Momentum or the regular? Maybe we can make that a flag...

denizyuret · 2017-02-17T22:12:08Z

I am working on this in the newupdate branch.

ozanarkancan · 2017-02-18T11:57:30Z

We use the interface (w,p)=update!(w,g,p) to allow further developments. We don't want everyone change their own code when we change our inner code.

We don't have any nesterov...

denizyuret · 2017-02-19T17:22:00Z

The new calls have been integrated. The problem with the (w,p)=update!(w,g,p) syntax is (1) either we are modifying in-place, in which case it is not necessary, (2) or we are constructing new w,p in which case we need to match their types to the originals, i.e. dictionary, tuple, array, struct etc. I don't think it is worth the trouble.

denizyuret assigned denizyuret and ozanarkancan Feb 12, 2017

denizyuret added the enhancement label Feb 12, 2017

denizyuret closed this as completed Feb 19, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

better update! interface #76

better update! interface #76

denizyuret commented Feb 12, 2017

denizyuret commented Feb 17, 2017

denizyuret commented Feb 17, 2017

denizyuret commented Feb 17, 2017

denizyuret commented Feb 17, 2017

denizyuret commented Feb 17, 2017

denizyuret commented Feb 17, 2017

ozanarkancan commented Feb 18, 2017

denizyuret commented Feb 19, 2017

better update! interface #76

better update! interface #76

Comments

denizyuret commented Feb 12, 2017

denizyuret commented Feb 17, 2017

denizyuret commented Feb 17, 2017

denizyuret commented Feb 17, 2017

denizyuret commented Feb 17, 2017

denizyuret commented Feb 17, 2017

denizyuret commented Feb 17, 2017

ozanarkancan commented Feb 18, 2017

denizyuret commented Feb 19, 2017