Ye and I just looked at the wiki. Thanks for the new method! We have a few questions about it.
You mentioned that "α_n and D_n need to approximate the inverse of nI(θ)". We were wondering why the inverse of nI(θ) would be the optimal learning rate.
It would also be helpful if you could point us to the literature on the method for approximating the inverse of nI(θ). (BTW, there might be a typo in "Take the inverse-square of all components Gi <- Gi^2": should we take the inverse of Gi before squaring it, i.e. Gi <- Gi^-2?)
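To make sure we are reading that step correctly, here is a minimal Python sketch of our understanding (the function and variable names are ours, not from the wiki): since I(θ) = E[g gᵀ] for gradients g of the log-likelihood, accumulating squared gradient components over iterates gives a diagonal estimate of nI(θ), whose component-wise inverse would then play the role of D_n.

```python
import numpy as np

def diag_inverse_fisher(grads):
    """Sketch of our reading (not the wiki's code): `grads` is an (n, p)
    array whose t-th row is the gradient of the log-likelihood at iterate t.
    Since I(theta) = E[g g^T], the accumulated squared components
    G_i = sum_t g_{t,i}^2 estimate the i-th diagonal entry of n I(theta)."""
    G = np.sum(grads ** 2, axis=0)  # G_i ~ n * I_ii(theta)
    return 1.0 / G                  # component-wise inverse: diagonal approx of (n I(theta))^{-1}
```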
Is the iterative method for calculating the learning rate also applicable to a 1-dim learning rate?
It is a theoretical result that if one uses the inverse of nI(θ*) as the learning rate, then SGD is optimal, i.e. it achieves the same asymptotic variance as the MLE. I just added two papers about this to the "literature" Dropbox folder.
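For reference, the statement is roughly the following (my paraphrase of those papers, with θ* denoting the true parameter):

```latex
% With learning-rate matrix D_n = n^{-1} I(\theta^*)^{-1}, the SGD iterate
% \theta_n attains the Cramer-Rao lower bound asymptotically:
\sqrt{n}\,(\theta_n - \theta^*) \;\xrightarrow{d}\; \mathcal{N}\!\bigl(0,\, I(\theta^*)^{-1}\bigr),
% which is the same limiting distribution as the MLE; other choices of
% D_n inflate the asymptotic variance.
```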
It is, but the method with multiple learning rates will be more efficient. In the experiments we can try simply using the norms of the gradients of the log-likelihood as a 1-dim learning rate; see the sketch below.
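A minimal sketch of what I have in mind (names are placeholders): collapse the per-component statistic into a single scalar by accumulating squared gradient norms, which estimates n · tr I(θ) rather than the full matrix, so its inverse is only a crude scalar proxy for the optimal rate.

```python
import numpy as np

def scalar_learning_rate(grads):
    """Sketch of the 1-dim variant: accumulate squared norms of the
    log-likelihood gradients, sum_t ||g_t||^2 ~ n * trace(I(theta)),
    and use the inverse as a single scalar learning rate."""
    G = sum(float(np.dot(g, g)) for g in grads)
    return 1.0 / G
```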
I believe the user should have the following options for the learning rate.
I suggest we work on options 2, 3, and 4 for now.
We can add the rest as we go. Any thoughts?