Cannot use scipy.optimize.minimize method='CG' to run gradient descent #1

oeyh · 2018-07-28T04:26:02Z

This is in assignment3 section 1.4 function oneVsAll()

result = minimize(lrCostFucntion, theta0, args=(X, ylabel, lmd), method='TNC', jac=True, options={'disp': True, 'maxiter':1000})

Method='CG' should work, too, or even faster. But there's error.

Suspect a bug inside minimize function, it seems to change the shape of theta in the process, causing dimension issue when trying to do matrix multiplication.

oeyh · 2018-09-03T06:01:05Z

Further observation: method='CG' works for all cases i in range(9) except i=8, giving error: shapes (5000,401) and (1,1,401) not aligned: 401 (dim 1) != 1 (dim 1)

oeyh · 2018-09-08T03:49:24Z

Seems to be a bug in scipy.

oeyh · 2018-09-09T04:58:36Z

Observation: in certain circumstances, scipy.optimize.minimize(... method='CG'...) will add dimension to x0 (here x0=theta0) and thus making matrix multiplication in cost function error out.

Temporary workaround: in my cost function, ravel theta0 first, make sure it is 1D; then add proper dimension to it to make it a 2D array (column vector).

More comments:

I dived deep into scipy's source codes hoping to find evidence that it adds dimension to x0 by mistake but couldn't. The source codes are still too hard for me to read. In the future, if possible, I'd like to find evidence, submit test report and maybe even try to fix it and submit pull request.....
CG refers to conjugate gradient method, for more info, take a look at wikipedia page: https://en.wikipedia.org/wiki/Conjugate_gradient_method

oeyh self-assigned this Sep 2, 2018

oeyh added a commit that referenced this issue Sep 3, 2018

worked on issue #1

359e67d

oeyh added wontfix This will not be worked on on hold and removed wontfix This will not be worked on labels Sep 8, 2018

oeyh added a commit that referenced this issue Sep 9, 2018

debugged and fixed issue #1

91ded4e

oeyh closed this as completed Sep 9, 2018

oeyh removed the on hold label Sep 9, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cannot use scipy.optimize.minimize method='CG' to run gradient descent #1

Cannot use scipy.optimize.minimize method='CG' to run gradient descent #1

oeyh commented Jul 28, 2018

oeyh commented Sep 3, 2018 •

edited

Loading

oeyh commented Sep 8, 2018

oeyh commented Sep 9, 2018 •

edited

Loading

Cannot use scipy.optimize.minimize method='CG' to run gradient descent #1

Cannot use scipy.optimize.minimize method='CG' to run gradient descent #1

Comments

oeyh commented Jul 28, 2018

oeyh commented Sep 3, 2018 • edited Loading

oeyh commented Sep 8, 2018

oeyh commented Sep 9, 2018 • edited Loading

oeyh commented Sep 3, 2018 •

edited

Loading

oeyh commented Sep 9, 2018 •

edited

Loading