
fix bug in ch3/linear_regression_tf.py #9

Open · wants to merge 1 commit into master

Conversation

@discoverkl commented Apr 19, 2018

I think there may be a bug due to the broadcasting rules. Please take a look.
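For illustration, here is a minimal NumPy sketch of the shape issue I suspect (the shapes are assumed from the chapter's setup; NumPy and TensorFlow follow the same broadcasting rules):

```python
import numpy as np

# Assumed shapes: y is a column vector (N, 1); y_pred comes out flat (N,).
N = 100
y = np.ones((N, 1))
y_pred = np.ones((N,))

# (N, 1) - (N,) broadcasts to (N, N): every label minus every prediction,
# so the "loss" sums N*N terms instead of N.
print((y - y_pred).shape)                      # (100, 100)

# The fix: make the shapes agree before subtracting.
print((y - np.reshape(y_pred, (N, 1))).shape)  # (100, 1)
```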

@rbharath (Collaborator)

Thanks for the PR! I think you might be right about the broadcasting error... I'll try rerunning this code with the fix to double-check on my end.

@RAvontuur

With the above fix, the system converges to the right solution.
After setting the noise to zero and initializing the system with w=5 and b=2, the loss is now zero (as expected), not some high positive value as it was before the fix.
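For reference, here is a minimal sketch of that sanity check (the data generation and variable names are illustrative, not the book's exact code):

```python
import numpy as np

# Zero-noise data generated by y = 5x + 2; with the model initialized
# at the true parameters, the squared loss should be exactly zero.
w_true, b_true = 5.0, 2.0
x = np.linspace(0.0, 1.0, 100).reshape(-1, 1)
y = w_true * x + b_true           # noise set to zero

w, b = 5.0, 2.0                   # initialize at w=5, b=2
y_pred = w * x + b                # same (100, 1) shape as y
loss = np.sum((y - y_pred) ** 2)
print(loss)                       # 0.0 after the fix
```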

Because of this, the following explanation in the book is not correct and requires an update:
'What happened on this system? Why didn’t TensorFlow learn the correct function despite being trained to convergence? This example provides a good illustration of one of the weaknesses of gradient descent algorithms. There is no guarantee of finding the true solution! The gradient descent algorithm can get trapped in local minima. That is, it can find solutions that look good, but are not in fact the lowest minima of the loss function'

Please add a corrected version of this explanation as a comment in the code; that will make it easier for future readers to understand.

@rbharath (Collaborator)

Thanks for the feedback here. We'll make sure to fix this bug in a future printing of the book.

As a quick note, the explanation isn't wrong, though. It's entirely common to see instability when training more complex models. It turns out that the behavior of this linear system is in fact stable after the bugfix, but there are a number of unstable nonlinear systems you will encounter in practice. We will add a note to explain this.

@hamelsmu (Contributor) commented Jul 18, 2018

You can also fix this bug by merging #17

I agree this is confusing/misleading for readers. When reading the book, I was very skeptical of this model not converging and set out to debug it. I noticed that if you keep training the model, the learned slope goes to zero (a flat line), which I found quite odd. That gave me the intuition that the loss function was ill-defined, because the loss kept decreasing even though the visualization of the learned model kept looking worse.

One idea: you could demonstrate in the book how to use TF eager execution to debug this situation, which is a useful thing to learn.
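For example, a rough sketch (assuming the TF 1.x API; eager execution is the default in TF 2.x):

```python
import tensorflow as tf

tf.enable_eager_execution()  # TF 1.x only; not needed in TF 2.x

# With eager execution you can inspect intermediate tensors directly,
# which makes the shape bug immediately visible.
y = tf.ones((100, 1))
y_pred = tf.ones((100,))
print((y - y_pred).shape)                 # (100, 100) -- the broadcasting bug
print(tf.reduce_sum((y - y_pred) ** 2))   # a suspiciously large "loss"
```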

cc: @hohsiangwu @ankushagarwal

@rbharath (Collaborator)

@hamelsmu Good suggestion! We will add a section on debugging this model in the next edition of the book. Our apologies again for letting this error slip through review.
