Adding checks for broken bottleneck files #7131

rizasif · 2017-01-29T10:11:40Z

An exception is raised when a broken cached bottleneck files is read. Such a file can be created as a result of sudden system failure.

This commit refers the issue #2296 (#2296). Although closed but the problem still persists. Please view comments by @rizasif92 at the end. The changes contain nothing but error handling.

An exception is raised when a broken cached bottleneck files is read. Such a file can be created as a result of sudden system failure. This commit refers the issue tensorflow#2296

tensorflow-jenkins · 2017-01-29T10:11:41Z

Can one of the admins verify this patch?

googlebot · 2017-01-29T10:11:42Z

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed, please reply here (e.g. I signed it!) and we'll verify. Thanks.

If you've already signed a CLA, it's possible we don't have your GitHub username or you're using a different email address. Check your existing CLA data and verify that your email is set on your git commits.
If you signed the CLA as a corporation, please let us know the company's name.

rizasif · 2017-01-29T10:21:01Z

I signed it!

googlebot · 2017-01-29T10:21:02Z

CLAs look good, thanks!

petewarden

Thanks @rizasif , I really appreciate you putting this fix together! Could I request a slightly different approach though? I would prefer to keep the get_or_create_bottleneck() function always returning a bottleneck.

To ensure that, we could move the section nested inside the if not os.path.exists(bottleneck_path): condition into a new function, create_bottleneck_file. Then we can call it from that condition, but also have a flag that catches float exceptions when reading and calls it in that case too. Here's some untested code to try and show what I mean:

  if not os.path.exists(bottleneck_path):
    create_bottleneck_file(...)
  did_hit_error = False
  try:
    bottleneck_values = [float(x) for x in bottleneck_string.split(',')]
  except:
    print("Invalid float found, recreating bottleneck")
    did_hit_error = True
  if did_hit_error:
    create_bottleneck_file(...)
    # Allow exceptions to propagate here, since they shouldn't happen after a fresh creation
    bottleneck_values = [float(x) for x in bottleneck_string.split(',')]

I'm hoping with this approach none of the other changes outside this function should be needed since the contract to always return a list of bottleneck values is preserved.

Does that make sense?

The patch detects the broken bottleneck files (might be created as a result of sudden system shut down) and recreates them if required.

rizasif · 2017-01-31T13:36:35Z

@petewarden I agree with your solution and was thinking to implement it. I have made the changes as suggested, and tested them as well; I am attaching a screenshot of the output. Kindly review and let me know if anything else is required.
Thank you :)

petewarden

This looking great, thanks for the updates! I have a couple of minor comments, but once those are addressed this looks good to go in.

petewarden · 2017-02-01T10:43:55Z

tensorflow/examples/image_retraining/retrain.py

@@ -481,7 +498,6 @@ def get_random_cached_bottlenecks(sess, image_lists, how_many, category,
      ground_truth[label_index] = 1.0
      bottlenecks.append(bottleneck)
      ground_truths.append(ground_truth)
-      filenames.append(image_name)


I think we should put this line back?

petewarden · 2017-02-01T10:44:21Z

tensorflow/examples/image_retraining/retrain.py

@@ -493,11 +509,12 @@ def get_random_cached_bottlenecks(sess, image_lists, how_many, category,
                                              image_index, image_dir, category,
                                              bottleneck_dir, jpeg_data_tensor,
                                              bottleneck_tensor)
-        ground_truth = np.zeros(class_count, dtype=np.float32)


Can we revert this change, since now the bottlenecks will never be None?

Reverting some part of code for bottleneck handling changes

rizasif · 2017-02-01T11:18:48Z

@petewarden done

rmlarsen · 2017-02-01T17:35:38Z

@tensorflow-jenkins test this please

rmlarsen · 2017-02-01T21:31:53Z

@rizasif it looks like you added some 4 character indents, which breaks the sanity check. TF uses 2 character indents. Please fix.

See: https://ci.tensorflow.org/job/tensorflow-pull-requests-sanity/2923/console

This commit refers to the issue tensorflow#2296

rizasif · 2017-02-01T22:40:24Z

Indentation changes made, kindly re-run the tests. Thanks :)

rmlarsen · 2017-02-02T06:07:21Z

@tensorflow-jenkins test this please

rmlarsen · 2017-02-02T06:09:16Z

Thanks @rizasif. @petewarden can you take another look? If the logics right to you, approve and I'll get it merged.

petewarden

This looks great, thanks!

rizasif · 2017-02-02T10:51:42Z

Did the tests pass? :/

rmlarsen · 2017-02-02T17:07:40Z

@rizasif it looks like you still have some bad indents, e.g.

FAIL: Found 1 non-whitelited pylint errors:
tensorflow/examples/image_retraining/retrain.py:353: [E0001(syntax-error), ] unexpected indent

This commit refers to pull request tensorflow#7131 and issue tensorflow#2296

rizasif · 2017-02-02T17:46:30Z

@rmlarsen the code seems to be working at my end; so we might have to do this a few more times to resolve these indentation issues. I hope this commit works fine. @tensorflow-jenkins please test :)

rmlarsen · 2017-02-03T17:28:30Z

@rizasif Thanks :)

@tensorflow-jenkins test this please

rmlarsen · 2017-02-03T18:41:33Z

@rizasif thanks for the contribution!

rizasif · 2017-02-03T21:03:47Z

Pleasure's all mine @rmlarsen
Thank you so much for all the support. Hope to see you guys again. Take Care :)

This reverts commit 4a80e9f.

Adding checks for broken bottleneck files

8577dde

An exception is raised when a broken cached bottleneck files is read. Such a file can be created as a result of sudden system failure. This commit refers the issue tensorflow#2296

googlebot added the cla: no label Jan 29, 2017

googlebot added cla: yes and removed cla: no labels Jan 29, 2017

rmlarsen assigned petewarden Jan 30, 2017

rmlarsen requested a review from petewarden January 30, 2017 20:47

rmlarsen added the awaiting review Pull request awaiting review label Jan 30, 2017

petewarden suggested changes Jan 30, 2017

View reviewed changes

rmlarsen added stat:awaiting response Status - Awaiting response from author and removed awaiting review Pull request awaiting review labels Jan 30, 2017

handling broken bottleneck files by recreating

1888851

The patch detects the broken bottleneck files (might be created as a result of sudden system shut down) and recreates them if required.

rmlarsen added stat:awaiting tensorflower Status - Awaiting response from tensorflower and removed stat:awaiting response Status - Awaiting response from author labels Jan 31, 2017

petewarden suggested changes Feb 1, 2017

View reviewed changes

minor changes for broken bottleneck handling

1e80dcf

Reverting some part of code for bottleneck handling changes

rmlarsen added stat:awaiting response Status - Awaiting response from author and removed stat:awaiting tensorflower Status - Awaiting response from tensorflower labels Feb 1, 2017

Resolved indentation errors

b43c4d9

This commit refers to the issue tensorflow#2296

rmlarsen added awaiting review Pull request awaiting review and removed stat:awaiting response Status - Awaiting response from author labels Feb 2, 2017

petewarden approved these changes Feb 2, 2017

View reviewed changes

Indentation issue for broken bottleneck script

8228dee

This commit refers to pull request tensorflow#7131 and issue tensorflow#2296

rmlarsen merged commit 4a80e9f into tensorflow:master Feb 3, 2017

alberto7088 pushed a commit to synthesisai/tensorflow that referenced this pull request Feb 10, 2017

Revert "Adding checks for broken bottleneck files (tensorflow#7131)"

8587db2

This reverts commit 4a80e9f.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding checks for broken bottleneck files #7131

Adding checks for broken bottleneck files #7131

rizasif commented Jan 29, 2017

tensorflow-jenkins commented Jan 29, 2017

googlebot commented Jan 29, 2017

rizasif commented Jan 29, 2017

googlebot commented Jan 29, 2017

petewarden left a comment

rizasif commented Jan 31, 2017

petewarden left a comment

petewarden Feb 1, 2017

petewarden Feb 1, 2017

rizasif commented Feb 1, 2017

rmlarsen commented Feb 1, 2017

rmlarsen commented Feb 1, 2017 •

edited

rizasif commented Feb 1, 2017

rmlarsen commented Feb 2, 2017

rmlarsen commented Feb 2, 2017

petewarden left a comment

rizasif commented Feb 2, 2017

rmlarsen commented Feb 2, 2017

rizasif commented Feb 2, 2017

rmlarsen commented Feb 3, 2017

rmlarsen commented Feb 3, 2017

rizasif commented Feb 3, 2017

Adding checks for broken bottleneck files #7131

Adding checks for broken bottleneck files #7131

Conversation

rizasif commented Jan 29, 2017

tensorflow-jenkins commented Jan 29, 2017

googlebot commented Jan 29, 2017

rizasif commented Jan 29, 2017

googlebot commented Jan 29, 2017

petewarden left a comment

Choose a reason for hiding this comment

rizasif commented Jan 31, 2017

petewarden left a comment

Choose a reason for hiding this comment

petewarden Feb 1, 2017

Choose a reason for hiding this comment

petewarden Feb 1, 2017

Choose a reason for hiding this comment

rizasif commented Feb 1, 2017

rmlarsen commented Feb 1, 2017

rmlarsen commented Feb 1, 2017 • edited

rizasif commented Feb 1, 2017

rmlarsen commented Feb 2, 2017

rmlarsen commented Feb 2, 2017

petewarden left a comment

Choose a reason for hiding this comment

rizasif commented Feb 2, 2017

rmlarsen commented Feb 2, 2017

rizasif commented Feb 2, 2017

rmlarsen commented Feb 3, 2017

rmlarsen commented Feb 3, 2017

rizasif commented Feb 3, 2017

rmlarsen commented Feb 1, 2017 •

edited