
Conversation

karmel (Contributor) commented Mar 10, 2018

These changes bring us to 76+% accuracy and also improve performance, especially in the 8-GPU case. Notable differences in training:

  • Use the bounding boxes supplied in the ImageNet dataset (accuracy)
  • Use the sample_distorted_bounding_box op instead of creating a random box for cropping during training (speed)
  • Use the fused op for decoding and cropping instead of two separate ops (speed)
  • The fused decoding requires that we resize after cropping

Eval is the same as before, but cleaned up slightly now that those functions are no longer shared with training. The new training crop path is sketched below.
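For readers following along, here is a minimal sketch of the crop path described in the bullets above, using the TF 1.x image ops they name; the thresholds, crop-window math, and the 224x224 output size are illustrative defaults, not necessarily the exact values in this PR:

```python
import tensorflow as tf

def _train_crop_sketch(image_buffer, bbox):
  """image_buffer: raw JPEG bytes; bbox: [1, num_boxes, 4] normalized coords."""
  # Sample a distorted crop box that respects the ImageNet-supplied bounding box.
  bbox_begin, bbox_size, _ = tf.image.sample_distorted_bounding_box(
      tf.image.extract_jpeg_shape(image_buffer),
      bounding_boxes=bbox,
      min_object_covered=0.1,
      aspect_ratio_range=[0.75, 1.33],
      area_range=[0.05, 1.0],
      max_attempts=100,
      use_image_if_no_bounding_boxes=True)

  # Convert the sampled box into the crop window expected by the fused op.
  offset_y, offset_x, _ = tf.unstack(bbox_begin)
  target_height, target_width, _ = tf.unstack(bbox_size)
  crop_window = tf.stack([offset_y, offset_x, target_height, target_width])

  # Fused decode + crop: only the cropped region of the JPEG is decoded.
  cropped = tf.image.decode_and_crop_jpeg(image_buffer, crop_window, channels=3)

  # Because cropping happens at decode time, the resize comes last.
  return tf.image.resize_images(cropped, [224, 224])
```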

A screenshot from a 4-GPU run is below.

FYI @reedwm @bignamehyp @scott7z @robieta

[Screenshot: 4-GPU training run, 2018-03-09 4:45 PM]

karmel requested review from k-w-w and nealwu on March 10, 2018 at 00:46
googlebot commented

So there's good news and bad news.

👍 The good news is that everyone who needs to sign a CLA (the pull request submitter and all commit authors) has done so. Everything is all good there.

😕 The bad news is that it appears that one or more commits were authored by someone other than the pull request submitter. We need to confirm that all authors are ok with their commits being contributed to this project. Please have them confirm that here in the pull request.

Note to project maintainer: This is a terminal state, meaning the cla/google commit status will not change from this state. It's up to you to confirm consent of the commit author(s) and merge this pull request when appropriate.

nealwu (Contributor) left a comment


It looks like you're mainly swapping in code from https://github.com/tensorflow/models/blob/master/research/inception/inception/image_processing.py, right? Would it work better to just switch to that file?

Also, this PR ended up somehow including a commit from https://github.com/MTDzi, which I don't think you intended.

On this docstring excerpt:

"The output of the build_image_data.py image preprocessing script is a dataset containing serialized Example protocol buffers. Each Example proto contains the following fields: image/encoded (a JPEG-encoded string) and image/class/label (int)"

The numbers below are example values, right? Could we specify that in this comment?
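
For context on how a consumer reads those two fields back, here is a minimal sketch using the TF 1.x parsing API; the feature spec and default values are assumptions for illustration, not the repo's actual parser:

```python
import tensorflow as tf

def _parse_record_sketch(serialized_example):
  # Only the two fields quoted above; real records carry more features.
  features = tf.parse_single_example(
      serialized_example,
      features={
          'image/encoded': tf.FixedLenFeature([], tf.string, default_value=''),
          'image/class/label': tf.FixedLenFeature([], tf.int64, default_value=-1),
      })
  image = tf.image.decode_jpeg(features['image/encoded'], channels=3)
  label = tf.cast(features['image/class/label'], tf.int32)
  return image, label
```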

nealwu (Contributor) commented Mar 13, 2018

Regardless, nice work on 76+% accuracy!

karmel (Contributor, Author) commented Mar 13, 2018

As I understand it, there are several preprocessing names used colloquially: "inception," "resnet," and "vgg." Inception preprocessing does more color distortion, which can improve accuracy but slows down processing. ResNet preprocessing is what we now have here, pulled largely from tf_cnn_benchmarks but with fewer flags and options for simplicity. VGG preprocessing is similar, except it does an aspect-preserving resize and uses no bounding boxes. The linked Inception preprocessing has the aforementioned color distortion, no fused op, etc., so it's a little more complex. Arguably we might want to rename the vgg file at this point, but for now it's probably okay to leave.
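
To make the VGG-vs-ResNet distinction concrete, here is a minimal sketch of an aspect-preserving resize (the VGG-style step mentioned above); the function name and the 256-pixel shorter side are illustrative assumptions, not taken from the repo:

```python
import tensorflow as tf

def _aspect_preserving_resize_sketch(image, smallest_side=256):
  # Scale so the shorter edge equals smallest_side while keeping the aspect ratio.
  shape = tf.shape(image)
  height = tf.cast(shape[0], tf.float32)
  width = tf.cast(shape[1], tf.float32)
  target = tf.constant(smallest_side, tf.float32)
  scale = tf.cond(tf.greater(height, width),
                  lambda: target / width,
                  lambda: target / height)
  new_height = tf.cast(tf.round(height * scale), tf.int32)
  new_width = tf.cast(tf.round(width * scale), tf.int32)
  return tf.image.resize_images(image, tf.stack([new_height, new_width]))
```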

Pulling in the random commits was my attempt to squash a bunch of debugging commits and a master merge, but that ended in confusion and regret, as git rebasing always seems to :/ The changes themselves are limited to the intended set, though; only the commit messages got weirdly included.

Comment updated; thanks for the review.

nealwu (Contributor) left a comment


I'm in favor of renaming to resnet_preprocessing, as right now the VGG name and the docstring at the top of the file aren't quite accurate for what the code actually does.

Also, we should definitely fix up the commit situation before merging. I would suggest just taking a git diff and writing that to a patch file, then hard resetting to the same state as master and git apply-ing your patch file to create a single commit. Then you can force push to overwrite this branch so that only your single commit is present.

karmel merged commit 86b1f07 into master on Mar 13, 2018
karmel deleted the fix/perf-tune-2 branch on March 13, 2018 at 21:48