
Conversation

guptapriya (Contributor)

Added the following changes to improve the input pipeline:

  1. Remove usage of one_hot labels. (~2% improvement)
  2. Add drop_remainder to batching. This lets the input shape be determined ahead of time, which improves performance. (~2% improvement)
  3. Use parallel_interleave for reading the ImageNet dataset. (~3% improvement)

The improvements were observed when testing with 8 V100 GPUs on a DGX-1 using mirrored strategy (fp16, ResNet v2). The differences may not be noticeable in other cases where the input pipeline is not the bottleneck. A sketch of the three changes together is shown below.
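A minimal sketch of what these three changes look like together in a tf.data pipeline, using the tf.contrib.data API current at the time. The file pattern, parse function, batch size, and cycle_length value here are illustrative assumptions, not the PR's actual code:

```python
import tensorflow as tf

def parse_record_fn(serialized):
  # Hypothetical parser: the real model decodes the image as well. The
  # point of change (1) is to return an integer label and let the loss
  # op (e.g. sparse_softmax_cross_entropy) consume it, instead of
  # materializing one-hot vectors in the input pipeline.
  features = tf.parse_single_example(
      serialized, {"label": tf.FixedLenFeature([], tf.int64)})
  return features["label"]

# (3) Interleave reads across the sharded TFRecord files rather than
# reading them one at a time; cycle_length files are read concurrently.
dataset = tf.data.Dataset.list_files("/data/imagenet/train-*")
dataset = dataset.apply(tf.contrib.data.parallel_interleave(
    tf.data.TFRecordDataset, cycle_length=10))

# (2) drop_remainder=True discards the final partial batch, so every
# batch has the same static shape and downstream ops can be built with
# fully known dimensions.
dataset = dataset.apply(tf.contrib.data.map_and_batch(
    parse_record_fn, batch_size=128, num_parallel_batches=1,
    drop_remainder=True))
dataset = dataset.prefetch(1)
```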

@guptapriya guptapriya requested review from a team and karmel as code owners May 11, 2018 23:54
@guptapriya guptapriya requested a review from robieta May 11, 2018 23:56
@robieta (Contributor) left a comment:

I think overall this is a good balance between performance and accessibility. Thanks a lot for doing this.

dataset = dataset.shuffle(buffer_size=_NUM_TRAIN_FILES)

# Convert to individual records
# TODO(guptapriya): Should we make this cycle_length a flag similar to
Contributor:

Did you get a chance to check how sensitive performance is to this on both something big (DGX, V100 GCE) and something small (1x K80/P100)? I prefer not to have a performance flag unless it makes a big difference. And if it is a constant it would be nice to have a brief comment so it isn't just a magic number.

guptapriya (Author):

No, I haven't had the chance to run on anything other than a DGX-1V. I don't think the performance difference will show up on K80s, because the input pipeline will not be the bottleneck there, but I haven't tested it. I am talking to the input team to figure out whether a constant here makes sense or whether it should be tuned (in which case we may need to just remove it).
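For reference, a sketch of the constant-with-a-comment option suggested above; the name and value are illustrative assumptions, not a tuned recommendation:

```python
import tensorflow as tf

# Number of TFRecord files to read and interleave concurrently. Larger
# values increase read parallelism (and memory use); this value worked
# on a DGX-1 with 8 V100s and may deserve retuning on machines where
# the input pipeline is not the bottleneck.
_FILE_CYCLE_LENGTH = 10  # hypothetical name and value

dataset = tf.data.Dataset.list_files("/data/imagenet/train-*")
dataset = dataset.apply(tf.contrib.data.parallel_interleave(
    tf.data.TFRecordDataset, cycle_length=_FILE_CYCLE_LENGTH))
```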

      batch_size=batch_size,
-     num_parallel_batches=1))
+     num_parallel_batches=1,
+     drop_remainder=True))
Contributor:

This could matter for cifar with only 60k images.

Contributor:

It should not matter much as long as the batch size is reasonable, since this only drops the part of the dataset that doesn't fit into a full batch.

guptapriya (Author):

Right, it only drops the last partial batch. So even with 60k images and a batch size of 2048, it should drop only 608 images or so.
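A quick check of that arithmetic (plain Python, not code from the PR):

```python
# drop_remainder discards only the final partial batch, so the number
# of examples lost per epoch is the dataset size modulo the batch size.
num_examples = 60000               # CIFAR, as discussed above
batch_size = 2048
print(num_examples % batch_size)   # 608 (60000 - 29 * 2048)
```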

@googlebot

So there's good news and bad news.

👍 The good news is that everyone who needs to sign a CLA (the pull request submitter and all commit authors) has done so. Everything is all good there.

😕 The bad news is that it appears that one or more commits were authored or co-authored by someone other than the pull request submitter. We need to confirm that all authors are ok with their commits being contributed to this project. Please have them confirm that here in the pull request.

Note to project maintainer: This is a terminal state, meaning the cla/google commit status will not change from this state. It's up to you to confirm consent of the commit author(s) and merge this pull request when appropriate.

@pkulzc (Contributor) commented May 15, 2018:

The object detection change in this PR has already been approved and merged in another PR, so object detection reviews are not necessary. I'm removing derekjchow, jch1, and myself from the reviewers.
Please let me know if further reviews from us are required. Thanks!

@pkulzc pkulzc removed request for derekjchow, jch1 and pkulzc May 15, 2018 06:44
@guptapriya (Author):

@pkulzc - yes! I messed up this PR while rebasing, which is why it pulled in some already-merged changes :( Sorry about the spam. I will fix it, and no action is needed from you!

