Skip to content

Conversation

guptapriya
Copy link
Contributor

Added the following changes to improve input pipeline.

  1. Remove usage of one_hot labels (~2% improvement)
  2. Add drop_remainder to batching. This helps in input shape being determined ahead of time which helps improve performance. (~2% improvement)
  3. Use parallel_interleave for imagenet dataset reading. (~3% improvement)

The improvements were observed when testing with 8 V100 GPUs on DGX-1 using mirrored strategy (fp16, resnet v2). Also saw improvements in GCE V100. No improvement observed on K80s but that is expected since they are not bottlenecked on input.

@guptapriya guptapriya requested review from a team and karmel as code owners May 22, 2018 20:48
@guptapriya
Copy link
Contributor Author

This is just a re-do of #4247 which I messed up.

@guptapriya guptapriya requested a review from robieta May 22, 2018 20:49
Copy link
Contributor

@robieta robieta left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@robieta robieta merged commit fd1d178 into master May 23, 2018
@tfboyd tfboyd deleted the resnet-perf-2 branch February 7, 2019 18:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants