
Conversation

@qlzh727 qlzh727 (Member) commented May 10, 2018

The final BN and ReLU layers are only needed for the v2 model, since v2 does preactivation in each block.
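For context, a minimal sketch of where the two variants place the final BN/ReLU (simplified, not the repo's actual code; the helper names are hypothetical):

```python
import tensorflow as tf

def resnet_v1_tail(inputs, training):
  # v1 blocks end with conv -> BN -> ReLU (post-activation), so the
  # output of the last block is already normalized and activated.
  # No extra BN/ReLU is needed before global pooling.
  return tf.reduce_mean(inputs, axis=[1, 2])  # global average pool

def resnet_v2_tail(inputs, training):
  # v2 blocks apply BN -> ReLU *before* each conv (preactivation), so
  # the raw output of the last block has not been normalized/activated.
  # A final BN + ReLU is therefore needed here, and only for v2.
  inputs = tf.layers.batch_normalization(inputs, training=training)
  inputs = tf.nn.relu(inputs)
  return tf.reduce_mean(inputs, axis=[1, 2])  # global average pool
```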

@qlzh727 qlzh727 requested review from karmel and robieta May 10, 2018 17:45
@qlzh727 qlzh727 requested a review from a team as a code owner May 10, 2018 17:45
@robieta robieta (Contributor) left a comment


LGTM

@karmel karmel (Contributor) commented May 10, 2018

How was this missed? Or, to put that another way, why didn't this have any effect in training, etc.? Does this fix the loss scale issue?

@robieta robieta (Contributor) commented May 10, 2018

Sadly this does not resolve the divergent fp16 v1 training.

@qlzh727 qlzh727 (Member, Author) commented May 10, 2018

@karmel, the extra ReLU shouldn't impact the correctness of the model. The extra BN does change the output numerically, scaling the final values down further.

The fp16 problem in v1 is caused by numeric overflow in the top few layers of the model; removing the final BN and ReLU won't fix it.
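To illustrate the overflow point (a small numeric example, not taken from the model code): fp16 has a maximum finite value of about 65504, so activations larger than that saturate to inf regardless of what happens at the end of the network.

```python
import numpy as np

# float16 overflows for magnitudes above ~65504; such values become inf,
# which then propagates through the rest of the forward pass.
x = np.float32(7e4)
print(np.float16(x))             # inf
print(np.finfo(np.float16).max)  # 65504.0
```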

@qlzh727 qlzh727 merged commit 89edd1c into tensorflow:master May 10, 2018
@qlzh727 qlzh727 deleted the resnet_v1_fix branch May 10, 2018 19:45
@HiKapok HiKapok commented May 11, 2018

Has the pretrained ResNet v1 model been replaced?

@qlzh727 qlzh727 (Member, Author) commented May 11, 2018

@HiKapok, the pretrained model has not been updated yet. Will do later this week.

omegafragger pushed a commit to omegafragger/models that referenced this pull request May 15, 2018