Changes to hint losses, mostly for decode_diffs=True. Before, only one of the terms of the MASK type loss was masked by gt_diff. Also, the loss was averaged over all time steps, including steps without diffs and therefore contributing 0 to the loss. Now we average only over the non-zero-diff steps. #57

Merged
copybara-service[bot] merged 1 commit into master from test_431003851 on Feb 27, 2022

Conversation

@copybara-service

Changes to hint losses, mostly for decode_diffs=True. Previously, only one of the terms of the MASK type loss was masked by gt_diff. In addition, the loss was averaged over all time steps, including steps without diffs, which contribute 0 to the loss. The loss is now averaged only over the non-zero-diff steps.

With this change, the chunked and non-chunked versions of the loss code can share more of their implementation.
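
The averaging change described above can be illustrated with a minimal sketch. This is not the code from the commit. The function names are hypothetical, and the sketch assumes that the MASK type loss is a binary cross-entropy on logits and that gt_diff holds one 0/1 value per time step. The guard against sequences with no diffs is also an assumption.

```python
import math


def bce_with_logits(logit, target):
    """Numerically stable binary cross-entropy on a raw logit (assumed loss form)."""
    return max(logit, 0.0) - logit * target + math.log1p(math.exp(-abs(logit)))


def masked_per_step_losses(logits, targets, gt_diffs):
    """Mask the whole per-step loss by gt_diff, not just one of its terms."""
    return [bce_with_logits(p, y) * d
            for p, y, d in zip(logits, targets, gt_diffs)]


def mean_over_all_steps(logits, targets, gt_diffs):
    """Earlier behaviour per the description: zero-diff steps dilute the mean."""
    per_step = masked_per_step_losses(logits, targets, gt_diffs)
    return sum(per_step) / len(per_step)


def mean_over_diff_steps(logits, targets, gt_diffs):
    """New behaviour per the description: divide by the number of non-zero-diff steps."""
    per_step = masked_per_step_losses(logits, targets, gt_diffs)
    # Assumption: clamp the divisor so an all-zero gt_diffs yields 0 rather than an error.
    return sum(per_step) / max(sum(gt_diffs), 1.0)


# Four time steps, of which only the first and last carry a diff.
logits = [0.0, 2.0, -1.0, 0.0]
targets = [1.0, 1.0, 0.0, 1.0]
gt_diffs = [1.0, 0.0, 0.0, 1.0]

old_value = mean_over_all_steps(logits, targets, gt_diffs)
new_value = mean_over_diff_steps(logits, targets, gt_diffs)
```

In this example both diff steps have a per-step loss of log 2, so dividing by all four steps halves the reported loss, while dividing by the two diff steps leaves it at log 2. The value then no longer depends on how many zero-diff steps the sequence happens to contain.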

@google-cla

google-cla bot commented Feb 25, 2022

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

For more information, open the CLA check for this pull request.

…e of the terms of the MASK type loss was masked by gt_diff. Also, the loss was averaged over all time steps, including steps without diffs and therefore contributing 0 to the loss. Now we average only over the non-zero-diff steps.

With this, the chunked and non-chunked code of the losses can be shared more.

PiperOrigin-RevId: 431292151
copybara-service bot merged commit 25a7cbb into master on Feb 27, 2022
copybara-service bot deleted the test_431003851 branch on February 27, 2022 at 19:38