Potential bug in weighting trigger candidate losses #4

davides · 2022-12-08T17:47:57Z

First, thanks for publishing your implementation of this technique. It's been very helpful!

While stepping through the code, I think I may have found a small issue. The goal of this code seems to be to re-weight the loss contributed by each batch by the fraction of unmasked tokens it contains.

If that's the case, shouldn't curr_num_elements count all elements != constants.PAD_TOKEN_ID (-100), rather than all elements != -1?
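
To illustrate the concern, here is a minimal sketch in plain Python. It is not the repository's code: the label values, the helper names count_unmasked and weighted_mean_loss, and the list-based batch are all hypothetical, and the only thing taken from the code is that masked positions carry constants.PAD_TOKEN_ID (-100).

```python
# Assumption: masked label positions hold -100, matching constants.PAD_TOKEN_ID.
PAD_TOKEN_ID = -100


def count_unmasked(labels, ignore_value):
    """Count label entries that differ from ignore_value."""
    return sum(1 for row in labels for tok in row if tok != ignore_value)


def weighted_mean_loss(batch_losses, batch_counts):
    """Average per-batch mean losses, weighting each batch by its token count."""
    total = sum(batch_counts)
    return sum(loss * n for loss, n in zip(batch_losses, batch_counts)) / total


# Hypothetical labels for one batch; -100 marks masked positions.
labels = [
    [-100, -100, 17, 42],
    [-100, 5, -100, -100],
]

# Comparing against -1 never matches a masked position, so every slot is counted.
count_vs_minus_one = count_unmasked(labels, -1)

# Comparing against PAD_TOKEN_ID counts only the unmasked tokens.
count_vs_pad = count_unmasked(labels, PAD_TOKEN_ID)
```

With these made-up labels, the comparison against -1 counts all eight positions while the comparison against -100 counts three. If every batch has the same shape, the first count gives every batch the same weight, so the re-weighting would have no effect.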

Thanks!
