I think there is a bug in the implementation of bpc #2

OleNet · 2021-04-05T09:04:57Z

According to the material I found here and here, bpc = log2(NLL).
But in your code, I found that bpc = NLL / log(2).
Is there something wrong with the calculation of bpc, or have I missed something?

yzh119 · 2021-04-05T12:14:37Z

My implementation follows the definition of BPC in https://arxiv.org/pdf/1308.0850.pdf (page 8) and matches the implementations in Transformer-XL and the adaptive-span transformer, as well as the StackOverflow thread you posted.

As for the paper you mentioned, I think that's a typo in the paper. NLL is already a log-domain quantity, since it is computed from log-probabilities, so no further log should be applied: https://pytorch.org/docs/stable/generated/torch.nn.NLLLoss.html#torch.nn.NLLLoss.
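
To make the unit conversion concrete, here is a minimal sketch in plain Python rather than the repository's actual code. The helper names `nll_nats` and `bits_per_char` and the probabilities are made up for illustration. It assumes the NLL is the average negative natural-log probability of the correct characters, so dividing by ln 2 converts nats to bits.

```python
import math

def nll_nats(target_probs):
    """Average negative log-likelihood in nats (natural log)."""
    return -sum(math.log(p) for p in target_probs) / len(target_probs)

def bits_per_char(nll):
    """Convert an average NLL from nats to bits: divide by ln(2)."""
    return nll / math.log(2)

# Hypothetical probabilities a model assigns to the correct characters.
probs = [0.5, 0.25, 0.125, 0.5]

nll = nll_nats(probs)
bpc = bits_per_char(nll)

# Reference value: average the base-2 negative log-probabilities directly.
direct = -sum(math.log2(p) for p in probs) / len(probs)

print(bpc, direct)
```

Under these assumptions `bpc` and `direct` agree, whereas `math.log2(nll)` gives a different number, which is consistent with reading bpc = log2(NLL) as a typo.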

OleNet · 2021-04-06T06:17:47Z

OK, I got it, thanks.

OleNet closed this as completed Apr 6, 2021
