Hi~ I've read your paper and the code, and I'm a little confused about some details.
Firstly, the bounding-box coordinates are discretized, and the layout vocabulary consists of the 8-bit uniform quantization tokens (whose token indices are 0–127 in your code), the category tokens, the padding token, and the bos and eos tokens. But what if, during inference, a coordinate predicted by the trained model does not fall within the 0–127 token range? Do you apply any post-processing to correct such mis-predicted layouts?
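For concreteness, here is a toy sketch of how I understand the coordinate binning, plus one possible decoding-time mask that would keep predictions inside the coordinate range; the function names and the masking step are my own assumptions, not taken from your repository:

```python
import torch

# Hypothetical vocabulary layout based on my reading of the code:
# indices 0-127 are coordinate bins, everything above is
# category / pad / bos / eos tokens.
NUM_COORD_BINS = 128

def quantize(coord, canvas_size):
    """Uniformly quantize a coordinate in [0, canvas_size) to a bin in [0, 127]."""
    return min(int(coord / canvas_size * NUM_COORD_BINS), NUM_COORD_BINS - 1)

def mask_to_coordinate_bins(logits):
    """Constrain the next token to a coordinate bin by masking all other logits.

    `logits` has shape (vocab_size,); entries at index >= NUM_COORD_BINS are
    set to -inf so greedy decoding or sampling can only pick a valid bin.
    """
    masked = logits.clone()
    masked[NUM_COORD_BINS:] = float("-inf")
    return masked
```

Is something along these lines done at inference, or are out-of-range predictions simply kept or dropped?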
Secondly, your paper says that 'we minimize KL-Divergence between soft-max predictions and output one-hot distribution with Label Smoothing', so why does the code still use a cross-entropy loss?
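For reference, my current understanding is that the two objectives differ only by a constant (the entropy of the smoothed target distribution), so their gradients should be identical. Here is a small PyTorch check I wrote to convince myself, with a made-up vocabulary size and smoothing factor; please correct me if this is not the equivalence you had in mind:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
vocab_size, eps = 133, 0.1          # made-up numbers, just for the check
logits = torch.randn(1, vocab_size)
target_idx = torch.tensor([5])

# Label-smoothed target: (1 - eps) on the true token, eps spread uniformly
# over the remaining tokens (one common convention).
smooth = torch.full((1, vocab_size), eps / (vocab_size - 1))
smooth[0, target_idx] = 1.0 - eps

log_probs = F.log_softmax(logits, dim=-1)

# KL(target || pred) = -H(target) + cross_entropy(pred, target)
kl = F.kl_div(log_probs, smooth, reduction="batchmean")
ce = -(smooth * log_probs).sum(dim=-1).mean()
entropy = -(smooth * smooth.log()).sum(dim=-1).mean()   # constant w.r.t. the model

print(torch.allclose(kl, ce - entropy, atol=1e-6))       # True
```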
I also have a question. The paper describes the input as a random sequence of elements, but from the code it looks like the elements are fed in a specific sorted order.
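To make the question concrete, this is the difference I have in mind (toy elements, and the sort key is just my guess at what the preprocessing does):

```python
import random

# Toy layout: one (category, x, y, w, h) tuple per element.
layout = [(3, 10, 20, 40, 15), (1, 5, 5, 30, 30), (2, 0, 60, 80, 10)]

# What I understood from the paper: shuffle the element order.
random.shuffle(layout)

# What the code seems to do: keep elements in a fixed raster order,
# e.g. sorted top-to-bottom, then left-to-right.
layout_sorted = sorted(layout, key=lambda e: (e[2], e[1]))
```

Which of the two orderings is actually used during training?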