beam_size == 1 for self-critical decoding? #7

sgondala · 2020-03-03T20:46:39Z

The paper 'Self-critical Sequence Training for Image Captioning' mentions that our baseline is greedy argmax decoding, which is the same as the inference time technique used.

If that's the case, shouldn't the beam size for inference be always 1? If we just choose argmax at each step, there's just one possible way of forming a sentence right?

ruotianluo · 2020-03-03T20:50:01Z

There is no specific connection between baseline and inference method. They can be different.

SCST uses greedy decoding during inference time because beam search doesn't boost the performance much.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

beam_size == 1 for self-critical decoding? #7

beam_size == 1 for self-critical decoding? #7

sgondala commented Mar 3, 2020

ruotianluo commented Mar 3, 2020

beam_size == 1 for self-critical decoding? #7

beam_size == 1 for self-critical decoding? #7

Comments

sgondala commented Mar 3, 2020

ruotianluo commented Mar 3, 2020