If we look at this line, we see that the input sequence is padded on the right.
And in order to generate the predictive distribution, we do this, which makes sense: the last element in the sequence is not used for prediction. But in an autoregressive setting, I would then assume that element 0, a_0, predicts a_1, and so on.
In the training script, we have
```python
vertex_model_loss = -tf.reduce_sum(
    vertex_model_pred_dist.log_prob(vertex_model_batch['vertices_flat'])
    * vertex_model_batch['vertices_flat_mask'])
```
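For reference, here is a minimal numpy sketch of what that masked loss computes (the values and shapes are illustrative, not taken from the repo): each position's log-likelihood is summed, and the mask zeroes out the right-padding so padded positions contribute nothing.

```python
import numpy as np

# Illustrative per-token log-likelihoods for one sequence of length 4,
# where the last position is right-padding.
log_probs = np.array([[-0.1, -0.5, -2.0, -3.0]])

# Mask is 1 for real tokens, 0 for padding, mirroring vertices_flat_mask.
mask = np.array([[1.0, 1.0, 1.0, 0.0]])

# Masked negative log-likelihood: the padded position is excluded.
loss = -np.sum(log_probs * mask)
print(loss)
```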
This does not make sense to me: it would mean we are using the current element to predict itself, which has zero generative power, right?
@saran-t @charlienash
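To make the alignment I would expect concrete, here is a minimal sketch of the standard teacher-forcing shift in an autoregressive model (token names and values are hypothetical, not from the repo): the model conditioned on tokens up to position i is trained to predict the token at position i + 1, never itself.

```python
import numpy as np

# Hypothetical flat token sequence a_0 .. a_5.
seq = np.array([3, 1, 4, 1, 5, 9])

# Standard autoregressive alignment: drop the last element from the
# inputs and the first element from the targets, so a_i predicts a_{i+1}.
inputs = seq[:-1]   # a_0 .. a_{n-2}
targets = seq[1:]   # a_1 .. a_{n-1}

for a_i, a_next in zip(inputs, targets):
    print(f"{a_i} -> predicts -> {a_next}")
```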