314 fix transformer training #318
Conversation
Hi Mark, thanks for working on this tutorial. During the review, I found a few things that might be wrong in the inferer (besides the ones pointed out in the review). I will try to investigate further.
# if we have not covered the full sequence we continue with inefficient looping
-    if probs.shape[1] < latent.shape[1]:
+    if logits.shape[1] < latent.shape[1]:
I think something might be wrong here, because logits.shape[1] < latent.shape[1] will always be true: the logits have size spatial_shape[0] * spatial_shape[1], while the latent has that size + 1 (for the BOS token).
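To make the shape arithmetic concrete, here is a minimal sketch of the two cases under discussion. The values (spatial_shape = (8, 8), a vocabulary of 16) are assumptions for illustration only; this is not the inferer's actual code.

import torch

spatial_shape = (8, 8)
seq_len = spatial_shape[0] * spatial_shape[1]            # 64 quantized tokens
latent = torch.zeros(1, seq_len + 1, dtype=torch.long)   # 65: tokens plus the BOS token

# Case 1: max_seq_len == seq_len, so the transformer only returns 64 positions
logits_short = torch.zeros(1, seq_len, 16)
print(logits_short.shape[1] < latent.shape[1])            # True -> looping branch is taken

# Case 2: max_seq_len == seq_len + 1 (BOS included), full coverage in one pass
logits_full = torch.zeros(1, seq_len + 1, 16)
print(logits_full.shape[1] < latent.shape[1])             # False -> no extra looping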
Running the tests, I find the logits and the latent have the same shape unless transformer_model.max_seq_len < (spatial_shape[0] * spatial_shape[1]) + 1; that is, the logits also have shape (spatial_shape[0] * spatial_shape[1]) + 1.
Yes, but usually transformer.max_seq_len = spatial_shape[0] * spatial_shape[1]. Are you considering cases where max_seq_len = (spatial_shape[0] * spatial_shape[1]) + 1 because we prepend the BOS token?
Yeah, I've always been setting max_seq_len = (spatial_shape[0] * spatial_shape[1]) + 1 in my networks. Have you been doing it without the +1? In all the tests for the VQVAETransformerInferer it is set to (spatial_shape[0] * spatial_shape[1]) + 1.
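For reference, that convention looks like this (illustrative values only, not copied from the actual test file):

spatial_shape = (8, 8)
max_seq_len = spatial_shape[0] * spatial_shape[1] + 1  # 65: all latent tokens plus the BOS token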
Fixes #314