Thanks for updating the implementation so frequently! The codebase looks much nicer than when I first looked at it. I took a closer look at the details and would like to ask some questions about specific design choices. Of course, I understand that not every design decision comes with a reason, but I would like to know whether you have a reference or intuition for these:
It seems that in the main generator, self-attention comes before cross-attention, while in the upsampler, cross-attention comes before self-attention. Is there a reason for the different orderings? A sketch of what I mean is below.
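To make sure I'm reading the code right, here is a minimal sketch of the two orderings; the class names are mine, not the repo's, and the blocks are simplified down to the two attention calls:

```python
# Hypothetical sketch of the two block orderings in question.
import torch
import torch.nn as nn


class GeneratorBlock(nn.Module):
    """Self-attention first, then cross-attention (as in the main generator)."""

    def __init__(self, dim: int, heads: int = 8):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x: torch.Tensor, context: torch.Tensor) -> torch.Tensor:
        x = x + self.self_attn(x, x, x)[0]                # self-attention first
        x = x + self.cross_attn(x, context, context)[0]   # then cross-attention
        return x


class UpsamplerBlock(nn.Module):
    """Cross-attention first, then self-attention (as in the upsampler)."""

    def __init__(self, dim: int, heads: int = 8):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x: torch.Tensor, context: torch.Tensor) -> torch.Tensor:
        x = x + self.cross_attn(x, context, context)[0]   # cross-attention first
        x = x + self.self_attn(x, x, x)[0]                # then self-attention
        return x


x = torch.randn(2, 16, 64)        # (batch, tokens, dim)
context = torch.randn(2, 16, 64)
print(GeneratorBlock(64)(x, context).shape)   # torch.Size([2, 16, 64])
print(UpsamplerBlock(64)(x, context).shape)   # torch.Size([2, 16, 64])
```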
This line has a residual connection, but the residual is already applied inside the Transformer class. Same here. Is this something new in the transformer literature?
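Again, a rough sketch of the pattern I mean, with hypothetical names standing in for the repo's actual classes: the outer `x + transformer(x)` wraps a module whose forward already returns `x + attn(...)`, so the identity path is counted twice.

```python
# Hypothetical sketch of the double residual being asked about.
import torch
import torch.nn as nn


class InnerTransformer(nn.Module):
    """Stand-in for the repo's Transformer: already residual internally."""

    def __init__(self, dim: int, heads: int = 8):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.norm(x)
        return x + self.attn(h, h, h)[0]  # inner residual


transformer = InnerTransformer(64)
x = torch.randn(2, 16, 64)

out = x + transformer(x)  # the line in question: a second, outer residual
# Effectively out = 2*x + attn(norm(x)), i.e. the skip path is doubled.
```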
Here in the discriminator, the residual is added after the attention block. Would it make more sense to add it right after the two conv blocks, since the attention block has its own residual connection? See the sketch below.
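Here is a minimal sketch of the two placements I'm comparing; again, the module names are hypothetical and the layers are simplified, not the repo's actual discriminator:

```python
# Hypothetical sketch of the two residual placements in the discriminator block.
import torch
import torch.nn as nn


class ResidualAttention(nn.Module):
    """Attention block with its own internal skip connection."""

    def __init__(self, dim: int, heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (batch, dim, h, w)
        b, c, hh, ww = x.shape
        t = x.flatten(2).transpose(1, 2)      # (batch, h*w, dim)
        t = t + self.attn(t, t, t)[0]         # attention's own residual
        return t.transpose(1, 2).reshape(b, c, hh, ww)


class DiscBlock(nn.Module):
    def __init__(self, dim: int, residual_after_attn: bool):
        super().__init__()
        self.convs = nn.Sequential(
            nn.Conv2d(dim, dim, 3, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(dim, dim, 3, padding=1), nn.LeakyReLU(0.2),
        )
        self.attn = ResidualAttention(dim)
        self.residual_after_attn = residual_after_attn

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if self.residual_after_attn:
            # current placement: the outer skip also spans the
            # (already residual) attention block
            return x + self.attn(self.convs(x))
        # suggested placement: close the skip right after the conv blocks
        return self.attn(x + self.convs(x))


x = torch.randn(2, 32, 8, 8)
print(DiscBlock(32, True)(x).shape)    # torch.Size([2, 32, 8, 8])
print(DiscBlock(32, False)(x).shape)   # torch.Size([2, 32, 8, 8])
```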
A very tiny issue: the definition here is unused.