
Discrepancy in the Number of Decoder Layers #226

Open
shreeshailgan opened this issue Mar 14, 2024 · 0 comments

Comments

@shreeshailgan

In Section 3.1, under Model Configuration, the paper states that the decoder consists of 4 FFT Transformer blocks. However, the provided checkpoints (and the model.yaml configs) have 6 FFT Transformer blocks in the decoder.
What is the reason for this discrepancy? Did you later observe improved performance with 6 decoder blocks instead of 4?
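
For reference, this is how I counted the blocks in the checkpoint. A minimal sketch, assuming a standard PyTorch checkpoint and a `decoder.layers.<i>.*` parameter naming scheme; the file name and key prefix are assumptions and may differ in this repo:

```python
import torch

# Load the released checkpoint on CPU (file name assumed).
ckpt = torch.load("model.ckpt", map_location="cpu")

# Some checkpoints wrap the weights in a "state_dict" entry; fall back to the
# top-level dict otherwise.
state = ckpt.get("state_dict", ckpt)

# Collect the distinct decoder block indices, e.g. "decoder.layers.3.attn.weight" -> 3.
# The "decoder.layers." prefix is an assumption about this repo's naming.
layer_ids = {
    int(key.split(".")[2])
    for key in state
    if key.startswith("decoder.layers.")
}
print(f"Decoder FFT blocks found: {len(layer_ids)}")  # prints 6 for the released checkpoints
```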
