Text Classification Configuration: Paper vs Code #21

adamsolomou · 2021-03-19T09:44:31Z

Hi, for the byte-level document classification task there seems to be a discrepancy between the paper (see Appendix 1.2) and the config file in the repository.

Paper

6 layers, 8 heads, 512 hidden dimensions, d=2048 for positional FFN

Code

config.emb_dim = 256
config.num_heads = 4
config.num_layers = 4
config.qkv_dim = 256
config.mlp_dim = 1024

Could you please resolve this?

This is also the case for other tasks, e.g. Image Classification

MostafaDehghani · 2021-06-10T07:50:21Z

Configs for reproducing the results are now available at: https://github.com/google-research/long-range-arena/tree/main/lra_benchmarks/text_classification/configs

Feel free to reopen the issue if there was any further questions.

vanzytay closed this as completed Jun 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Text Classification Configuration: Paper vs Code #21

Text Classification Configuration: Paper vs Code #21

adamsolomou commented Mar 19, 2021 •

edited

MostafaDehghani commented Jun 10, 2021

Text Classification Configuration: Paper vs Code #21

Text Classification Configuration: Paper vs Code #21

Comments

adamsolomou commented Mar 19, 2021 • edited

MostafaDehghani commented Jun 10, 2021

adamsolomou commented Mar 19, 2021 •

edited