Text Classification Configuration: Paper vs Code #21

Closed
adamsolomou opened this issue Mar 19, 2021 · 1 comment

adamsolomou commented Mar 19, 2021

Hi, for the byte-level document classification task there seems to be a discrepancy between the paper (see Appendix 1.2) and the config file in the repository.

Paper

6 layers, 8 heads, 512 hidden dimensions, d=2048 for positional FFN

Code

config.emb_dim = 256
config.num_heads = 4
config.num_layers = 4
config.qkv_dim = 256
config.mlp_dim = 1024

Could you please resolve this?

The same discrepancy appears in other tasks, e.g. Image Classification.
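To make the mismatch concrete, here is a minimal sketch that diffs the two settings field by field. The `TransformerConfig` dataclass and `diff_configs` helper are hypothetical and not part of the repository. Mapping the paper's "512 hidden dimensions" onto both `emb_dim` and `qkv_dim`, and "d=2048" onto `mlp_dim`, is an assumption about how the paper's terms correspond to the config fields.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class TransformerConfig:
    """Hypothetical container mirroring the config fields quoted above."""
    emb_dim: int
    num_heads: int
    num_layers: int
    qkv_dim: int
    mlp_dim: int


# Values as they appear in the repository config quoted in this issue.
code_config = TransformerConfig(
    emb_dim=256, num_heads=4, num_layers=4, qkv_dim=256, mlp_dim=1024
)

# Values from the paper's appendix. Assumption: "512 hidden dimensions" maps
# to both emb_dim and qkv_dim, and "d=2048" maps to mlp_dim.
paper_config = TransformerConfig(
    emb_dim=512, num_heads=8, num_layers=6, qkv_dim=512, mlp_dim=2048
)


def diff_configs(a, b):
    """Return {field: (value_in_a, value_in_b)} for every field that differs."""
    a_fields, b_fields = asdict(a), asdict(b)
    return {
        name: (a_fields[name], b_fields[name])
        for name in a_fields
        if a_fields[name] != b_fields[name]
    }


for name, (paper_value, code_value) in diff_configs(paper_config, code_config).items():
    print(f"{name}: paper={paper_value}, code={code_value}")
```

Under these assumptions every one of the five fields differs, so the repository config describes a smaller model than the one reported in the paper.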

@MostafaDehghani
Collaborator

Configs for reproducing the results are now available at: https://github.com/google-research/long-range-arena/tree/main/lra_benchmarks/text_classification/configs

Feel free to reopen the issue if there are any further questions.
