extremely high accuracy in document retrieval task #18

Closed
mlpen opened this issue Jan 31, 2021 · 4 comments

Comments

@mlpen

mlpen commented Jan 31, 2021

Hi,

I am testing the document retrieval task. I found that the zip file (https://storage.googleapis.com/long-range-arena/lra_release.gz) already contains the actual documents rather than just document ids. When I run the test with my own PyTorch implementation of the model, the accuracy is over 70%.

@vanzytay
Collaborator

Two things to take note of here.

  1. Ensure you're not using cross attention between the two documents.
  2. Ensure you're using character-level tokenization, not subword or word level (see the sketch below).
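For reference, a minimal sketch of what these two checks mean on the input side; the tokenizer below is a hypothetical stand-in (one id per byte, padded to a fixed length), not the LRA pipeline itself:

    import torch

    def char_tokenize(text, max_length=4000):
        # Hypothetical character-level tokenizer: one id per byte, padded to max_length.
        ids = list(text.encode("utf-8"))[:max_length]
        ids = ids + [0] * (max_length - len(ids))
        return torch.tensor(ids).unsqueeze(0)  # shape (1, max_length)

    ids_0 = char_tokenize("first document ...")
    ids_1 = char_tokenize("second document ...")

    # No cross attention: each document is encoded independently, and only the
    # pooled representations are combined afterwards.
    # out_0 = encoder(ids_0)
    # out_1 = encoder(ids_1)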

Thanks

@mlpen
Author

mlpen commented Jan 31, 2021

Thanks for replying.

  1. I also use a two-tower style model:

         token_out_0 = self.model(input_ids_0, mask_0)
         token_out_1 = self.model(input_ids_1, mask_1)
         seq_scores = self.seq_classifer(token_out_0, token_out_1)

     Within self.seq_classifer, the following is computed:

         X_0 = pooling(token_out_0, self.pooling_mode)
         X_1 = pooling(token_out_1, self.pooling_mode)
         seq_scores = self.mlpblock(torch.cat([X_0, X_1, X_0 * X_1, X_0 - X_1], dim=-1))

     (A fuller sketch of this classification head follows after this list.)

  2. I use input_pipeline.get_matching_datasets to generate the data, with the tokenizer set to "char":

         train_ds, eval_ds, test_ds, encoder = input_pipeline.get_matching_datasets(
             n_devices=1, task_name=None, data_dir="../../lra_release/lra_release/tsv_data/",
             batch_size=1, fixed_vocab=None, max_length=4000, tokenizer="char",
             vocab_file_path=None)
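For completeness, a hedged sketch of the classification head in point 1: pooling and mlpblock are not shown above, so the definitions below (mean/CLS pooling and a small two-layer MLP) are assumptions rather than the exact code.

    import torch
    import torch.nn as nn

    def pooling(token_out, mode="MEAN"):
        # Assumed pooling: mean over the sequence dimension, or the first token for "CLS".
        if mode == "CLS":
            return token_out[:, 0]
        return token_out.mean(dim=1)

    class SeqClassifier(nn.Module):
        # Assumed head: concatenate [X_0, X_1, X_0 * X_1, X_0 - X_1] and classify with an MLP.
        def __init__(self, dim, num_classes=2, pooling_mode="MEAN"):
            super().__init__()
            self.pooling_mode = pooling_mode
            self.mlpblock = nn.Sequential(
                nn.Linear(4 * dim, dim),
                nn.ReLU(),
                nn.Linear(dim, num_classes),
            )

        def forward(self, token_out_0, token_out_1):
            X_0 = pooling(token_out_0, self.pooling_mode)
            X_1 = pooling(token_out_1, self.pooling_mode)
            return self.mlpblock(torch.cat([X_0, X_1, X_0 * X_1, X_0 - X_1], dim=-1))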

@adamsolomou

@mlpen How many training steps and warmup steps did you use? The config says to use 5K training steps and 8K warmup steps, but that seems odd, since the warmup would then be longer than the entire training run.
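To illustrate why that combination looks odd, assuming a standard linear warmup schedule (an assumption, not necessarily what the LRA config uses), a 5K-step run with 8K warmup steps never reaches the peak learning rate:

    def linear_warmup_lr(step, peak_lr=1e-3, warmup_steps=8000):
        # Hypothetical linear warmup: the learning rate ramps from 0 to peak_lr over warmup_steps.
        return peak_lr * min(1.0, step / warmup_steps)

    print(linear_warmup_lr(5000))  # 0.000625 -- only 62.5% of the peak rate at the final step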

@vanzytay
Collaborator

That's because we used some default FLAX code and only did a cursory sweep of hyperparameters (hyperparameter sweeps were not within the scope of the paper). Some other folks have found that training longer leads to better performance, so I recommend looking at works like https://arxiv.org/abs/2106.01540 and following their setup. Thanks :)
