Can PRIMERA accept 16k input? #17
@GabrielLin: Could you please tell me whether the models on HF (https://huggingface.co/allenai/PRIMERA, https://huggingface.co/allenai/PRIMERA-arxiv) can accept 16k input. Can I just set max_length to 16384 to let them accept such a long document? Thanks.

@Wendy-Xiao: Hi there, yes, PRIMERA can accept 16k input. However, the models on HF were only pretrained with max_length=4096, so they have no trained position embeddings for tokens beyond that point. If you would like to use a larger max_length, you can follow the method used in Longformer-Encoder-Decoder: simply copy the position embeddings four times, then fine-tune the model with the new position embeddings.

@GabrielLin: Dear @Wendy-Xiao, thank you for your answer. It resolves my concern.

@attekei: @GabrielLin, curious whether you have built a model with max_length=16384. I'm summarising lots of documents at once, and frequently have ~10,000 tokens in total 🙂 If you have trained such a model, it would be really cool if you could publish it on Hugging Face.

@GabrielLin: Hi @attekei, thank you for your interest. I am just doing research to compare different models.
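The embedding-copying trick described in the answer can be sketched as below. This is an untested sketch, not an official recipe: the helper function is hypothetical, and the commented attribute path (`model.led.encoder.embed_positions`) and class name (`LEDForConditionalGeneration`) follow the Hugging Face LED implementation that PRIMERA builds on, so verify them against your installed transformers version.

```python
import torch

def extend_position_embeddings(embed: torch.nn.Embedding, factor: int) -> torch.nn.Embedding:
    """Return a new embedding whose weight tiles the trained positional
    embeddings `factor` times along the position axis (the same trick LED
    used to stretch BART's positions)."""
    old = embed.weight.data  # shape: (max_positions, hidden_size)
    new = torch.nn.Embedding(old.size(0) * factor, old.size(1))
    new.weight.data = old.repeat(factor, 1)  # copy trained weights factor times
    return new

# Hypothetical application to PRIMERA (untested; attribute path per HF's LED
# code, and the real embed_positions class overrides forward(), so in practice
# you may prefer to tile only its .weight.data in place):
#
#   model = LEDForConditionalGeneration.from_pretrained("allenai/PRIMERA")
#   pos = model.led.encoder.embed_positions
#   pos.weight.data = pos.weight.data.repeat(4, 1)          # 4096 -> 16384
#   pos.num_embeddings = pos.weight.size(0)
#   model.config.max_encoder_position_embeddings = pos.weight.size(0)
```

After extending, the copied embeddings carry no information about their new absolute positions, which is why the answer above stresses fine-tuning with the new position embeddings before use.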