[QUESTION] single_texts vs group_texts #13

agademic · 2023-03-06T22:30:24Z

Hi @mallorbc!
Thank you so much for your work on the repo and your tutorials!

Did you experiment with the different data preprocessing settings (single_texts, group_texts) in different task scenarios?
I am running experiments on the quotes dataset and I am getting very different losses in these two settings: single ~ 0.89 vs group ~ 3.3.

Single is padded to a certain length, while group is concatenated with eos token in between.

Do you have any idea when to use which setting or why there is this difference in loss?

Any hints are appreciated!

mallorbc · 2023-03-15T01:20:45Z

While I have not tested the two, intuitively they make sense. For group texts, if your data entries are not related at all, by grouping them together the model will incorrectly learn that some text follows some unrelated text.

If entries are statistically independent, I keep them separate. If they are related, such as a book or some general knowledge corpus for further pretraining, I would group them

mallorbc · 2023-03-15T23:10:12Z

Closing. Feel free to reopen if you are still confused.

agademic changed the title ~~single_texts vs group_texts~~ [QUESTION] single_texts vs group_texts Mar 6, 2023

mallorbc closed this as completed Mar 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QUESTION] single_texts vs group_texts #13

[QUESTION] single_texts vs group_texts #13

agademic commented Mar 6, 2023

mallorbc commented Mar 15, 2023

mallorbc commented Mar 15, 2023

[QUESTION] single_texts vs group_texts #13

[QUESTION] single_texts vs group_texts #13

Comments

agademic commented Mar 6, 2023

mallorbc commented Mar 15, 2023

mallorbc commented Mar 15, 2023