[QUESTION] single_texts vs group_texts #13

Closed
agademic opened this issue Mar 6, 2023 · 2 comments

Comments

agademic commented Mar 6, 2023

Hi @mallorbc!
Thank you so much for your work on the repo and your tutorials!

Did you experiment with the different data preprocessing settings (single_texts, group_texts) in different task scenarios?
I am running experiments on the quotes dataset and I am getting very different losses in these two settings: single ~ 0.89 vs group ~ 3.3.

In the single setting, each text is padded to a fixed length, while in the group setting the texts are concatenated with an EOS token in between.

Do you have any idea when to use which setting or why there is this difference in loss?

Any hints are appreciated!
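
For reference, the two settings described in the question can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the repository's actual implementation: the function names `single_texts` and `group_texts` are borrowed from the issue title, while `PAD_ID`, `EOS_ID`, `max_length`, `block_size`, and the toy token ids are hypothetical. The sketch also appends an EOS token after the last entry, which the repository may or may not do.

```python
from typing import List

PAD_ID = 0  # hypothetical padding token id
EOS_ID = 1  # hypothetical end-of-sequence token id


def single_texts(examples: List[List[int]], max_length: int) -> List[List[int]]:
    """Keep one entry per row: truncate, then right-pad to max_length."""
    rows = []
    for ids in examples:
        ids = ids[:max_length]
        rows.append(ids + [PAD_ID] * (max_length - len(ids)))
    return rows


def group_texts(examples: List[List[int]], block_size: int) -> List[List[int]]:
    """Concatenate all entries with EOS after each, then cut fixed-size blocks."""
    stream: List[int] = []
    for ids in examples:
        stream.extend(ids + [EOS_ID])
    usable = (len(stream) // block_size) * block_size  # drop the ragged tail
    return [stream[i:i + block_size] for i in range(0, usable, block_size)]


# Three toy "tokenized" entries of different lengths.
examples = [[5, 6, 7], [8, 9], [10, 11, 12, 13]]
single = single_texts(examples, max_length=4)
grouped = group_texts(examples, block_size=4)
print(single)
print(grouped)
```

In the single output every row holds exactly one entry plus padding, while in the grouped output a row can contain the end of one entry and the start of the next.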

agademic changed the title from "single_texts vs group_texts" to "[QUESTION] single_texts vs group_texts" on Mar 6, 2023
mallorbc (Owner) commented

While I have not tested the two, intuitively they make sense. With group texts, if your data entries are not related at all, grouping them together teaches the model, incorrectly, that some text follows some unrelated text.

If entries are statistically independent, I keep them separate. If they are related, such as a book or some general knowledge corpus for further pretraining, I would group them.
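
To make the concern about grouping unrelated entries concrete, one could count how many grouped blocks contain tokens from more than one entry. The following is a rough sketch, assuming blocks are lists of token ids, that `EOS_ID` (a hypothetical id) marks the end of each entry, and that no regular token shares that id. The helper name `blocks_spanning_entries` is made up for illustration.

```python
from typing import List

EOS_ID = 1  # hypothetical end-of-sequence token id


def blocks_spanning_entries(blocks: List[List[int]]) -> int:
    """Count blocks in which an EOS token is followed by tokens of another entry."""
    # An EOS anywhere except the last position means a new entry starts
    # inside the same block.
    return sum(1 for block in blocks if EOS_ID in block[:-1])


# Toy grouped blocks: the second one ends entry [8, 9] and starts the next entry.
blocks = [[5, 6, 7, 1], [8, 9, 1, 10], [11, 12, 13, 1]]
mixed = blocks_spanning_entries(blocks)
print(f"{mixed} of {len(blocks)} blocks mix entries")
```

If a large share of blocks mix entries and the entries are unrelated, that would be a reason to prefer keeping them separate, following the rule of thumb above.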

mallorbc (Owner) commented

Closing. Feel free to reopen if you are still confused.
