Word length for partitioning and setting the data set #3

Open
zzduoYI opened this issue Apr 13, 2024 · 0 comments

Hello, I would like to know how the cov-ctr dataset was divided and how the English word length was set. I observed that the average word length is 69, but your code sets "seq_sent_length": 300 and "num_medterms": 241. I don't understand which of these is the word length you set. Thank you for your work!
