About the repetition of the ground truth #18
Comments
Hi @LHRYANG -- Thank you for your interest in our work. Have you tried truncating the reference text to its first 128 tokens and then measuring the diversity?
Following your suggestion, I truncated the reference text by adding two lines to your original code (if len(token_list)>128:
The output is: The result of rep-2 is 4.53, rep-3 is 1.07, rep-4 is 0.37, and diversity is 0.94, which is still different from what you reported.
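For anyone following along, here is a minimal sketch of the truncate-then-measure step discussed above. It is not the repository's script: the function names are hypothetical, whitespace tokenization is an assumption, and pooling n-grams across all texts is an assumption about how the scores are aggregated. It uses the common definitions rep-n = 100 × (1 − unique n-grams / total n-grams) and diversity = the product of (1 − rep-n/100) for n = 2, 3, 4.

```python
def truncate(tokens, max_len=128):
    """Keep only the first max_len tokens (hypothetical helper)."""
    return tokens[:max_len]


def rep_n(token_lists, n):
    """Percentage of repeated n-grams, pooled over all token lists (assumption)."""
    total = 0
    unique = set()
    for tokens in token_lists:
        grams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
        total += len(grams)
        unique.update(grams)
    if total == 0:
        return 0.0  # no n-grams of this order, so nothing can repeat
    return 100.0 * (1.0 - len(unique) / total)


def diversity(token_lists):
    """Product of (1 - rep-n / 100) for n = 2, 3, 4."""
    score = 1.0
    for n in (2, 3, 4):
        score *= 1.0 - rep_n(token_lists, n) / 100.0
    return score


def measure(texts, max_len=128):
    """Whitespace-tokenize (assumption), truncate, then compute the metrics."""
    token_lists = [truncate(t.strip().split(), max_len) for t in texts]
    scores = {f"rep-{n}": rep_n(token_lists, n) for n in (2, 3, 4)}
    scores["diversity"] = diversity(token_lists)
    return scores
```

Calling `measure(list_of_reference_texts)` returns a dict with the keys `rep-2`, `rep-3`, `rep-4`, and `diversity`. If the original script tokenizes with a model tokenizer rather than whitespace, the 128-token cutoff falls at a different place in the text, which alone could shift the numbers.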
Hi @LHRYANG -- I will double check the results on my end. Feel free to report your replicated numbers in your work :-)
Hi Yixuan, I have a question about how the repetition rate of the ground truth is calculated. I used the code you provided:
I can reproduce the result you reported in your paper:
However, when I change the code "text = item['generated_result']['0']['continuation']" to "text = item['reference_continuation_text']", it outputs
This is different from the human score reported in your paper.
Could you help me solve this issue?
Thanks a lot!