Add CMB #376
Thank you very much for your contribution. CMB is indeed nice work. We're very sorry that we messed up the hash suffix of BBH last week, which led to us reopening an issue. Our apologies again.

I've noticed that your modifications involve the …

Additionally, PPL evaluation calculates the perplexity of the different prompt sentences built from the data. Given that CMB questions can have multiple correct options, I'm not sure whether a PPL evaluation is feasible. If it isn't, you can simply delete the PPL implementation. Otherwise, you might consider using PPL for questions with a single correct option and Gen for questions with multiple correct options. You can refer to the following implementation:
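The referenced implementation is not reproduced in this thread. As a rough illustration of the suggested split only, here is a minimal Python sketch. It assumes each CMB item stores its gold answer as a string of option letters (for example "A" or "ABD") and that a mean per-token negative log-likelihood is already available for each option prompt. The names `choose_eval_mode` and `pick_by_ppl` are hypothetical and are not OpenCompass APIs.

```python
import math
from typing import Dict


def choose_eval_mode(answer: str) -> str:
    """Route an item by its gold answer (assumed format: option letters).

    Exactly one letter -> 'ppl'; anything else (several letters) -> 'gen'.
    """
    letters = [ch for ch in answer.strip().upper() if ch.isalpha()]
    return "ppl" if len(letters) == 1 else "gen"


def pick_by_ppl(option_nlls: Dict[str, float]) -> str:
    """Pick the option whose prompt has the lowest perplexity.

    option_nlls maps an option letter to the mean per-token negative
    log-likelihood of the prompt built with that option (hypothetical input).
    """
    perplexities = {opt: math.exp(nll) for opt, nll in option_nlls.items()}
    return min(perplexities, key=perplexities.get)


# Hypothetical items: one single-answer question, one multi-answer question.
items = [{"answer": "A"}, {"answer": "ABD"}]
modes = [choose_eval_mode(item["answer"]) for item in items]
```

PPL scoring ranks candidate prompts against each other, so it only yields one winning option. That is why multi-answer questions are routed to generation in this sketch.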
Thanks for your kind advice. I have removed the PPL implementation and shared the specific data with @gaotongxiao.
Thanks for your contribution!
* Add CMB
* modify CMB

---------

Co-authored-by: wangxidong <xidongw@163.com>
Motivation
Add CMB dataset.
Modification
Add /opencompass/opencompass/datasets/cmb.py
Modify /opencompass/opencompass/models/__init__.py
Add /opencompass/configs/datasets/cmb
Add /opencompass/data/CMB/test.json and /opencompass/data/CMB/val.json