Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Add Xiezhi SQuAD2.0 ANLI #101

Merged
merged 4 commits into from
Aug 10, 2023

Conversation

Leymore
Copy link
Collaborator

@Leymore Leymore commented Jul 26, 2023

Thanks for your contribution and we appreciate it a lot. The following instructions would make your pull request more healthy and more easily get feedback. If you do not understand some items, don't worry, just make the pull request and seek help from maintainers.

Motivation

ANLI:
Adversarial NLI: A New Benchmark for Natural Language Understanding
https://github.com/facebookresearch/anli

Xiezhi:
Xiezhi (獬豸) is a comprehensive evaluation suite for Language Models (LMs).
https://github.com/MikeGu721/XiezhiBenchmark

SQuAD2.0:
Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to every question is a segment of text, or span, from the corresponding reading passage, or the question might be unanswerable.
https://rajpurkar.github.io/SQuAD-explorer/

@Leymore Leymore requested a review from gaotongxiao July 28, 2023 08:16
@gaotongxiao gaotongxiao merged commit e7fc54b into open-compass:main Aug 10, 2023
1 check passed
go-with-me000 pushed a commit to go-with-me000/opencompass that referenced this pull request Oct 9, 2023
* add Xiezhi SQuAD2.0 ANLI; update WSC

* update

* update

* update doc string
liuyaox pushed a commit to liuyaox/opencompass that referenced this pull request Jun 26, 2024
* add Xiezhi SQuAD2.0 ANLI; update WSC

* update

* update

* update doc string
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants