Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding the ANTIQUE dataset to BEIR #130

Open
heliah opened this issue Mar 12, 2023 · 4 comments
Open

Adding the ANTIQUE dataset to BEIR #130

heliah opened this issue Mar 12, 2023 · 4 comments

Comments

@heliah
Copy link

heliah commented Mar 12, 2023

Hi,

I am a creator of the ANTIQUE dataset -- a passage retrieval dataset for non-factoid questions answering. Please find the paper that explains the data here. I think it could be beneficial, and would like to add it to the BEIR benchmark. Please let me know if you need me to take any action.

Thank you,
Helia Hashemi (hhashemi@cs.umass.edu)

@thakur-nandan
Copy link
Member

thakur-nandan commented Mar 14, 2023

Hi @heliah,

Thank you for sharing the ANTIQUE dataset. Interestingly the dataset contains non-factoid questions, where retrieval models require to understand the meaning of the passages to judge their relevancy for a given query.

One question on the domain for the ANTIQUE dataset, this covers Wikipedia right?

Currently we are thinking of developing the next version of the BEIR benchmark and thinking of more diversity in terms of domains and tasks. We will reach out when we reach the dataset finalization stage.

Kind Regards,
Nandan Thakur

@heliah
Copy link
Author

heliah commented Mar 16, 2023 via email

@nreimers
Copy link
Member

Sounds interesting, would be nice to have it added to BEIR.

@heliah Do you have the files in the BEIR format?

@heliah
Copy link
Author

heliah commented May 7, 2023 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants