Hi,
I have a question regarding the number of queries in MSMARCO. According to the paper and the readme, the number of test queries in MSMARCO is 6,980.
However, when I ran the following code, I only got 43 queries.
>> corpus, queries, qrels = GenericDataLoader(data_folder='msmarco').load(split="test")
>> print(len(queries))
43
Instead, I got 6,980 queries from the dev set. Should I use the dev queries when evaluating MSMARCO instead of the test queries?
Thanks!
The split you are looking for is the "dev" split (so split="dev"). BEIR considers MSMARCO test to be one of the TREC-DL competitions.
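To check which split holds which queries without loading the whole corpus, one option is to count the distinct query IDs in each qrels file. The sketch below assumes the usual BEIR folder layout (a `qrels/` directory with one `<split>.tsv` per split, each with a header row followed by tab-separated `query-id`, `corpus-id`, `score` columns). `count_queries_per_split` is a hypothetical helper written for illustration; it is not part of BEIR.

```python
import csv
from pathlib import Path


def count_queries_per_split(data_folder):
    """Return {split name: number of distinct query IDs} for each qrels/*.tsv file.

    Assumes each file starts with a header row and that the first
    column of every following row is the query ID.
    """
    counts = {}
    for qrels_file in sorted(Path(data_folder, "qrels").glob("*.tsv")):
        with open(qrels_file, newline="", encoding="utf-8") as f:
            reader = csv.reader(f, delimiter="\t")
            next(reader, None)  # skip the header row
            query_ids = {row[0] for row in reader if row}
        counts[qrels_file.stem] = len(query_ids)
    return counts


# Example call, assuming the dataset was unpacked into ./msmarco:
# print(count_queries_per_split("msmarco"))
```

If the counts agree with what the loader reports (43 for `test` and 6,980 for `dev` in the question above), then `GenericDataLoader(data_folder='msmarco').load(split="dev")` is the call that returns the 6,980 queries.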
@cadurosar Thanks! It's clear now.