Group-QPP

This is the github repository of paper "Groupwise Query Performance Prediction with BERT" in ECIR2022.

You can find the Groupwise QPP model(COBERT) and Vanilla BERT model for QPP here.

You may need to prepare the previous baseline files properly as the paper demonstrates.

You can run the pipeline_datasetname.sh to get the results, but before running the shell, you should carefully config the parameters to make sure they're right. The data processing part is kind of complicate and I will make a more detailed instruction later.

You need to tokenize your data into a .csv file, formed as "topic_id, docid, bias(we use maxp in our experiments,this shows the position of the passage in its long document, not used in our main experiments), input_id, attention_mask, segment_mask, qrel_score(0 or 1)", one record per line, and different data fields are separated by commas.

And other files like BERT checkpoints, qrel files, topic id list (one id per line), and initial QPP file (formed as "topic_id score") are also needed in our experiments.

By the way, you don't need to split the id into train and test for 30 random rounds by yourself. Our code will do it for you.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
code		code
shells		shells
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

code

code

shells

shells

README.md

README.md

environment.yml

environment.yml

Repository files navigation

Group-QPP

About

Releases

Packages

Languages

VerdureChen/Group-QPP

Folders and files

Latest commit

History

Repository files navigation

Group-QPP

About

Resources

Stars

Watchers

Forks

Languages