CQSUMDP

The dataset is available here: https://drive.google.com/drive/folders/172qZJDwzQK0wbB_vqEd9EA7U8vuYPfYp?usp=sharing

The dataset is formatted so that baseline experiments with HuggingFace's summarization models can be run quickly: the "text" column combines the input "query" and the "document", and the "summary" column holds the gold reference summary.
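As a rough illustration of that layout, the sketch below reads (text, summary) pairs from a file. It assumes the data is stored as CSV, which this README does not state; the `load_pairs` helper and the sample row are hypothetical and only stand in for a real file from the download folder.

```python
import csv
import io


def load_pairs(fileobj):
    """Return (text, summary) pairs from a CSV file object.

    Assumption: the file is CSV with "text" and "summary" header columns.
    """
    reader = csv.DictReader(fileobj)
    return [(row["text"], row["summary"]) for row in reader]


# Made-up sample row; replace with open("<your file>.csv", newline="") for real data.
sample = io.StringIO(
    "text,summary\n"
    '"example query example document","example summary"\n'
)
pairs = load_pairs(sample)
print(pairs[0])
```

If the files turn out to be in another format (for example JSON lines), only the reader inside `load_pairs` would need to change.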

The original query and the re-annotated queries are also included in the file. A "Type" column records whether the LLM classified the query as relevant or irrelevant.

If the LLM considers the query relevant, the "Type" value is "yes"; otherwise it is "no".
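A minimal sketch of how the "Type" column could be used to separate relevant from irrelevant queries follows. It again assumes a CSV file; the `split_by_relevance` helper and the sample rows are hypothetical, and lowercasing the label is a precaution rather than something the dataset is known to require.

```python
import csv
import io


def split_by_relevance(fileobj):
    """Split rows into (relevant, irrelevant) lists using the "Type" column."""
    relevant, irrelevant = [], []
    for row in csv.DictReader(fileobj):
        # "yes" marks a query the LLM judged relevant; anything else is treated as "no".
        label = row["Type"].strip().lower()
        if label == "yes":
            relevant.append(row)
        else:
            irrelevant.append(row)
    return relevant, irrelevant


# Made-up rows for illustration only.
sample_rows = io.StringIO(
    "text,summary,Type\n"
    "query A document A,summary A,yes\n"
    "query B document B,summary B,no\n"
    "query C document C,summary C,yes\n"
)
relevant_rows, irrelevant_rows = split_by_relevance(sample_rows)
print(len(relevant_rows), len(irrelevant_rows))
```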