isekulic/chiir24-answer-rewriting

Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search

This repository contains conversational information-seeking (CIS) data from QReCC, with an annotated sample of 360 question-answer pairs that characterizes entity salience in CIS. We also release crowd workers' assessments of answer rewrite type preferences, which include both the preferred rewrite type and a free-text reason for that choice.

The resources in the repository are developed for our CHIIR'24 paper: "Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search", by Ivan Sekulić¹, Krisztian Balog², and Fabio Crestani¹.

¹ Università della Svizzera italiana, Lugano, Switzerland

² University of Stavanger, Stavanger, Norway

Entity salience annotations

entity_salience.csv contains question-answer pairs from QReCC data, with entities extracted from the given answers. The entities are associated with their salience scores, which are the average of three annotations done by crowd workers.

Each row in the .csv contains:

  • QID - concatenation of Conversation_no and Turn_no, identifying the question in the QReCC dataset;
  • Question - question text;
  • Answer - text of the answer to the question;
  • Entity - entity extracted from the answer;
  • Salience (float) - salience score of the entity, i.e., how essential the entity is for comprehending the answer to the question. The salience score is a real number between 0 and 2.
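As an illustration of how these columns can be used, the following minimal sketch groups rows by QID and keeps the most salient entities per question. It assumes the CSV header uses exactly the column names listed above (QID, Entity, Salience); the function name `top_salient_entities` and the sample rows are hypothetical and not taken from the dataset.

```python
import csv
import io
from collections import defaultdict


def top_salient_entities(csv_file, n=2):
    """Return {QID: [(entity, salience), ...]} with the n most salient entities per question."""
    by_qid = defaultdict(list)
    for row in csv.DictReader(csv_file):
        # Assumed header names: "QID", "Entity", "Salience".
        by_qid[row["QID"]].append((row["Entity"], float(row["Salience"])))
    return {
        qid: sorted(pairs, key=lambda pair: pair[1], reverse=True)[:n]
        for qid, pairs in by_qid.items()
    }


# Made-up rows in the assumed layout; the QID format here is illustrative only.
sample = io.StringIO(
    "QID,Question,Answer,Entity,Salience\n"
    "1_1,Who wrote it?,It was written by A in B.,A,2.0\n"
    "1_1,Who wrote it?,It was written by A in B.,B,0.67\n"
)
result = top_salient_entities(sample, n=1)
print(result)
```

To run this on the released file, pass an open file handle instead of the in-memory sample, e.g. `open("entity_salience.csv", newline="", encoding="utf-8")`.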

Rewrite type preference

preference_type_choice_and_reason.csv contains crowd workers' annotations for the answer rewrite type preference.

Column descriptions:

  • QID - question ID in the QReCC dataset;
  • Question - question text;
  • Input.answer_{1, 2, 3} - answer rewrites offered to the crowd worker as a multiple-choice inquiry. The three options are always, in shuffled order, the original answer, the answer with inline entity explanations from Wikibase, and the answer with a clarifying prompt;
  • Input.top_N - the number (N) of most salient entities that are explained inline or offered for clarification in the prompt;
  • Preference - rewrite type preference of the worker (one of original, wikibase, or prompt);
  • Answer.reason - textual description of the reason why the worker chose the rewrite type.
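A simple way to summarize these annotations is to count how often each rewrite type was preferred for each value of N. The sketch below assumes the CSV header matches the column names listed above and that Input.top_N holds an integer; the function name `preference_counts` and the sample rows are hypothetical.

```python
import csv
import io
from collections import Counter


def preference_counts(csv_file):
    """Count preferred rewrite types, keyed by (top_N, preference)."""
    counts = Counter()
    for row in csv.DictReader(csv_file):
        # Assumed header names: "Input.top_N" and "Preference".
        counts[(int(row["Input.top_N"]), row["Preference"])] += 1
    return counts


# Made-up rows in the assumed layout; not taken from the dataset.
sample = io.StringIO(
    "QID,Question,Input.answer_1,Input.answer_2,Input.answer_3,"
    "Input.top_N,Preference,Answer.reason\n"
    "1_1,Q?,a,b,c,1,wikibase,Helpful context\n"
    "1_1,Q?,a,b,c,1,original,Short and clear\n"
    "2_3,Q?,a,b,c,2,wikibase,Explains the terms\n"
)
counts = preference_counts(sample)
for (top_n, preference), count in sorted(counts.items()):
    print(top_n, preference, count)
```

For the released file, replace the in-memory sample with `open("preference_type_choice_and_reason.csv", newline="", encoding="utf-8")`.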

Citation

If you find the resource useful, please cite as:

@inproceedings{Sekulic:2024:CHIIR,
    author = {Sekuli\'c, Ivan and Balog, Krisztian and Crestani, Fabio},
    title = {Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search},
    year = {2024},
    booktitle = {Proceedings of the 2024 Conference on Human Information Interaction and Retrieval},
    pages = {tbd},
    series = {CHIIR '24}
}
