Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search

This repository contains conversational information-seeking (CIS) data from QReCC, with annotated sample of 360 question-answer pairs to characterize entity saliency in CIS. Moreover, we realease crowd workers' assessments of answer rewrite type preferences. These include both the specific type preferred and a reason for choosing the specific type in textual format.

The resources in the repository are developed for our CHIIR'24 paper: "Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search", by Ivan Sekulić¹, Krisztian Balog², and Fabio Crestani¹.

¹ Università della Svizzera italiana, Lugano, Switzerland

² University of Stavanger, Stavanger, Norway

Entity salience annotations

entity_salience.csv contains question-answer pairs from QReCC data, with entities extracted from the given answers. The entities are associated with their salience scores, which are the average of three annotations done by crowd workers.

Each row in the .csv contains:

QID - concatenation of Conversation_no and Turn_no, indicating question ID in QReCC dataset;
Question - question text;
Answer - text of the answer to the question;
Entity - entity extracted from the answer;
Salience (float) - salience score of the entity, i.e., how essential is the entity for comprehending the answer to the question. Salience score is a real number between 0 and 2.

Rewrite type preference

preference_type_choice_and_reason.csv contains crowd workers' annotations for the answer rewrite type preference.

Columns descriptions:

QID - question ID in QReCC dataset;
Question - question text;
Input.answer_{1, 2, 3} - answer rewrites offered as multiple choice inqury to the crowd worker. It always contains (shuffled) original answer, answer with inline entity explanations from Wikibase, and answer with clarifying prompt.
Input.top_N - how many (N) most salient entities are provided explanations for or offered to clarify in the prompt;
Preference - rewrite type preference of the worker (one of original, wikibase, or prompt);
Answer.reason - textual description of the reason why the worker chose the rewrite type.

Citation

If you find the resource useful, please cite as:

@inproceedings{Sekulic:2024:CHIIR,
    author = {Sekuli\'c, Ivan and Balog, Krisztian and Crestani, Fabio},
    title = {Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search},
    year = {2024},
    booktitle = {Proceedings of the 2024 Conference on Conference Human Information Interaction and Retrieval},
    pages = {tbd},
    series = {CHIIR '24}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data		data
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search

Entity salience annotations

Rewrite type preference

Citation

About

Releases

Packages

isekulic/chiir24-answer-rewriting

Folders and files

Latest commit

History

Repository files navigation

Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search

Entity salience annotations

Rewrite type preference

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages