Skip to content
/ MeeQA Public

The code and data for MeeQA: Natural Questions in Meeting Transcripts

Notifications You must be signed in to change notification settings

reutapel/MeeQA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MeeQA

The code and data for MeeQA: Natural Questions in Meeting Transcripts

Abstract

We present MeeQA, a dataset for natural-language question answering over meeting transcripts. It includes real questions asked during meetings by its participants. The dataset contains 48K question-answer pairs, extracted from 422 meeting transcripts, spanning multiple domains. Questions in transcripts pose a special challenge as they are not always clear, and considerable context may be required in order to provide an answer. Further, many questions asked during meetings are left unanswered. To improve baseline model performance on this type of questions, we also propose a novel loss function, Flat Hierarchical Loss, designed to enhance performance over questions with no answer in the text. Our experiments demonstrate the advantage of using our approach over standard QA models.

About

The code and data for MeeQA: Natural Questions in Meeting Transcripts

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages