Skip to content

How to build a custom data set for Question Answering #2207

Discussion options

You must be logged in to vote

Hi @gabriead! To train an extractive QA model, you would need a context which contains the answer and the exact position of the answer inside this context. Therefore, you would need to map your question-answer pairs to a document containing the answer and extract the position of the answer. However, you might use your data to do open-domain evaluation, as this does not require to extract the exact position of an answer. Like this, you can check whether the existing models are already good enough for your use case such that you don't need to train a custom model. See this blog post for more information on evaluation.

As to how many labels are needed to do reasonable training: This depends …

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by gabriead
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants