Implement a Natural Questions Style Question Answering System #334
Conversation
@tholor Before merging, it would also be good to check that the output of the SQuAD-style Inferencer is still compatible with Haystack. I have a suspicion that the output might be nested in another list() layer.
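The suspected extra nesting could be checked and removed with a small guard like this (a sketch; the shape of the Inferencer output here is an assumption, not the actual API):

```python
def flatten_one_level(predictions):
    """Remove one accidental layer of list nesting, if present.

    If every element is itself a list, return the concatenation of the
    inner lists; otherwise return the input unchanged.
    """
    if predictions and all(isinstance(p, list) for p in predictions):
        return [item for inner in predictions for item in inner]
    return predictions

# Hypothetical output wrapped in one extra list() layer:
nested = [[{"answer": "Paris", "score": 0.9}]]
print(flatten_one_level(nested))  # [{'answer': 'Paris', 'score': 0.9}]
```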
Great work! This is a huge PR. As discussed, I am happy to merge an initial version and do some clean-ups later (many of my comments). There is, however, at least one change required before merging: the Inferencer format doesn't comply with latest master (we moved num_processes from the methods to the constructor).
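The required change can be illustrated with a minimal before/after sketch (class and method bodies here are hypothetical stand-ins, not the real FARM Inferencer):

```python
class Inferencer:
    def __init__(self, model, num_processes=None):
        self.model = model
        # After the change on master, the pool size is fixed once at
        # construction time (previously a per-call method argument).
        self.num_processes = num_processes

    def inference_from_dicts(self, dicts):
        # Old signature: inference_from_dicts(self, dicts, num_processes=...)
        # Now the value stored on the instance is used instead. A real
        # implementation would spawn a multiprocessing.Pool of this size.
        pool_size = self.num_processes or 1
        assert pool_size >= 1
        return [self.model(d) for d in dicts]

# Usage sketch with a dummy "model":
inferencer = Inferencer(model=len, num_processes=2)
print(inferencer.inference_from_dicts(["ab", "abc"]))  # [2, 3]
```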
return "", -1

def unify_short_answers(self, short_answers, doc_text, tok_to_ch):
    """ In cases where an NQ sample has multiple disjoint short answers, this fn generates the single shortest
We should verify that this merge of short answers produces meaningful answers. If the disjoint short answers are in very different places of the passage, we will get a very long answer. Not a blocker for merging, but we should test it at some point before we train models for production.
Yes, I agree! Will put it on the backlog.
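The failure mode discussed above is easy to reproduce: taking the single shortest span that covers all disjoint short answers blows up when the answers sit far apart in the passage. A minimal sketch (the function name mirrors the one in the diff, but its internals here are an assumption):

```python
def unify_short_answers(spans, doc_text):
    """Merge disjoint (start, end) character spans into the single
    shortest span that covers all of them, as the PR's docstring describes."""
    start = min(s for s, _ in spans)
    end = max(e for _, e in spans)
    return doc_text[start:end], (start, end)

# Two 4-5 character short answers far apart yield a 100+ character answer:
doc = "Alice was born in 1901. " + "x " * 50 + "She died in Paris."
answer, span = unify_short_answers([(18, 22), (136, 141)], doc)
print(len(answer))  # far longer than either short answer
```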
answers_clear.append(curr_answer_clear)
answers_tokenized.append(curr_answer_tokenized)
return answers_clear, answers_tokenized
def create_samples_qa(dictionary, max_query_len, max_seq_len, doc_stride, n_special_tokens):
Do we still need the above create_samples_squad(), or is this one used for both SQuAD and NQ now?
We don't need it anymore - will delete when all tests pass
* Flatten prediction lists in adaptive model
* Add SQuAD question id to output dict in inference mode
I have thoroughly tested the code, especially the changes to SQuAD processing, but also other tasks like NER and doc_classification. This looks good. Two NQ-related things:
…into natural_questions
There is still much missing, but I think we could safely merge this branch without destroying too many other tasks : ) We have trained a model on NQ and evaluated it on an NQ development set that has been converted into a format usable by Haystack (it comprises just short answers or unanswerable questions). I have hijacked the SQuAD evaluation scripts to evaluate both the NQ model and a SQuAD model on this NQ development subset. NQ model: SQuAD model:
This implements a Natural Questions-style QA system which can return [span, is_impossible, yes, no] predictions. This is done with two prediction heads (QA and text classification) whose outputs are merged using QuestionAnsweringHead.merge().
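The head-merging idea can be sketched roughly as follows (the merge rule shown here is an illustrative assumption, not the actual QuestionAnsweringHead.merge() implementation):

```python
from dataclasses import dataclass

@dataclass
class QAPrediction:
    label: str    # "span", "is_impossible", "yes", or "no"
    answer: str   # answer text for span predictions, "" otherwise
    score: float

def merge_heads(span_pred, cls_label, cls_score):
    """Combine the QA head's best span with the classification head's
    verdict into one NQ-style prediction.

    If the classifier votes yes/no/is_impossible with higher confidence
    than the span head, its label wins; otherwise the span is returned.
    """
    if cls_label != "span" and cls_score > span_pred.score:
        return QAPrediction(label=cls_label, answer="", score=cls_score)
    return span_pred

span = QAPrediction("span", "in 1901", 0.7)
print(merge_heads(span, "yes", 0.9))  # classifier wins: a "yes" answer
print(merge_heads(span, "yes", 0.4))  # span head wins: the span answer
```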
It is worth mentioning a few limitations of this implementation:
* Prediction objects are now implemented for QA but not for any other task
* The structure of Baskets and Samples differs between SQuAD and NQ and needs to be cleaned up in a future PR
* The apply_tokenization() functions of the SQuAD and NQ processors are very similar and may need to be merged in the future
* When performing NQ inference, no probs are returned from the text classification head
* While data flows end to end, it has not been tested for performance
* Preprocessing of the data is not optimized, so NQ is still very slow