Integrate sentence transformers into benchmarks #843

Timoeller · 2021-02-17T18:24:28Z

Would be nice to compare the new sentencetransformers models, especially the
SentenceTransformer('nq-distilbert-base-v1') and how it compares against DPR finetuned on NQ.

Made sure to use cosine similarity (so had to use ES doc store).
Documents for Sentence Transformers should be a list of [title, text] lists.

Preliminary results do not look better than DPR:
for 100k docs we have mAP of 82.7 SBert vs 86.5 DPR
Details:

100k reference docs
'retriever': 'sentence_transformers', 'doc_store': 'elasticsearch', 'n_docs': 100000, 
'n_queries': 5637, 'retrieve_time': 1288.7816320069996, 
'queries_per_second': 4.37389846348259, 'seconds_per_query': 0.22862899272786938, 
'recall': 93.79102359411034, 'map@10': 82.74644426986096, 
'top_k': 10, 'date_time': datetime.datetime(2021, 2, 17, 19, 29, 25, 961023), 'error': None}


500k reference docs
{'retriever': 'sentence_transformers', 'doc_store': 'elasticsearch', 'n_docs': 500000, 
'n_queries': 5637, 'retrieve_time': 5642.244277841075, 
'queries_per_second': 0.9990705333582115, 'seconds_per_query': 1.0009303313537476, 
'recall': 89.62213943587014, 'map@10': 76.49362488771762, 
'top_k': 10, 'date_time': datetime.datetime(2021, 2, 17, 23, 44, 11, 757607), 'error': None}

test/benchmarks/utils.py

Timoeller · 2021-02-18T15:26:03Z

I checked the data and it seems ok.

No answers as well as long answers are removed from NQ dev set.
We put in 100 word passages containing an answer string as positive passages.

Lets revert the config and fix the mypy bug, then we are ready to merge from my side. What do you think @brandenchan ?

brandenchan

This all looks good to me. Just one very tiny comment. Its ready for merge as far as I'm concerned once the mypy bug is fixed and the config is reverted.

test/benchmarks/retriever.py

Timoeller · 2021-04-08T17:28:27Z

Hey @brandenchan I fixed mypy and reverted the config.

Could you double check, also with the conflicting docs files, so that we can merge?

brandenchan · 2021-04-08T19:08:04Z

Hey @brandenchan I fixed mypy and reverted the config.

Could you double check, also with the conflicting docs files, so that we can merge?

Ok nice! The conflicting docs seem to be because new arguments were added to functions in master, we updated doc strings and the api documentation was regenerated. I would in each conflict case take the change from master

brandenchan

Once documentation conflicts are resolved, this PR is ready for merge

…enchmark

Timoeller added 2 commits February 17, 2021 19:20

Integrate sentence transformers into benchmarks

3791930

Make work

19cc240

Timoeller commented Feb 17, 2021

View reviewed changes

test/benchmarks/utils.py Show resolved Hide resolved

Add doc store asserts

5522970

brandenchan approved these changes Feb 18, 2021

View reviewed changes

test/benchmarks/retriever.py Outdated Show resolved Hide resolved

tholor and others added 2 commits February 19, 2021 14:48

switch data downloads from s3 client to https. add license info

1ed9b31

Fix mypy, revert config

6a50d92

Timoeller changed the title ~~WIP: Integrate sentence transformers into benchmarks~~ Integrate sentence transformers into benchmarks Apr 8, 2021

Add latest docstring and tutorial changes

43e137b

Timoeller requested a review from brandenchan April 8, 2021 17:27

brandenchan approved these changes Apr 8, 2021

View reviewed changes

Timoeller and others added 2 commits April 9, 2021 17:09

Merge remote-tracking branch 'origin/master' into add_sentencetrans_b…

fba0966

…enchmark

Add latest docstring and tutorial changes

fe011d4

Timoeller merged commit 837dea4 into master Apr 9, 2021

Timoeller deleted the add_sentencetrans_benchmark branch April 9, 2021 15:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate sentence transformers into benchmarks #843

Integrate sentence transformers into benchmarks #843

Timoeller commented Feb 17, 2021 •

edited

Loading

Timoeller commented Feb 18, 2021

brandenchan left a comment •

edited

Loading

Timoeller commented Apr 8, 2021

brandenchan commented Apr 8, 2021

brandenchan left a comment

Integrate sentence transformers into benchmarks #843

Integrate sentence transformers into benchmarks #843

Conversation

Timoeller commented Feb 17, 2021 • edited Loading

Timoeller commented Feb 18, 2021

brandenchan left a comment • edited Loading

Choose a reason for hiding this comment

Timoeller commented Apr 8, 2021

brandenchan commented Apr 8, 2021

brandenchan left a comment

Choose a reason for hiding this comment

Timoeller commented Feb 17, 2021 •

edited

Loading

brandenchan left a comment •

edited

Loading