Add latest benchmark run #652
Noticeable changes:
|
A bit curious about this odd number. A few minor things observed:
|
We run this on the Natural Questions eval dataset, which includes 5791 questions.
At this point, it's always a p3.2xlarge with a V100 GPU, and we list those details on https://haystack.deepset.ai/bm/benchmarks/
Seems like very minor things to me, but feel free to fix it :)
Not a priority for now, but I could see that becoming relevant at a later point in time...
The scales of the accuracy and speed measures are very different by default. #675 should change this so we don't have to manually edit the data.
I manually checked this on the staging website environment and it looks good to me.
Add results from the latest full benchmark run
Reader
Retriever Indexing
Retriever Querying