
Fix DPR training batch size #898

Merged
merged 5 commits into from
Mar 17, 2021
Conversation

@brandenchan (Contributor) commented Mar 17, 2021

We found that we could actually fit 16 samples per batch on a V100 GPU when training a DPR model with:

max_seq_len_query=64,
max_seq_len_passage=256

As pointed out in #896, the training batch size in the paper is actually 128, so we have set grad_acc_steps=8 to match the effective batch size of the original experiment. We have also updated the expected performance metrics after training.
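The arithmetic behind this change can be sketched as follows. This is an illustrative snippet, not the Haystack implementation: the names `BATCH_SIZE`, `GRAD_ACC_STEPS`, and `train_steps` are hypothetical, chosen only to show how accumulating gradients over 8 micro-batches of 16 reproduces the paper's batch size of 128 with one optimizer update per accumulation cycle.

```python
# Hypothetical sketch of gradient accumulation (not Haystack code).
BATCH_SIZE = 16      # samples that fit on one V100 at the given seq lengths
GRAD_ACC_STEPS = 8   # micro-batches accumulated before each optimizer step

# Effective batch size matches the DPR paper's 128.
effective_batch_size = BATCH_SIZE * GRAD_ACC_STEPS


def train_steps(num_samples: int) -> int:
    """Count optimizer updates: one update per GRAD_ACC_STEPS micro-batches."""
    micro_batches = num_samples // BATCH_SIZE
    return micro_batches // GRAD_ACC_STEPS


# 1280 samples -> 80 micro-batches -> 10 optimizer updates.
```

Because the loss gradients are summed (or averaged) over the 8 micro-batches before the weights are updated, each update sees the same number of samples as a single 128-sample batch would, without needing the memory to hold 128 samples at once.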


@Timoeller (Contributor) left a comment:

LG

@brandenchan brandenchan merged commit 24d0c4d into master Mar 17, 2021
@brandenchan brandenchan deleted the fix_dpr_bs2 branch March 17, 2021 17:34