fix: num_return_sequences should be less than num_beams, not top_k #5280

faaany · 2023-07-05T16:59:37Z

Related Issues

fixes num_return_sequences should be less than num_beams, not top_k #5279

Proposed Changes:

use num_beams instead of top_k and remove top_k, because the real top_k is used for top-k sampling or sampling decoding methods not beam search
let user set the num_return_sequences, not by us

How did you test it?

manual test

Notes for the reviewer

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.
I documented my code
I ran pre-commit hooks and fixed any issue

coveralls · 2023-07-05T17:17:52Z

Pull Request Test Coverage Report for Build 5518507439

0 of 0 changed or added relevant lines in 0 files are covered.
180 unchanged lines in 3 files lost coverage.
Overall coverage increased (+0.2%) to 45.011%

Files with Coverage Reduction	New Missed Lines	%
nodes/prompt/invocation_layer/hugging_face.py	3	91.73%
document_stores/elasticsearch/es7.py	8	80.43%
nodes/retriever/dense.py	169	26.45%

Totals
Change from base Build 5487781909:	0.2%
Covered Lines:	10379
Relevant Lines:	23059

💛 - Coveralls

julian-risch

@faaany Thank you for this contribution! The pull request looks very good to me already. Just two small things that could be improved and then we can merge it.
The warning message should include both values num_beams and num_return_sequences to make it easier for the user to understand how to set the parameters correctly.
Second, could you please add a small unit test that checks that the warning is displayed if
num_return_sequences > num_beams? You can add the test to the file test/prompt/invocation_layer/test_hugging_face.py and it should be similar to the test called test_ensure_token_limit_negative here: https://github.com/deepset-ai/haystack/blob/main/test/prompt/invocation_layer/test_hugging_face.py#L189
You shouldn't need a real model for that test so you can use mocking as in this other test:

haystack/test/prompt/invocation_layer/test_hugging_face.py

Line 254 in 08f1865

def test_constructor_with_various_kwargs(mock_pipeline, mock_get_task):

Happy to help with the mocking if that part is unclear. Just let me know. 🙂

… top_k_issue

faaany · 2023-07-07T14:57:00Z

done. My test run through, but I am not quite sure whether there is a better way to write this test. Feel free to leave comments, so I can update it quickly and write better unit tests in the future. Thanks a lot!

… top_k_issue

faaany · 2023-07-11T01:43:00Z

@julian-risch any feedback on this?

julian-risch

@faaany Thank you for another contribution to Haystack! The pull request looks very good to me now. 👍 I extended the tests a bit to also check that num_return_sequences is set to num_beams as expected. I also changed the docstring describing the test accordingly. The init of HFLocalInvocationLayer I was able to simplify. Have a look if you're interested. Your test was quite good already before this final polishing step. Great work! 🙂

faaany · 2023-07-11T13:56:58Z

@faaany Thank you for another contribution to Haystack! The pull request looks very good to me now. 👍 I extended the tests a bit to also check that num_return_sequences is set to num_beams as expected. I also changed the docstring describing the test accordingly. The init of HFLocalInvocationLayer I was able to simplify. Have a look if you're interested. Your test was quite good already before this final polishing step. Great work! 🙂

Wow, the tests look really good now! Thanks so much! Learned new stuff again. lol

…top_k (#5280)" This reverts commit 514f93a.

…top_k (#5280)" (#5434) This reverts commit 514f93a.

formatting

3418538

faaany requested a review from a team as a code owner July 5, 2023 16:59

faaany requested review from julian-risch and removed request for a team July 5, 2023 16:59

faaany and others added 2 commits July 5, 2023 18:21

remove top_k variable

8b8b997

Merge branch 'main' into top_k_issue

bf2ef08

julian-risch requested changes Jul 6, 2023

View reviewed changes

faaany and others added 4 commits July 7, 2023 21:49

Merge branch 'deepset-ai:main' into top_k_issue

ceb7ff4

add pytest

ee7e87a

add numbers

afef75e

Merge branch 'top_k_issue' of https://github.com/faaany/haystack into…

1345c92

… top_k_issue

github-actions bot added the topic:tests label Jul 7, 2023

faaany and others added 5 commits July 7, 2023 08:07

string formatting

0be3345

Merge branch 'deepset-ai:main' into top_k_issue

4a76b33

fix formatting

0c183a3

Merge branch 'top_k_issue' of https://github.com/faaany/haystack into…

4e8547e

… top_k_issue

revert

54b9068

julian-risch self-requested a review July 11, 2023 07:07

extend tests with assertions for num_return_sequences

e744559

github-actions bot added the type:documentation Improvements on the docs label Jul 11, 2023

julian-risch approved these changes Jul 11, 2023

View reviewed changes

julian-risch merged commit 514f93a into deepset-ai:main Jul 11, 2023

julian-risch added a commit that referenced this pull request Jul 25, 2023

Revert "fix: num_return_sequences should be less than num_beams, not …

03de7d1

…top_k (#5280)" This reverts commit 514f93a.

This was referenced Jul 25, 2023

Revert "fix: num_return_sequences should be less than num_beams, not … #5434

Merged

num_return_sequences must not be larger than num_beams in HFLocalInvocationLayer #5436

Closed

julian-risch added a commit that referenced this pull request Jul 25, 2023

Revert "fix: num_return_sequences should be less than num_beams, not …

5bb0a1f

…top_k (#5280)" (#5434) This reverts commit 514f93a.

anakin87 pushed a commit that referenced this pull request Jul 25, 2023

Revert "fix: num_return_sequences should be less than num_beams, not …

1ae70e9

…top_k (#5280)" (#5434) This reverts commit 514f93a.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: num_return_sequences should be less than num_beams, not top_k #5280

fix: num_return_sequences should be less than num_beams, not top_k #5280

faaany commented Jul 5, 2023 •

edited by julian-risch

Loading

coveralls commented Jul 5, 2023 •

edited

Loading

julian-risch left a comment

faaany commented Jul 7, 2023 •

edited

Loading

faaany commented Jul 11, 2023

julian-risch left a comment

faaany commented Jul 11, 2023

fix: num_return_sequences should be less than num_beams, not top_k #5280

fix: num_return_sequences should be less than num_beams, not top_k #5280

Conversation

faaany commented Jul 5, 2023 • edited by julian-risch Loading

Related Issues

Proposed Changes:

How did you test it?

Notes for the reviewer

Checklist

coveralls commented Jul 5, 2023 • edited Loading

Pull Request Test Coverage Report for Build 5518507439

💛 - Coveralls

julian-risch left a comment

Choose a reason for hiding this comment

faaany commented Jul 7, 2023 • edited Loading

faaany commented Jul 11, 2023

julian-risch left a comment

Choose a reason for hiding this comment

faaany commented Jul 11, 2023

faaany commented Jul 5, 2023 •

edited by julian-risch

Loading

coveralls commented Jul 5, 2023 •

edited

Loading

faaany commented Jul 7, 2023 •

edited

Loading