Misleading situation when using the recall keyword #323

Closed
sephib opened this issue Apr 26, 2022 · 5 comments

sephib commented Apr 26, 2022

Hi,
I think the recall keyword is misleading with respect to the number of results returned from app.query.

When running

query_results = app.query(
    query=query,
    query_model=query_model,
    recall=recall_docs,
)
query_results.get_hits()

The number of results is 10 (the default number of hits).

I think the default number of hits should be the number of docs in recall_docs:

query_results = app.query(
    query=query,
    query_model=query_model,
    recall=recall_docs,
    hits=len(_recall_docs),
)
query_results.get_hits()

If the number of recall docs is less than 10, I would expect to get fewer than 10 results. Likewise, when the number of docs in the recall list is more than 10, I would expect to get hits for all the docs in the recall list.

Does this sound reasonable?
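
The proposal above can be sketched as a small helper that derives the hits value from the recall argument. This is only an illustration: `hits_for_recall` and `DEFAULT_HITS` are hypothetical names, not part of the library, and the sketch assumes the recall argument is a `(field_name, list_of_ids)` tuple, which this thread does not confirm.

```python
from typing import Optional, Sequence, Tuple

# Assumed shape of the recall argument: (field_name, [document ids]).
Recall = Tuple[str, Sequence]

DEFAULT_HITS = 10  # the default number of hits reported in this issue


def hits_for_recall(recall: Optional[Recall], default_hits: int = DEFAULT_HITS) -> int:
    """Hypothetical helper: request one hit per recalled document."""
    if recall is None:
        # No recall list given, so fall back to the default page size.
        return default_hits
    _field, doc_ids = recall
    return len(doc_ids)


# Twelve ids, i.e. more than the default of 10 hits.
recall_docs = ("id", [1, 5, 8, 13, 21, 34, 55, 89, 144, 233, 377, 610])
hits = hits_for_recall(recall_docs)

# The value would then be passed explicitly, as in the second snippet above:
# query_results = app.query(query=query, query_model=query_model,
#                           recall=recall_docs, hits=hits)
```

Passing the value explicitly keeps the behaviour visible at the call site, whatever default the library or the query model applies.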

Member

kkraune commented Apr 27, 2022

@sephib please note that Thiago is not working on this right now, so it may take some days to get a response.

Contributor

thigm85 commented May 24, 2022

Hi @sephib, when I run an example with recall I get the number of hits equal to the number of documents I want to recall, which I think is the behavior that you expect.

For example, if you run this query notebook, you will see that at the end we ask to recall two of the top 3 documents and get exactly 2 documents back. Right?

Author

sephib commented May 26, 2022

The difference between your notebook example and my code is that you provide the body, while I provide a query_model.
If the query_model has a hits property, it overrides the length of the recall document list. Unfortunately, I can't currently verify this.

Contributor

thigm85 commented May 26, 2022

Could you paste the code you used for me to try to reproduce the issue?

Contributor

thigm85 commented Nov 2, 2022

Closing for lack of activity.

thigm85 closed this as completed Nov 2, 2022