-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Same kind of answers are not generating from similar kind of documents #1308
Comments
Hi @SAIVENKATARAJU , my first suspicion would be that each of the product manuals might be split up differently by the PreProcessor. Maybe for one manual, the answer comes at the very beginning of a passage and in another, it is at the end. This can influence the model's predictions. If you are using an ElasticsearchRetriever, one thing you could try to resolve this would be to increase the PreProcessor's If this doesn't have the desired impact, we might have to dig deeper by looking at the Retriever's predictions using an EvalDocument Node. This might give us a better sense of which component exactly is causing the differing predictions across the different documents. |
Hey @brandenchan |
Hey @brandenchan , No good improvement with parameters tuning. is it good idea to create different index for different files? |
Hi @SAIVENKATARAJU , what's your intention with creating an index for each file? Is it that you would like to perform your query on just one of your files? (In this case I would recommend using metadata filtering) The next thing I would recommend would be to actually see the output of the retriever. To do this you want to initialize an |
Hi @SAIVENKATARAJU, since your last post we have implemented new debugging features (see this PR: #1558). Please check out the new master, test it out again, and if you still face the same problemlet us know by opening a new issue. Closing this for now. |
Hi,
I am trying to build extractive QA for watersoftner , I have product manuals for 4 different models. each of the manual contains some info about the WS , do's and dont's. most of the info is same across all the docs, but regarding specifications it may vary from doc to doc. I was trying to pull the answer from every doc. but exact answers are returning from only one or two docs, even though the same answer is present in all docs.
example:
Here I asked model, "can i install it outside", and explicitly searching in individual doc through filter. and output is below.
from the above answers our required answer is "if installing watersoftner.......", but its only coming from two of the docs. My pre-processing is below
it really helpful for me how to debug this kind of issues.
The text was updated successfully, but these errors were encountered: