document import not extracting knowledge #91

Closed
giannik opened this issue Aug 2, 2023 · 2 comments

giannik commented Aug 2, 2023

After importing a PDF document and asking questions about it, the chat has no knowledge of the PDF's content. Is this feature still under development? I am using Qdrant for embeddings, and no errors appear in the console.

glahaye added the bug label Aug 2, 2023

glahaye (Collaborator) commented Aug 2, 2023

@TaoChenOSU Mind taking a look?

alliscode (Member) commented Aug 4, 2023

Hi @giannik, thank you for creating this issue! The effectiveness of the document memories depends on a lot of factors, and it can sometimes be challenging to get the results you expect. One thing you can try is to examine the prompt that was sent to the model by clicking the information button in the top right corner of the response message. In the example that I'm trying with this document, it looks like this:

image

If the query resulted in a close enough match to the contents of the document, then within the prompt you should see a section for Chat Memories that will contain snippets from the document. For my example, this is the top bit of what I see:

image

If you don't see any document snippets in the prompt, it likely means that the query did not match the document memories with high enough relevance.
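To make the relevance cutoff concrete, here is a minimal, generic sketch of how a similarity search can drop every snippet that scores below a threshold. It is not the project's code: the function names, the `min_relevance` value of 0.8, and the toy two-dimensional vectors are all assumptions for illustration. Real embeddings have hundreds or thousands of dimensions, and the actual threshold lives in the application's configuration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def relevant_snippets(query_vec, memories, min_relevance=0.8, limit=3):
    """Return up to `limit` (score, text) pairs scoring at least `min_relevance`.

    `memories` is a list of (text, embedding) pairs. The default values are
    hypothetical, not taken from the project.
    """
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in memories]
    kept = [pair for pair in scored if pair[0] >= min_relevance]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return kept[:limit]


# Toy data: two stored snippets with made-up 2-D embeddings.
memories = [("snippet about topic A", [1.0, 0.0]),
            ("snippet about topic B", [0.0, 1.0])]

close_query = relevant_snippets([1.0, 0.0], memories)  # matches topic A
vague_query = relevant_snippets([1.0, 1.0], memories)  # below the cutoff for both
```

With these toy vectors, the vague query scores about 0.71 against both snippets, so nothing passes the 0.8 cutoff and no document text would reach the prompt, which matches the symptom described in this issue.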

If you would like to look at and play with the code related to this, the code that handles chunking and importing documents is in DocumentImportController.cs, and the code that handles extracting the document memories related to a query is in ChatSkill.cs.
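For readers unfamiliar with chunking, the idea can be sketched as follows. This is a generic illustration in Python, not the logic in DocumentImportController.cs, which is not shown in this thread. The function name `chunk_text` and the `max_words` and `overlap` parameters are hypothetical; real importers often split by tokens, lines, or paragraphs instead of words.

```python
def chunk_text(text, max_words=50, overlap=10):
    """Split text into word-based chunks, each sharing `overlap` words with the previous one.

    Hypothetical helper for illustration; sizes are arbitrary defaults.
    """
    if max_words <= 0 or not 0 <= overlap < max_words:
        raise ValueError("need max_words > 0 and 0 <= overlap < max_words")
    words = text.split()
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break  # the last chunk already reaches the end of the text
    return chunks


sample = " ".join(f"w{i}" for i in range(10))
chunks = chunk_text(sample, max_words=4, overlap=1)
```

Each chunk would then be embedded and stored separately, so chunk size affects how well a short question can match a passage: very large chunks dilute the match, while very small ones lose context.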

I'll close this issue but please feel free to create a new one if you need more help or find more issues, thanks!
