Error when using danish texts #322

Closed
kbvjeppesen opened this issue Nov 3, 2023 · 2 comments · Fixed by #336
@kbvjeppesen
Collaborator

Describe the bug
Uploaded Danish documents are not processed. English documents are processed successfully.

To Reproduce

  1. Upload a Danish text.
  2. In the upload status of the web app, the document appears as "Completed".
     [Screenshot 2023-11-03 at 07 59 44]
  3. The chat does not use the document to answer.
  4. In the Azure Portal, inspect the 'Search Service'. Under 'Indexers', the all-files-indexers indexer returns an error.
     [Screenshot 2023-11-03 at 07 51 24]
  5. The error message is not very informative.
  6. However, if we create a 'Debug session' in the 'Search service' on the Danish PDF, we see more information:
     [Screenshot 2023-11-03 at 07 53 27]
  7. So it seems that the PII skill does not support Danish, although the documentation says that it does: https://learn.microsoft.com/en-us/azure/ai-services/language-service/personally-identifiable-information/language-support?tabs=documents
  8. We have tried changing the PII language to EN in the code (in the language specification file), but without luck. The error is still the same.
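
For anyone who wants to read the indexer errors without clicking through the portal, here is a minimal sketch that builds the Azure AI Search "Get Indexer Status" REST request and pulls the per-document errors out of the response. It is not part of this project's code. The function names are made up for illustration, the `2023-11-01` API version is an assumption (use whichever version your service supports), and the response fields read here (`lastResult`, `errors`, `key`, `errorMessage`) follow the REST documentation as I understand it, so please verify them against your own service.

```python
import json
import urllib.parse
import urllib.request


def indexer_status_url(service_name, indexer_name, api_version="2023-11-01"):
    """Build the URL for the Get Indexer Status REST call (api_version is an assumption)."""
    indexer = urllib.parse.quote(indexer_name, safe="")
    query = urllib.parse.urlencode({"api-version": api_version})
    return f"https://{service_name}.search.windows.net/indexers/{indexer}/status?{query}"


def fetch_indexer_status(service_name, indexer_name, admin_key):
    """Call the search service and return the decoded JSON body (needs network access)."""
    request = urllib.request.Request(
        indexer_status_url(service_name, indexer_name),
        headers={"api-key": admin_key},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def summarize_errors(status):
    """Return (document key, error message) pairs from the last indexer run."""
    last_result = status.get("lastResult") or {}
    return [
        (error.get("key"), error.get("errorMessage"))
        for error in last_result.get("errors") or []
    ]
```

Usage would look like `summarize_errors(fetch_indexer_status("<your-search-service>", "all-files-indexers", "<admin-key>"))`, where the service name and key are placeholders and the indexer name is the one quoted in the steps above.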

Expected behavior
Danish files should be processed and used by the chat, just like English files.

Desktop (please complete the following information):

  • Azure environment
  • Browser: Chrome and Safari

Alpha version details

  • GitHub branch: 0.3 Main
  • Latest commit: 0f9d4f6
@dayland
Contributor

dayland commented Nov 3, 2023

@kbvjeppesen, we are aware of this issue on the 0.3-Gamma release. Unfortunately, we have not yet identified a fix. However, the search index and document translation are changing quite a bit in the 0.4-Delta release, and we believe that will resolve this error. We are doing some final testing and anticipate the 0.4-Delta release coming out in less than a week.

@dayland dayland added the bug Something isn't working label Nov 3, 2023
@dayland dayland added this to the 0.4-Delta milestone Nov 3, 2023
@kbvjeppesen
Collaborator Author

Sounds good, @dayland. When do you expect the 0.4-Delta release to come out?
