Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

馃憖 Integrate Microsoft Doc Intelligence for OCR #71

Closed
asim-shrestha opened this issue May 16, 2024 · 2 comments 路 Fixed by #85
Closed

馃憖 Integrate Microsoft Doc Intelligence for OCR #71

asim-shrestha opened this issue May 16, 2024 · 2 comments 路 Fixed by #85
Labels
good first issue Good for newcomers

Comments

@asim-shrestha
Copy link
Contributor

Currently the only OCR service Tarsier supports is GoogleOCR vision. We're told Microsoft's model is the most performant so would love to also include it!

Documentation: https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/concept-read?view=doc-intel-4.0.0

@awtkns awtkns added the good first issue Good for newcomers label May 16, 2024
@ml5ah
Copy link
Contributor

ml5ah commented Jun 11, 2024

Document Intelligence has more advanced capabilities for processing multi-page documents. Since for tarsier, we're dealing with screenshots (single images), it's easier to integrate their OCR service as a start.
Please see #84 and #85

@asim-shrestha @awtkns

@awtkns
Copy link
Contributor

awtkns commented Jun 11, 2024

That sounds great @ml5ah !

@awtkns awtkns linked a pull request Jun 12, 2024 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants