OCR with Open Models blog #3137

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

merveenoyan merged 28 commits into main from ocr-blog

Oct 21, 2025

Contributor

merveenoyan commented Oct 21, 2025

No description provided.


          initial commit

d44a543

merveenoyan requested a review from pcuenca

October 21, 2025 14:47


          fix links

5c9d6d6

Contributor Author

merveenoyan commented Oct 21, 2025 •

edited

Loading

oops have to add to _blog.yml


          blog.yml

e2c2020

pcuenca approved these changes

View reviewed changes

_blog.yml Outdated Show resolved Hide resolved

_blog.yml Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

pcuenca reviewed

View reviewed changes

ocr-open-models.md Outdated

Comment on lines 347 to 348

    
              - \[PP-OCR-v5\](https://huggingface.co/blog/baidu/ppocrv5)  

              - \[SOTA OCR on-device with Core ML and dots.ocr\](https://huggingface.co/blog/dots-ocr-ne)

Member

pcuenca Oct 21, 2025

Suggested change

      
            - \[PP-OCR-v5\](https://huggingface.co/blog/baidu/ppocrv5)  
          
            - \[SOTA OCR on-device with Core ML and dots.ocr\](https://huggingface.co/blog/dots-ocr-ne)
          
            - [PP-OCR-v5](https://huggingface.co/blog/baidu/ppocrv5)  
          
            - [SOTA OCR on-device with Core ML and dots.ocr](https://huggingface.co/blog/dots-ocr-ne)

pcuenca reviewed

View reviewed changes

ocr-open-models.md Outdated Show resolved Hide resolved

Vaibhavs10 reviewed

View reviewed changes

Member

Vaibhavs10 left a comment

Took a quick pass.

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated

    
              ### Locality Awareness 

              Documents can have complex structures, like multi-column text blocks and floating figures. Older OCR models handled these documents by detecting words and then the layout of pages manually in post-processing to have the text rendered in reading order, which is brittle.  Modern OCR models, on the other hand, incorporate layout metadata to help preserve reading order and accuracy. This metadata is called “anchor”, it can come in bounding boxes. This process is also called as “grounding/anchoring” because it helps with hallucination.

Member

Vaibhavs10 Oct 21, 2025

Suggested change

      
            Documents can have complex structures, like multi-column text blocks and floating figures. Older OCR models handled these documents by detecting words and then the layout of pages manually in post-processing to have the text rendered in reading order, which is brittle.  Modern OCR models, on the other hand, incorporate layout metadata to help preserve reading order and accuracy. This metadata is called “anchor”, it can come in bounding boxes. This process is also called as “grounding/anchoring” because it helps with hallucination.
          
            Documents can have complex structures, like multi-column text blocks and floating figures. Older OCR models handled these documents by detecting words and then the layout of pages manually in post-processing to have the text rendered in reading order, which is brittle.  Modern OCR models, on the other hand, incorporate layout metadata to help preserve reading order and accuracy. This metadata is called “anchor”, it can come in bounding boxes. This process is also called as “grounding/anchoring” because it helps with reducing hallucination.

Member

Vaibhavs10 Oct 21, 2025

unresolving this because last line reads weird.

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

ocr-open-models.md Outdated Show resolved Hide resolved

pcuenca reviewed

View reviewed changes

ocr-open-models.md Outdated Show resolved Hide resolved

pcuenca reviewed

View reviewed changes

Member

pcuenca left a comment

Please, double check all markdown formatting and links

ocr-open-models.md Outdated Show resolved Hide resolved

merveenoyan and others added 15 commits

October 21, 2025 17:30


          Update _blog.yml

e2469f1

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update _blog.yml

cafb7b9

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

acedfc2

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

58b7b66

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

0e984ae

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

9c735ca

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

58acc60

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

2ddf243

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

244bad3

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

6b950a8

Co-authored-by: vb <vaibhavs10@gmail.com>


          Update ocr-open-models.md

105020c

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

Co-authored-by: vb <vaibhavs10@gmail.com>


          Update ocr-open-models.md

a1ddcde

Co-authored-by: vb <vaibhavs10@gmail.com>


          Update ocr-open-models.md

a7c5928

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

7413fce

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Contributor Author

merveenoyan commented Oct 21, 2025 •

edited

Loading

@pcuenca all was working on hf.co/new-blog so I fixed only the not working ones, rest was ok actually

pcuenca reviewed

View reviewed changes

ocr-open-models.md Outdated Show resolved Hide resolved


          Update ocr-open-models.md

d7cc8a0

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

pcuenca reviewed

View reviewed changes

ocr-open-models.md Outdated Show resolved Hide resolved


          Update ocr-open-models.md

cb5a451

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Contributor Author

merveenoyan commented Oct 21, 2025

will check one last time

merveenoyan and others added 8 commits

October 21, 2025 17:51


          Update ocr-open-models.md

ed0e841

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


          Update ocr-open-models.md

d2a41ad

Co-authored-by: vb <vaibhavs10@gmail.com>

fix

d70bccd


          Update ocr-open-models.md

3e6a365

Co-authored-by: vb <vaibhavs10@gmail.com>


          Update ocr-open-models.md

74b5976

Co-authored-by: vb <vaibhavs10@gmail.com>


          Update ocr-open-models.md

9dd85fe

Co-authored-by: vb <vaibhavs10@gmail.com>


          Update ocr-open-models.md

947b862


          Fix image links and improve code formatting

04c835b

Updated image links and formatting in the documentation.

Contributor Author

merveenoyan commented Oct 21, 2025

@pcuenca I went through it three times, I think it's good to go

merveenoyan merged commit 7b2875b into main

1 check passed

merveenoyan deleted the ocr-blog branch

October 21, 2025 16:44

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet