Conversation

@merveenoyan
Contributor

No description provided.

@merveenoyan merveenoyan requested a review from pcuenca October 21, 2025 14:47
@merveenoyan
Contributor Author

merveenoyan commented Oct 21, 2025

oops have to add to _blog.yml

Comment on lines 347 to 348
- \[PP-OCR-v5\](https://huggingface.co/blog/baidu/ppocrv5)
- \[SOTA OCR on-device with Core ML and dots.ocr\](https://huggingface.co/blog/dots-ocr-ne)
Member

Suggested change
- \[PP-OCR-v5\](https://huggingface.co/blog/baidu/ppocrv5)
- \[SOTA OCR on-device with Core ML and dots.ocr\](https://huggingface.co/blog/dots-ocr-ne)
- [PP-OCR-v5](https://huggingface.co/blog/baidu/ppocrv5)
- [SOTA OCR on-device with Core ML and dots.ocr](https://huggingface.co/blog/dots-ocr-ne)

Member

@Vaibhavs10 Vaibhavs10 left a comment

Took a quick pass.


### Locality Awareness

Documents can have complex structures, like multi-column text blocks and floating figures. Older OCR models handled these documents by detecting words and then the layout of pages manually in post-processing to have the text rendered in reading order, which is brittle. Modern OCR models, on the other hand, incorporate layout metadata to help preserve reading order and accuracy. This metadata is called “anchor”, it can come in bounding boxes. This process is also called as “grounding/anchoring” because it helps with hallucination.
Member

Suggested change
Documents can have complex structures, like multi-column text blocks and floating figures. Older OCR models handled these documents by detecting words and then the layout of pages manually in post-processing to have the text rendered in reading order, which is brittle. Modern OCR models, on the other hand, incorporate layout metadata to help preserve reading order and accuracy. This metadata is called “anchor”, it can come in bounding boxes. This process is also called as “grounding/anchoring” because it helps with hallucination.
Documents can have complex structures, like multi-column text blocks and floating figures. Older OCR models handled these documents by detecting words and then the layout of pages manually in post-processing to have the text rendered in reading order, which is brittle. Modern OCR models, on the other hand, incorporate layout metadata to help preserve reading order and accuracy. This metadata is called “anchor”, it can come in bounding boxes. This process is also called as “grounding/anchoring” because it helps with reducing hallucination.

Member

Unresolving this because the last line reads weird.

Member

@pcuenca pcuenca left a comment

Please double-check all markdown formatting and links.

merveenoyan and others added 15 commits October 21, 2025 17:30
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: vb <vaibhavs10@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: vb <vaibhavs10@gmail.com>
Co-authored-by: vb <vaibhavs10@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
@merveenoyan
Contributor Author

merveenoyan commented Oct 21, 2025

@pcuenca everything was working on hf.co/new-blog, so I fixed only the ones that weren't working; the rest was actually OK

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
@merveenoyan
Contributor Author

will check one last time

merveenoyan and others added 8 commits October 21, 2025 17:51
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: vb <vaibhavs10@gmail.com>
Co-authored-by: vb <vaibhavs10@gmail.com>
Co-authored-by: vb <vaibhavs10@gmail.com>
Co-authored-by: vb <vaibhavs10@gmail.com>
Updated image links and formatting in the documentation.
@merveenoyan
Contributor Author

@pcuenca I went through it three times, I think it's good to go

@merveenoyan merveenoyan merged commit 7b2875b into main Oct 21, 2025
1 check passed
@merveenoyan merveenoyan deleted the ocr-blog branch October 21, 2025 16:44

4 participants