[Feature Request] Recognize text styles including size, font type, color, boldness, italics #47

atlury · 2024-03-05T06:28:51Z

It would be great if you can look at adding a feature of style recognition and transfer. This along with layout preservation would be a great asset to the OCR pipeline.

atlury · 2024-03-05T06:41:35Z

Here is one paper/repo "remotely" related to the above

https://github.com/uchidalab/TrueTypeTransformer
https://das2022.univ-lr.fr/wp-content/uploads/OS-slides/72.pdf

atlury · 2024-03-05T07:08:59Z

Kosmos-2.5: A cutting-edge multimodal literate model revolutionizing text-intensive image understanding. This looks interesting, you can probably explore a bit.

To quote
"Kosmos-2.5 excels in: (1) generating spatially-aware text blocks, where each block of text is assigned its spatial coordinates within the image, and (2) producing structured text output that captures styles and structures into the markdown format. The model can be adapted for any text-intensive image understanding task with different prompts through supervised fine-tuning."

atlury changed the title ~~[Feature Request] Recognize text styles including size, font type, color, bold~~ [Feature Request] Recognize text styles including size, font type, color, boldness, italics Mar 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Recognize text styles including size, font type, color, boldness, italics #47

[Feature Request] Recognize text styles including size, font type, color, boldness, italics #47

atlury commented Mar 5, 2024

atlury commented Mar 5, 2024

atlury commented Mar 5, 2024

[Feature Request] Recognize text styles including size, font type, color, boldness, italics #47

[Feature Request] Recognize text styles including size, font type, color, boldness, italics #47

Comments

atlury commented Mar 5, 2024

atlury commented Mar 5, 2024

atlury commented Mar 5, 2024