Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doesn't recognize at all #26

Closed
websorokinweb opened this issue Feb 25, 2023 · 4 comments
Closed

Doesn't recognize at all #26

websorokinweb opened this issue Feb 25, 2023 · 4 comments

Comments

@websorokinweb
Copy link

websorokinweb commented Feb 25, 2023

from manga_ocr import MangaOcr

mocr = MangaOcr()
text = mocr('Shingeki No Kyojin - Raw/_Chapter 6/003.jpg')
print(text)

003
Got そして、日本では

I can't get what's wrong it's like common image + others are return something wrong too
Python 3.9.11
Windows 10

@websorokinweb websorokinweb changed the title Don't recognize at all Doesn't recognize at all Feb 25, 2023
@websorokinweb
Copy link
Author

I got. This package doesn't work for whole page. Only for bubbles

@HighLiuk
Copy link

@websorokinweb got the same problem here. I was so excited at first...

@HighLiuk
Copy link

@websorokinweb if you look at Mokuro, looks like it combines Comic Text Detector which is great to detect Japanese text and group them to bubbles, then for each bubble you use MangaOcr and voilà (at least in theory)

@HighLiuk
Copy link

@websorokinweb this should be fine:

from mokuro.manga_page_ocr import MangaPageOcr
from mokuro.utils import dump_json

if __name__ == '__main__':
    mpocr = MangaPageOcr()
    result = mpocr('path/to/image.jpg')
    dump_json(result, 'path/to/result.json')

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants