Program alters the original image when doing text segmentation #3

Open
a-lgil opened this issue Dec 28, 2020 · 2 comments

Comments


a-lgil commented Dec 28, 2020

When using SickZil to find supposed text on the image, the supposed text is erased and the image is altered to match the background.
Then, the supposed text is processed with OpenCV to detect the real text and translate it.

However, the supposed text detected by SickZil that isn't identified as text by OpenCV is erased from the original image. Most of the time this means that some strokes, random strands of hair or small text are deleted from the original image.

But other times entire objects or facial features are deleted from the image, such as here, where the right eye of the character is missing in the translated version, as well as some details on her hair:

[Two images attached]

A solution could be changing the aggressiveness of SickZil if that's possible since it seems it's too high right now.

Or you could just reintegrate those parts of the image picked up by SickZil but not recognised as text by OpenCV. That way, in the case of the images I've uploaded, the eye and hair details would be extracted from the image but then reintegrated when OpenCV doesn't recognise them.
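
A minimal sketch of this reintegration idea, assuming the pipeline can provide the original page, the erased (background-filled) page, the segmentation mask, and the bounding boxes that OCR confirmed as text. The function name `reintegrate_non_text`, its parameters, and the box format are hypothetical and are not taken from this project or from SickZil. It works per pixel rather than per connected region, so a masked region that only partly overlaps a text box would be only partly restored.

```python
import numpy as np


def reintegrate_non_text(original, inpainted, seg_mask, text_boxes):
    """Put back original pixels that were erased but not confirmed as text.

    original, inpainted: H x W x C uint8 arrays with identical shapes.
    seg_mask: H x W boolean array, True where the segmentation step erased pixels.
    text_boxes: iterable of (x0, y0, x1, y1) boxes (end-exclusive) that OCR
        recognised as text. The box format is an assumption of this sketch.
    """
    # Mark every pixel that lies inside at least one OCR-confirmed text box.
    confirmed = np.zeros(seg_mask.shape, dtype=bool)
    for x0, y0, x1, y1 in text_boxes:
        confirmed[y0:y1, x0:x1] = True

    # Erased pixels outside all confirmed boxes are treated as false positives.
    restore = seg_mask & ~confirmed

    result = inpainted.copy()
    result[restore] = original[restore]
    return result


# Toy 4 x 6 page: one "text" pixel at (row 1, col 1), one "eye" pixel at (row 2, col 4).
original = np.full((4, 6, 3), 200, dtype=np.uint8)
original[1, 1] = 0
original[2, 4] = 50

seg_mask = np.zeros((4, 6), dtype=bool)
seg_mask[1, 1] = True
seg_mask[2, 4] = True

# The erased page has both masked pixels filled with the background value.
inpainted = original.copy()
inpainted[seg_mask] = 200

# OCR only confirmed the top-left region as text.
result = reintegrate_non_text(original, inpainted, seg_mask, [(0, 0, 3, 3)])
```

In this toy case the pixel inside the confirmed box stays erased, while the "eye" pixel outside it gets its original value back.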

Owner

ttop32 commented Dec 29, 2020

Google OCR does not recognise sound-effect lettering as text.
The reintegration method might therefore bring back all of the sound-effect words that are currently hidden.

Currently, I have no idea how to extract text from sound-effect words.

Other than that, I am considering creating my own manga text segmentation model using a GAN
as my next project (if I have nothing else to do).

Author

a-lgil commented Dec 29, 2020

> Google OCR does not recognise sound-effect lettering as text.
> The reintegration method might therefore bring back all of the sound-effect words that are currently hidden.
>
> Currently, I have no idea how to extract text from sound-effect words.
>
> Other than that, I am considering creating my own manga text segmentation model using a GAN
> as my next project (if I have nothing else to do).

Good luck with the text segmentation model, then. Let's hope everything goes smoothly.
