[IMPROVEMENT] Modify -quant 0 option #932

thealphadollar · 2018-02-21T18:14:01Z

Please prefix your pull request with one of the following: [FEATURE] [FIX] [IMPROVEMENT].

In raising this pull request, I confirm the following (please check boxes):

I have read and understood the contributors guide.
I have checked that another pull request for this purpose does not exist.
I have considered, and confirmed that this submission will be valuable to others.
I accept that this submission may not be used, and the pull request closed at the will of the maintainer.
I give this submission freely, and claim no ownership to its content.

My familiarity with the project is as follows (check one):

I have never used CCExtractor.
I have used CCExtractor just a couple of times.
I absolutely love CCExtractor, but have not contributed previously.
I am an active contributor to CCExtractor.

This pull request aims to simplify and speed up the -quant 0 option. It does so by reducing the number of distinct color shades available in the palette of the PNG image being processed.

For example, suppose a palette color is (248, 187, 27). The new algorithm reduces this color to (224, 160, 0), which is one of the 8 x 8 x 8 = 512 quantised values this algorithm can produce. Since the color palette is reduced, and our rect bitmap structures point to the palette for their pixel color values, the algorithm in effect rounds each of R, G and B down to a multiple of 32, quantising the image without much loss in actual visibility.
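To make the arithmetic concrete, here is a minimal sketch of the idea, not the exact code in this PR. The struct rgb_color type and the quantize_channel() and quantize_palette() names are hypothetical and only for illustration; the real palette layout in CCExtractor may differ. Clearing the low 5 bits of an 8-bit channel rounds it down to a multiple of 32.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical palette entry used only for this sketch;
 * the actual CCExtractor palette representation may differ. */
struct rgb_color {
    uint8_t r, g, b;
};

/* Round one 8-bit channel down to a multiple of 32
 * by clearing its low 5 bits (0xE0 == 1110 0000). */
static uint8_t quantize_channel(uint8_t value)
{
    return (uint8_t)(value & 0xE0u);
}

/* Quantize every palette entry in place. Each channel keeps 8 possible
 * levels (0, 32, ..., 224), so at most 8 * 8 * 8 = 512 colors remain. */
static void quantize_palette(struct rgb_color *palette, size_t count)
{
    assert(palette != NULL || count == 0);
    for (size_t i = 0; i < count; i++) {
        palette[i].r = quantize_channel(palette[i].r);
        palette[i].g = quantize_channel(palette[i].g);
        palette[i].b = quantize_channel(palette[i].b);
    }
}
```

With this sketch, a palette entry of (248, 187, 27) becomes (224, 160, 0), matching the example above. Only the palette is touched; the bitmap's per-pixel palette indices stay the same.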

As can be seen in the screenshot below, this method improves the time taken (92 seconds vs 100 seconds) and also gives a better result than no quantisation.

As marked in the screenshot below (with no quant algorithm at all), a line of dialogue is read as "Are you off the deck". This mistake does not occur with the improved algorithm, which reads it correctly as "Are you off the clock". The video is provided along with the timestamp so this can be verified.

Lastly, below is a diff between the two files showing the other errors that the method corrects (for example, it correctly reads "I" (capital "i"), which was previously read as "|").

For the video file, refer to issue #929.

Minor addition: added the line below, which can be uncommented to output debug.png from the ocr_bitmap() function:
save_spupng("debug.png", indata, w, h, palette, alpha, 16);

cfsmp3 · 2018-02-21T18:47:08Z

case 0 means "pass the image unchanged to Tesseract", so it cannot do any kind of processing.
You can add that new code to case 2 (but test that it actually helps in some cases; otherwise there is no point in adding it).
Remember that the help text needs to be updated as well.

thealphadollar · 2018-02-22T09:45:05Z

The proposed algorithm improves speed and gives slightly better results, as shown in the PR description.

Updated help and made it case 2 :)

Modify -quant 0 option

8d6a174

thealphadollar force-pushed the master branch from 24c3922 to 8d6a174 Compare February 22, 2018 09:43

cfsmp3 merged commit 9dc1e0a into CCExtractor:master Feb 23, 2018
