Update thresholding.py with Singh function #5490

lucArub · 2021-07-24T17:22:57Z

I add the threshold Singh function useful for text recognition.

T Romen Singh, Sudipta Roy, O Imocha Singh, Tejmani Sinam, Kh Manglem Singh.
"A New Local Adaptive Thresholding Technique in Binarization." IJCSI
International Journal of Computer Science Issues. 2011; 8(6-2): 271-276.

This technique is the one proposed in the article (Singh, 2011). It is a
locally adaptive thresholding technique that removes background by using
local mean and mean deviation. Indeed, the principal difference of this
method is that standard deviation $\delta(x,y)$
is not required. On the other hand, the threshold is calculated through
the local mean and mean deviation $\lambda(x,y)$ as:
$T (x, y) = m(x, y) \left[ 1 + k \left( \dfrac{\lambda(x,y)}{1-\lambda(x,y)} -1 \right) \right] \label{SI}$ ,
where $\lambda(x,y)=I(x,y)-m(x,y)$ is the local mean
deviation and is a bias. Its range is
Calculation of $\lambda(x,y)$ is straightforward by subtracting the mean the concerned pixel. Because of that, Singh's technique can binaries faster than other local techniques and it's also found to be better in terms of quality.

Description

Checklist

Docstrings for all functions
Gallery example in ./doc/examples (new features only)
Benchmark in ./benchmarks, if your changes aren't covered by an
existing benchmark
Unit tests
Clean style in the spirit of PEP8

For reviewers

Check that the PR title is short, concise, and will make sense 1 year
later.
Check that new functions are imported in corresponding __init__.py.
Check that new features, API changes, and deprecations are mentioned in
doc/release/release_dev.rst.

I add the threshold Singh function useful for text recognition. T Romen Singh, Sudipta Roy, O Imocha Singh, Tejmani Sinam, Kh Manglem Singh. "A New Local Adaptive Thresholding Technique in Binarization." IJCSI International Journal of Computer Science Issues. 2011; 8(6-2): 271-276.

pep8speaks · 2021-07-24T17:22:59Z

Hello @lucArub! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2021-09-21 14:14:26 UTC

grlee77

Hi @lucArub, thank you for the submission.

The paper corresponding to this has around 200 citations on Google scholar which, although less than "Niblack" or "Sauvola" methods which each have > 2k, is not bad. If you make the suggested modification to the existing _mean_std function, very few lines of new code will be needed, so there is no real concern from a maintenance standpoint.

The main thing missing at this point is a demo showing its use. I would suggest updating the existing gallery example /doc/examples/segmentation/plot_niblack_sauvola.py to also show the result of this threshold.

Also, was there a specific use case where you found the output of this method worked better than the existing thresholds? If so, that could potentially be a new, independent gallery example if there is appropriately licensed data it could be demonstrated on.

grlee77 · 2021-07-25T13:08:01Z

skimage/filters/thresholding.py

@@ -970,6 +970,106 @@ def _mean_std(image, w):
    # m*m when floating point error is considered
    s = np.sqrt(np.clip(g2 - m * m, 0, None))
    return m, s
+
+def _only_mean(image, w):


Rather than introduce a new function here, it would be better to modify the existing _mean_std function with a mean_only or omit_std argument that could be used to return the mean only. Then just return m (mean) early, before the computation of s (std).

Ok, thanks for the suggestion.

grlee77 · 2021-07-25T13:16:39Z

skimage/filters/thresholding.py

+    """
+
+   m = _only_mean(image, window_size)
+   d = image - m


For benefit of other reviewers, unlike the current threshold_sauvola and threshold_niblack, this method uses the local "mean deviation" as defined for d here rather than the usual local "standard deviation" used by those techniques. Thus, computation time should be less for this method.

The computation time is less for this method. In terms of thresholding quality is quite similar to the Sauvola technique.
If you want you can have a look at those simple results that I obtained for my university using ground truth images of DIBCO 2009 dataset.

https://github.com/lucArub/localthresholding-

grlee77 · 2021-07-25T13:32:44Z

skimage/filters/thresholding.py

+        International Journal of Computer Science Issues. 2011; 8(6-2): 271-276.
+


Suggested change

International Journal of Computer Science Issues. 2011; 8(6-2): 271-276.

International Journal of Computer Science Issues. 2011; 8(6-2): 271-276.

http://ijcsi.org/papers/IJCSI-8-6-3-275-280.pdf

We may as well also provide the URL to the (freely available) publication. As far as I could tell, there does not appear to be an associated DOI.

grlee77 · 2021-07-25T13:40:23Z

The main benefit of this technique over Niblack and Sauvola appears to be computation time. Thresholding results appear qualitatively similar for the image in the publication.

I think the computation time comparison in the publication is likely vs. non-optimized Niblack and Sauvola (i.e. without use of integral images to speed up the computation of the local mean and standard deviation). Still, the Singh method as implemented here will be faster than those methods, although I suspect closer to a factor of two or so since it would have only one call to correlate_sparse instead of two.

Avoids need for a separate _only_mean function, reducing code duplication.

STYLE: allow existing _mean_std to return mean only

lucArub closed this Jul 24, 2021

lucArub reopened this Jul 24, 2021

lucArub changed the title ~~Update thresholding.py~~ Update thresholding.py with Singh function Jul 24, 2021

Update thresholding.py

7515729

grlee77 requested changes Jul 25, 2021

View reviewed changes

grlee77 reviewed Jul 25, 2021

View reviewed changes

grlee77 mentioned this pull request Aug 9, 2021

2021's calendar of community management #5169

Closed

lucArub added 19 commits August 19, 2021 17:36

Update thresholding.py

2f1ec8b

Update thresholding.py

4eb4e75

Update thresholding.py

6bd1bd7

Update __init__.py

d500994

Update thresholding.py

13ab562

Update thresholding.py

629195c

Update thresholding.py

7144eb5

Update thresholding.py

5ec08ca

Update thresholding.py

9919930

Update thresholding.py

f36c5f8

Update thresholding.py

0d347be

Update thresholding.py

86dcf57

Update thresholding.py

34712fe

Update thresholding.py

53a33d6

Update thresholding.py

330d18f

Update thresholding.py

3f46463

Update thresholding.py

0fc9951

Update thresholding.py

c5f63db

Update thresholding.py

8dcefb3

grlee77 mentioned this pull request Aug 20, 2021

Update and rename plot_niblack_sauvola.py to plot_niblack_sauvola_sin… #5523

Closed

lucArub added 2 commits August 20, 2021 19:25

Update plot_niblack_sauvola.py

edb0615

Update plot_niblack_sauvola.py

02c3c3f

grlee77 added the type: new feature label Aug 20, 2021

lucArub and others added 15 commits August 21, 2021 09:49

Rename plot_niblack_sauvola.py to plot_niblack_sauvola_singh.py

59debf2

Update plot_niblack_sauvola_singh.py

9ad8b35

Update plot_niblack_sauvola_singh.py

b6b1ee1

Update thresholding.py

dadfd7f

Update plot_niblack_sauvola_singh.py

159eb1a

Update thresholding.py

d788015

Update thresholding.py

7009510

Update thresholding.py

535eacb

Update plot_niblack_sauvola_singh.py

39d589c

Update plot_niblack_sauvola_singh.py

d9c7e3a

Update plot_niblack_sauvola_singh.py

cd8eb12

Update thresholding.py

ebd7801

Update thresholding.py

e055813

Update thresholding.py

88cdb02

STYLE: allow existing _mean_std to return mean only

894f139

Avoids need for a separate _only_mean function, reducing code duplication.

grlee77 mentioned this pull request Aug 29, 2021

STYLE: allow existing _mean_std to return mean only lucArub/scikit-image#1

Merged

lucArub added 6 commits August 30, 2021 09:01

Merge pull request #1 from grlee77/mean-std-refactor

b4a4362

STYLE: allow existing _mean_std to return mean only

Update thresholding.py with Singh function.

2d60789

Update thresholding.py

5902ff8

Update thresholding.py

e0b6917

Update thresholding.py

3ca85f8

Merge branch 'main' into patch-1

27244fa

lucArub closed this Sep 21, 2021

lucArub reopened this Sep 21, 2021

grlee77 added 🙏 Feature request and removed type: new feature labels Feb 22, 2022

mkcor mentioned this pull request Feb 28, 2022

2022's calendar of community management #6165

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update thresholding.py with Singh function #5490

Update thresholding.py with Singh function #5490

lucArub commented Jul 24, 2021 •

edited

pep8speaks commented Jul 24, 2021 •

edited

grlee77 left a comment

grlee77 Jul 25, 2021

lucArub Jul 25, 2021

grlee77 Jul 25, 2021

lucArub Jul 25, 2021 •

edited

grlee77 Jul 25, 2021

grlee77 commented Jul 25, 2021 •

edited

		International Journal of Computer Science Issues. 2011; 8(6-2): 271-276.

	International Journal of Computer Science Issues. 2011; 8(6-2): 271-276.
	International Journal of Computer Science Issues. 2011; 8(6-2): 271-276.
	http://ijcsi.org/papers/IJCSI-8-6-3-275-280.pdf

Update thresholding.py with Singh function #5490

Are you sure you want to change the base?

Update thresholding.py with Singh function #5490

Conversation

lucArub commented Jul 24, 2021 • edited

Description

Checklist

For reviewers

pep8speaks commented Jul 24, 2021 • edited

Comment last updated at 2021-09-21 14:14:26 UTC

grlee77 left a comment

Choose a reason for hiding this comment

grlee77 Jul 25, 2021

Choose a reason for hiding this comment

lucArub Jul 25, 2021

Choose a reason for hiding this comment

grlee77 Jul 25, 2021

Choose a reason for hiding this comment

lucArub Jul 25, 2021 • edited

Choose a reason for hiding this comment

grlee77 Jul 25, 2021

Choose a reason for hiding this comment

grlee77 commented Jul 25, 2021 • edited

lucArub commented Jul 24, 2021 •

edited

pep8speaks commented Jul 24, 2021 •

edited

lucArub Jul 25, 2021 •

edited

grlee77 commented Jul 25, 2021 •

edited