Fix edgecase bugs in segmentation.relabel_sequential #4465

uschmidt83 · 2020-02-18T11:22:39Z

Description

Fix edgecase bugs in segmentation.relabel_sequential (see modified tests).

Checklist

Docstrings for all functions
Gallery example in ./doc/examples (new features only)
Benchmark in ./benchmarks, if your changes aren't covered by an
existing benchmark
Unit tests
Clean style in the spirit of PEP8

For reviewers

Check that the PR title is short, concise, and will make sense 1 year
later.
Check that new functions are imported in corresponding __init__.py.
Check that new features, API changes, and deprecations are mentioned in
doc/release/release_dev.rst.
Consider backporting the PR with @meeseeksdev backport to v0.14.x

pep8speaks · 2020-02-18T11:22:43Z

Hello @uschmidt83! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

In the file skimage/segmentation/_join.py:

Line 124:39: E261 at least two spaces before inline comment

Comment last updated at 2020-02-19 16:40:02 UTC

uschmidt83 · 2020-02-18T13:42:25Z

skimage/segmentation/_join.py

+    if offset == 1 and np.all(labels0 == new_labels0):
+        if not (labels == 0).any():
+            labels = np.concatenate(([0], labels))
        return label_field, labels, labels


Maybe it would be best to not handle this special case separately and simply remove these lines of code.

@uschmidt83 I'm actually confused about what's happening in this edge case. The test example has a lot going on so it's still not clear to me. Could you elaborate on the problem you're fixing and why it goes wrong in the current code?

Hi @jni, these lines intended to check if label_field already contains sequential labels, hence there would be nothing to do. Unfortunately, the logic only works if offset == 1 and label_field contains the 0 label. Hence, I added these things.

PS: I was actually encountering this bug in a real example, I didn't go on some esoteric bug hunt ;)

This check was probably intended to avoid some computation, but it is unnecessary.
I just pushed a commit that (IMO) simplifies the function and makes it easier to understand and maintain.

jni

@uschmidt83 I'm a bit confused about this code, though I don't doubt it's necessary. Would you mind explaining the error and the fix? And why removing those lines altogether would help?

jni · 2020-02-19T01:02:06Z

skimage/segmentation/tests/test_join.py

@@ -30,9 +30,15 @@ def test_join_segmentations():
        join_segmentations(s1, s3)


+def _check_maps(ar, ar_relab, fw, inv):
+    assert_array_equal(fw[ar], ar_relab)
+    assert_array_equal(inv[ar_relab], ar)


Ooh I like this. =)

jni · 2020-02-19T01:05:03Z

skimage/segmentation/_join.py

+    if offset == 1 and np.all(labels0 == new_labels0):
+        if not (labels == 0).any():
+            labels = np.concatenate(([0], labels))
        return label_field, labels, labels


@uschmidt83 I'm actually confused about what's happening in this edge case. The test example has a lot going on so it's still not clear to me. Could you elaborate on the problem you're fixing and why it goes wrong in the current code?

jni

@uschmidt83 great, you're absolutely right! I've made a minor change suggestion but I'm approving without it. Thanks!

jni · 2020-02-19T13:07:19Z

skimage/segmentation/_join.py

-    new_labels0 = np.arange(offset, offset + len(labels0))
-    if np.all(labels0 == new_labels0):
-        return label_field, labels, labels
+    new_m = offset - 1 + len(labels0)


Is this short for new_maximum or new_max_label? If so, I wouldn't mind this being the new name rather than new_m.

Is this short for new_maximum or new_max_label?

Yes. After relabeling, relabeled.max() == new_m.

If so, I wouldn't mind this being the new name rather than new_m.

I personally don't have an opinion on how to call this variable. I just chose the name because m is already used:

scikit-image/skimage/segmentation/_join.py

Line 124 in 7d70b5e

m = label_field.max()

@uschmidt83 who would write such code??? 😂 😂 😂

Well, our preference has evolved to more self-explanatory variable names, so I would support a renaming of m to max_label and new_m to new_max_label. Feel free to leave that one to us, though, as indeed it is an orthogonal change to your contribution. Thank you for this fix!

I just changed the variable names as requested.

uschmidt83 · 2020-02-19T14:00:46Z

skimage/segmentation/_join.py

-    required_type = np.min_scalar_type(offset + len(labels0))
+    new_max_label = offset - 1 + len(labels0)
+    new_labels0 = np.arange(offset, int(new_max_label + 1))
+    required_type = np.min_scalar_type(new_max_label)


I also changed another subtlety regarding required_type.
(It is amazing how much complexity can be in such a small function.)

(It is amazing how much complexity can be in such a small function.)

I know! Thank you again for your persistence here! It's a much better function thanks to your contributions!

skimage/segmentation/_join.py

rfezzani

Good job, thank you @uschmidt83 ;-)

- Implicit (reasonable) assumption: the values for max_label and new_max_label fit into the 'int' data type. - Avoid making a copy of 'label_field' if the output type has to be "upgraded".

rfezzani

Thank you again @uschmidt83 👏

Fix edgecase bugs in segmentation.relabel_sequential

dd10b7b

PEP 8

f65d80c

uschmidt83 commented Feb 18, 2020

View reviewed changes

jni reviewed Feb 19, 2020

View reviewed changes

Simplify relabel_sequential

7d70b5e

jni approved these changes Feb 19, 2020

View reviewed changes

Make variable names more self-explanatory

53a32b3

uschmidt83 commented Feb 19, 2020

View reviewed changes

rfezzani reviewed Feb 19, 2020

View reviewed changes

skimage/segmentation/_join.py Outdated Show resolved Hide resolved

rfezzani reviewed Feb 19, 2020

View reviewed changes

skimage/segmentation/_join.py Outdated Show resolved Hide resolved

rfezzani reviewed Feb 19, 2020

View reviewed changes

skimage/segmentation/_join.py Outdated Show resolved Hide resolved

rfezzani reviewed Feb 19, 2020

View reviewed changes

Tweak relabel_sequential some more

15c2dc9

- Implicit (reasonable) assumption: the values for max_label and new_max_label fit into the 'int' data type. - Avoid making a copy of 'label_field' if the output type has to be "upgraded".

rfezzani approved these changes Mar 9, 2020

View reviewed changes

rfezzani merged commit 88af67f into scikit-image:master Mar 9, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix edgecase bugs in segmentation.relabel_sequential #4465

Fix edgecase bugs in segmentation.relabel_sequential #4465

uschmidt83 commented Feb 18, 2020 •

edited

pep8speaks commented Feb 18, 2020 •

edited

uschmidt83 Feb 18, 2020

jni Feb 19, 2020

uschmidt83 Feb 19, 2020 •

edited

uschmidt83 Feb 19, 2020

jni left a comment

jni Feb 19, 2020

rfezzani Feb 19, 2020

jni Feb 19, 2020

jni left a comment

jni Feb 19, 2020

uschmidt83 Feb 19, 2020

jni Feb 19, 2020

uschmidt83 Feb 19, 2020

uschmidt83 Feb 19, 2020

jni Feb 19, 2020

rfezzani left a comment

rfezzani left a comment

Fix edgecase bugs in segmentation.relabel_sequential #4465

Fix edgecase bugs in segmentation.relabel_sequential #4465

Conversation

uschmidt83 commented Feb 18, 2020 • edited

Description

Checklist

For reviewers

pep8speaks commented Feb 18, 2020 • edited

Comment last updated at 2020-02-19 16:40:02 UTC

Choose a reason for hiding this comment

Choose a reason for hiding this comment

uschmidt83 Feb 19, 2020 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jni left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jni left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rfezzani left a comment

Choose a reason for hiding this comment

rfezzani left a comment

Choose a reason for hiding this comment

uschmidt83 commented Feb 18, 2020 •

edited

pep8speaks commented Feb 18, 2020 •

edited

uschmidt83 Feb 19, 2020 •

edited