Pattern Matching: Compare experimental with simulated data #233

onatlandsmyr · 2020-10-16T20:55:28Z

Matching diffraction patterns

Compare experimental and simulated patterns and return keep_n sorted match results. The match result consists of simulation indices and metric results having the shape; (nx*ny, keep_n).

The functions use dask functionality but accept both dask and numpy arrays as inputs. It is possible to divide the computation by slicing up simulations regardless of inputs being dask or numpy arrays. This is achieved by passing n_slices as keyword to pattern_match.

Mainly to be used by StaticDictionaryIndexing and DynamicDictionaryIndexing and not directly by the user.

This PR depends on #231 .

Progress of the PR

Docstrings for all functions
Unit tests with pytest for all lines
Clean style in as per black

Minimal example of the bug fix or new feature

>>> import kikuchipy as kp
>>> import numpy as np
>>> from kikuchipy.indexing import pattern_match
>>> s = kp.signals.EBSD(np.zeros((10, 10, 10, 10)))
>>> s = kp.signals.EBSD(np.zeros((10, 10, 10, 10)))
>>> simulated = np.zeros((1000, 10, 10))
>>> simulation_indices, metric_results = pattern_match(s.data, simulated, keep_n=30, metric="zncc")
>>> s_best_match = kp.signals.EBSD(simulated[simulation_indices[:, 0]].reshape(10,10))

For reviewers

Check that the PR title is short, concise, and will make sense 1 year
later.
Check that new functions are imported in corresponding __init__.py.
Check that new features, API changes, and deprecations are mentioned in
the unreleased section in doc/changelog.rst.

Framework for calculating similarities between 2D gray-tone images of equal size.

length of shape -> ndim, and removed unnecessary squeeze call.

Produce value of 1 with equal pattern and template

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

…ity-metrics

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

…y into similarity-metrics

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Change data shape from (N,nm) to (nm,N) to correspond better with cdist in scipy and general logic.

…ity-metrics

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

When compute=False and n_slices is not None.

hakonanes · 2020-10-19T20:25:13Z

I think we should rename this submodule to pattern matching, because many notable resources use "template matching" for describing finding a smaller image template in a larger image:

Wikipedia: https://en.wikipedia.org/wiki/Template_matching
the OpenCV library: https://docs.opencv.org/master/d4/dc6/tutorial_py_template_matching.html

Further, I think renaming patterns -> experimental and templates -> simulated should be done.

What do you think, @onatlandsmyr?

onatlandsmyr · 2020-10-19T23:31:21Z

That sounds better and more clear. I guess I should also change patterns and templates throughout SimilarityMetric/#231, for consistency. Pattern matching is probably better since we have an implicit understanding of it being diffraction patterns. Out of context, they both would be the same thing for me. "Simulation matching" may be an alternative, but I'll stick with pattern matching for now.

And more importantly: patterns->experimental, templates->simulated

hakonanes · 2020-10-20T08:29:10Z

That sounds better and more clear. I guess I should also change patterns and templates throughout SimilarityMetric/#231, for consistency. Pattern matching is probably better since we have an implicit understanding of it being diffraction patterns. Out of context, they both would be the same thing for me. "Simulation matching" may be an alternative, but I'll stick with pattern matching for now.

I agree that simulation matching would be equally descriptive. Pattern matching is used in the literature by e.g. Gert Nolze, Aimo Winkelmann, Angus Wilkinson, and others. Therefore I think it is safe to call it that!

hakonanes · 2020-10-22T08:34:27Z

I've closed one review comment and re-commented on the remaining three. I'll go over the PR one more time after these are resolved, and then we'll see if anything remains.

I'll update this branch with master and solve any potential conflicts now.

…late-matching

Updated tests and docs

…matching

Also removed match_result tuple

hakonanes · 2020-10-22T14:27:36Z

Great work, @onatlandsmyr!

I'm touching up formatting and API reference now, will then approve and we'll let the checks pass a last time before merging.

onatlandsmyr · 2020-10-22T14:43:56Z

Thank you, @hakonanes, for a great review and putting the finishing touch! It's a pleasure contributing to kikuchipy.

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes · 2020-10-22T14:58:16Z

Thank you, @hakonanes, for a great review and putting the finishing touch! It's a pleasure contributing to kikuchipy.

Thanks, that's nice to hear (:

hakonanes · 2020-10-22T15:05:43Z

I see that two tests where you divide by zero throw warnings:

kikuchipy/indexing/tests/test_pattern_matching.py::TestPatternMatching::test_pattern_match_one_to_one
  /home/hakon/kode/kikuchipy/kikuchipy/indexing/similarity_metrics.py:402: RuntimeWarning: invalid value encountered in true_divide
    expt /= (expt ** 2).sum(axis=expt_sum_axis, keepdims=True) ** 0.5

kikuchipy/indexing/tests/test_pattern_matching.py::TestPatternMatching::test_pattern_match_one_to_one
  /home/hakon/kode/kikuchipy/kikuchipy/indexing/similarity_metrics.py:403: RuntimeWarning: invalid value encountered in true_divide
    sim /= (sim ** 2).sum(axis=sim_sum_axis, keepdims=True) ** 0.5

I would like to be warned when there are zero-intensities in my patterns, so I think this is okay. However, the tests shouldn't throw these I think, so I'll just add a np.errstate catch.

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

onatlandsmyr and others added 30 commits October 12, 2020 23:57

feat(SimilarityMetric): ZNCC, NDP and user defined metrics

1fc5bd8

Framework for calculating similarities between 2D gray-tone images of equal size.

fix(SimilarityMetric): squeeze dimensions of similarity matrix

2e8adc9

Merge branch 'similarity-metrics'

9d42d70

fix(SimiliarityMetric): rechunk dask arrays after type conversion

ce40f63

Merge branch 'similarity-metrics'

8d1b68b

refactor(SimilarityMetric)

089e0d1

length of shape -> ndim, and removed unnecessary squeeze call.

test(SimilarityMetric): Many to many ZNCC

83e87c7

Produce value of 1 with equal pattern and template

refactor: removed metricscopes including ANY

0754523

wip: tests cover most cases

c1d70db

test: remove print statement

096540e

Merge branch 'similarity-metrics' into template-matching

0d5bbb4

refactor: indexation -> indexing

21c8c63

Reformat docstrings, add Ole to credits, add indexing module to doc

405a587

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

refactor: renamed variables and made functions private

23d93d8

Merge remote-tracking branch 'origin/similarity-metrics' into similar…

aaa0f12

…ity-metrics

Merge branch 'similarity-metrics' into template-matching

e3f3815

refactor: underscore prefix

9eb4310

Merge branch 'similarity-metrics' into template-matching

6aa285e

refactor: underscore prefix

c992d0b

Merge branch 'similarity-metrics' into template-matching

fbe77be

Update docstring table, and more

eed1303

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Merge branch 'similarity-metrics' of github.com:onatlandsmyr/kikuchip…

9ed54a2

…y into similarity-metrics

Add indexing module to kikuchipy/__init__.py

b6f729f

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Add static dictionary indexing note in changelog

f6388c5

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

fix: change standard shape of similarity matrix

be02b8d

Change data shape from (N,nm) to (nm,N) to correspond better with cdist in scipy and general logic.

test: update tests to new output standard

635e24d

Merge remote-tracking branch 'origin/similarity-metrics' into similar…

97ea036

…ity-metrics

Merge branch 'similarity-metrics' into template-matching

c8035c3

fix: _is_compatible to be working for all scopes

a9573d8

Clarify parameters in docstrings, some minor syntax changes, repr

2555edf

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

fix: raise NotImplementedError

5a7167b

When compute=False and n_slices is not None.

onatlandsmyr mentioned this pull request Oct 18, 2020

Static pattern matching framework, orientation similarity map and crystal map merging #234

Merged

6 tasks

refactor: template_match -> pattern_match

bb0ad78

And more importantly: patterns->experimental, templates->simulated

onatlandsmyr changed the title ~~Template Matching: Compare patterns with templates and keep n sorted results~~ Pattern Matching: Compare experimental with simulated data and keep n sorted results Oct 20, 2020

onatlandsmyr changed the title ~~Pattern Matching: Compare experimental with simulated data and keep n sorted results~~ Pattern Matching: Compare experimental with simulated data Oct 20, 2020

onatlandsmyr added 2 commits October 20, 2020 02:51

refactor(SimilarityMetric): patterns->experimental, templates->simulated

08d31be

docs: data -> patterns

ad46855

test: slicing and compute=False raise NotImplementedError

754b993

hakonanes and others added 3 commits October 22, 2020 10:36

Merge branch 'master' of https://github.com/pyxem/kikuchipy into temp…

8044627

…late-matching

refactor: split pattern_match more nicely into two

90a95dd

Updated tests and docs

Merge remote-tracking branch 'origin/template-matching' into pattern-…

7f34857

…matching

refactor: metric_result -> scores

e58e405

Also removed match_result tuple

Touch up docstring formatting, add to API ref, update changelog

1af5fdb

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes marked this pull request as ready for review October 22, 2020 14:56

hakonanes self-requested a review October 22, 2020 14:56

hakonanes approved these changes Oct 22, 2020

View reviewed changes

Update test to not throw runtime warning

9cf3b47

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes merged commit 191b06b into pyxem:master Oct 22, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pattern Matching: Compare experimental with simulated data #233

Pattern Matching: Compare experimental with simulated data #233

onatlandsmyr commented Oct 16, 2020 •

edited by hakonanes

hakonanes commented Oct 19, 2020

onatlandsmyr commented Oct 19, 2020 •

edited

hakonanes commented Oct 20, 2020

hakonanes commented Oct 22, 2020

hakonanes commented Oct 22, 2020

onatlandsmyr commented Oct 22, 2020

hakonanes commented Oct 22, 2020

hakonanes commented Oct 22, 2020

Pattern Matching: Compare experimental with simulated data #233

Pattern Matching: Compare experimental with simulated data #233

Conversation

onatlandsmyr commented Oct 16, 2020 • edited by hakonanes

Matching diffraction patterns

Progress of the PR

Minimal example of the bug fix or new feature

For reviewers

hakonanes commented Oct 19, 2020

onatlandsmyr commented Oct 19, 2020 • edited

hakonanes commented Oct 20, 2020

hakonanes commented Oct 22, 2020

hakonanes commented Oct 22, 2020

onatlandsmyr commented Oct 22, 2020

hakonanes commented Oct 22, 2020

hakonanes commented Oct 22, 2020

onatlandsmyr commented Oct 16, 2020 •

edited by hakonanes

onatlandsmyr commented Oct 19, 2020 •

edited