Internal improvements to the EBSD orientation/projection center refinement code #405

hakonanes · 2021-08-04T17:54:52Z

Description of the change

Simplify testing of refinement functionality and related EBSD tests
Update API reference with EBSD refinement methods
Simplify checks of whether parameters passed to refinement methods are valid. One of these, the check of a EBSD detector's compatibility to the EBSD signal's navigation and signal shapes, is moved to a separate private function because it can be useful elsewhere.
Update order of contributor credits
Rename gnomonic and Lambert projections from project/iproject to vector2xy/xy2vector.
Greatly improve performance of generation of a dictionary of simulated EBSD patterns by master pattern projection using Numba. These are partly based on work by @friedkitteh in the _refinement.py module. We get a huge speed-up!

See #402 and the closed PR #387 for further details.

Progress of the PR

Docstrings for all functions
Unit tests with pytest for all lines
Clean code style by running black via pre-commit
Re-add PC refinement
Re-add full refinement
Re-add respect chunks in lazy signals during refinement
Add refinement to pattern matching notebook
Re-add nice print of refinement information

Minimal example of the bug fix or new feature

No updates to the public API are made, hence no new functionality is added in this PR.

For reviewers

The PR title is short, concise, and will make sense 1 year later.
New functions are imported in corresponding __init__.py.
New features, API changes, and deprecations are mentioned in the
unreleased section in doc/changelog.rst.

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes · 2021-08-05T11:28:19Z

The ubuntu-latest/py.8/pip tests took 290 s for the #403 PR. With the latest commits, they now take 137 s (link). This is not a fair comparison yet, since the coverage of the _refinement.py module dropped from 100% to 90%, and the ebsd.py coverage dropped from 100% to 99%. Still I hope to keep the test time significantly below 290 s, which was a significant jump from 106 s for a PR before the refinement tests were added.

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes · 2021-08-16T10:04:24Z

@friedkitteh Do you know if dask.delayed() respects chunks? I was wondering if iterating over a lazy EBSD signal using for idx in np.ndindex(navigation_shape):, which runs through the signal from upper left to lower right along the first axis (rows), and passing patterns[idx] to dask.delayed(), respect the underlying chunks of patterns upon a call to compute()? As in:

https://github.com/pyxem/kikuchipy/pull/405/files#diff-9ac7dbbc84a37f18d62897b66ed2534fc1ff71cabec3e0023d22d12bfe657aacR129

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes · 2021-08-16T13:59:26Z

By the way, I got marginally better results when refining using global SHGO compared to local Nelder-Mead for the built in "large" Ni data set by passing these keyword arguments:

    method_kwargs=dict(
        sampling_method="sobol",
        options=dict(f_tol=1e-4),
        minimizer_kwargs=dict(method="Nelder-Mead", options=dict(ftol=1e-4)),
    ),

to EBSD.refine_orientation(). SHGO used 160 s and achieved a mean NCC of 0.4412814 while Nelder-Mead used 59 s and achieved a marginally worse mean NCC of 0.4412797. Note that ftol=1e-4 is default for Nelder-Mead via method="minimize".

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

friedkitteh · 2021-08-16T15:51:40Z

@friedkitteh Do you know if dask.delayed() respects chunks? I was wondering if iterating over a lazy EBSD signal using for idx in np.ndindex(navigation_shape):, which runs through the signal from upper left to lower right along the first axis (rows), and passing patterns[idx] to dask.delayed(), respect the underlying chunks of patterns upon a call to compute()?

I think the way you have it now should work as you have intended. If you wanted to work on a chunk-by-chunk basis, you could call my_dask_array.to_delayed() and have an np.ndarray with shape equal to the chunks, some trickery would then probably be needed to ensure the results are built back in the correct order.

Also, I think it would probably be faster to use to_delayed(), as I assume it is very expensive to interact with the Dask array all the time?

hakonanes · 2021-08-17T08:01:21Z

I think the way you have it now should work as you have intended. If you wanted to work on a chunk-by-chunk basis, you could call my_dask_array.to_delayed() and have an np.ndarray with shape equal to the chunks, some trickery would then probably be needed to ensure the results are built back in the correct order.

I thought about using map_blocks(), but I believe dask has more freedom with dask.delayed() and will lead to less idle time for workers.

Also, I think it would probably be faster to use to_delayed(), as I assume it is very expensive to interact with the Dask array all the time?

You're referring to this part of the "best practices" guide? Yes, we might earn some time on that. Will look into it.

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes · 2021-08-17T16:57:21Z

Only the final touches to the API reference and the changelog remains, will finish this tomorrow!

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes added 4 commits August 4, 2021 16:00

Move refinement arg check to private method, add to API reference

b1eaedb

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Rework parts of refinement tests to smaller unit tests

e416786

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Update contributing with welcoming news that [skip ci] is supported

f6dcaf5

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Update package credits [skip ci]

cdb5742

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes added documentation This relates to the documentation tests This relates to the tests maintenance This relates to package maintenance labels Aug 4, 2021

hakonanes added this to the v0.5.0 milestone Aug 4, 2021

hakonanes added 3 commits August 5, 2021 11:23

Simplify refinement tests with fewer allowed opt. iters

ff54047

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Update contributing guide with a testing tip

58ed0c0

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Use orix' create_coordinate_arrays() in xmap tests

7237bae

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes mentioned this pull request Aug 5, 2021

Minor improvements and changes of orientation/PC refinement before v0.5 release #402

Closed

7 tasks

hakonanes added 7 commits August 6, 2021 15:42

Use Numba to speed up projection of sim. EBSD from MP

77196b1

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Finish overhaul of Numba speed-up of dictionary generation

02f2b53

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Change project/iproject naming for Gnomonic and Lambert

453dde5

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Improve refinement performance with more Numba

cdcf54b

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Update black in pre-commit config file

a9e0da4

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Restructure refinement module into smaller files

d3d878f

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Move re-computation of proj. centers to loop during refinement

1b5bf31

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes added the enhancement New feature or request label Aug 13, 2021

hakonanes added 2 commits August 13, 2021 19:49

Clean up existing tests, not 100% coverage yet

5d4ae0b

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Return to near 100% coverage without PC/full refinement

2ac7cc9

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Re-add print of refinement results

e544a2d

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Re-add projection center refinement

c1043da

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Complete re-add of PC refinement and tests

c917860

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes added 2 commits August 17, 2021 16:39

Re-add orientation/PC (full) refinement, return to 100% coverage

4025385

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Update pattern matching notebook with refinement methods and analysis

d07ef15

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

Final touches to the API reference and pattern matching notebook

54606db

Signed-off-by: Håkon Wiik Ånes <hwaanes@gmail.com>

hakonanes merged commit 85997bb into pyxem:master Aug 18, 2021

hakonanes deleted the internal-improvements-refinement branch August 18, 2021 11:46

This was referenced Aug 19, 2021

Jump in memory use when computing dictionary of simulated EBSD patterns from MasterPattern.get_patterns() #325

Closed

Unpin dask following dask 2021.8.1 #418

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Internal improvements to the EBSD orientation/projection center refinement code #405

Internal improvements to the EBSD orientation/projection center refinement code #405

hakonanes commented Aug 4, 2021 •

edited

Loading

hakonanes commented Aug 5, 2021

hakonanes commented Aug 16, 2021 •

edited

Loading

hakonanes commented Aug 16, 2021 •

edited

Loading

friedkitteh commented Aug 16, 2021

hakonanes commented Aug 17, 2021

hakonanes commented Aug 17, 2021

Internal improvements to the EBSD orientation/projection center refinement code #405

Internal improvements to the EBSD orientation/projection center refinement code #405

Conversation

hakonanes commented Aug 4, 2021 • edited Loading

Description of the change

Progress of the PR

Minimal example of the bug fix or new feature

For reviewers

hakonanes commented Aug 5, 2021

hakonanes commented Aug 16, 2021 • edited Loading

hakonanes commented Aug 16, 2021 • edited Loading

friedkitteh commented Aug 16, 2021

hakonanes commented Aug 17, 2021

hakonanes commented Aug 17, 2021

hakonanes commented Aug 4, 2021 •

edited

Loading

hakonanes commented Aug 16, 2021 •

edited

Loading

hakonanes commented Aug 16, 2021 •

edited

Loading