Cosym target: faster numpy operations #1639

rjgildea · 2021-03-29T15:27:12Z

Significant speedup of computation of cosym target functional, gradients and curvatures.

* Use np.square(x) instead of np.power(x, 2)
* Avoid explicit loops over number of dimensions
* Functional now ~10x quicker
* Gradients ~20x quicker
* Curvatures ~4x quicker

For a real dials.cosym example (44 datasets in P3121), this change cut about 25s off the total runtime. Timings before, then after:

real	1m18.899s
user	1m4.166s
sys	0m2.899s

real	0m52.221s
user	0m39.257s
sys	0m2.973s

np.square(x) is much faster than np.power(x, 2) and gives a ~6-fold speedup to target.compute_functional() (based on an Rij matrix of size 400x400).
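A rough way to check the np.square versus np.power claim locally is sketched below. The array is synthetic random data of the same 400x400 size as the Rij matrix quoted above, not real cosym input, and the timings will vary with machine and NumPy version, so no expected numbers are given.

```python
import timeit

import numpy as np

rng = np.random.default_rng(0)
# Synthetic stand-in for a 400x400 Rij matrix (assumption: real data is float64).
x = rng.random((400, 400))

# Both calls compute the element-wise square, so the results should agree.
assert np.allclose(np.square(x), np.power(x, 2))

# Time each call over the same number of repetitions.
t_power = timeit.timeit(lambda: np.power(x, 2), number=200)
t_square = timeit.timeit(lambda: np.square(x), number=200)
print(f"np.power(x, 2): {t_power:.4f}s   np.square(x): {t_square:.4f}s")
```

This only isolates the squaring step; the ~6-fold figure above refers to the whole of target.compute_functional().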

Remove the need for one tmp variable, pre-declare another and re-use by passing this as the `out=` parameter to np operations.
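As a minimal sketch of the `out=` pattern described here: the names `rij`, `inner` and `buf` are hypothetical stand-ins, not the variables used in target.py. The idea is that one pre-declared buffer receives the result of each step instead of NumPy allocating a new temporary per operation.

```python
import numpy as np

rng = np.random.default_rng(1)
rij = rng.random((400, 400))    # hypothetical stand-in for the Rij matrix
inner = rng.random((400, 400))  # hypothetical stand-in for a second operand

# Naive form: the subtraction and the square each allocate a fresh array.
naive = np.square(rij - inner)

# Pre-declare one buffer and reuse it by passing it as out=.
buf = np.empty_like(rij)
np.subtract(rij, inner, out=buf)  # buf now holds rij - inner
np.square(buf, out=buf)           # square in place, no new allocation

assert np.allclose(naive, buf)
```

The saving comes from avoiding repeated allocation of large temporaries, which matters most when the function is called many times inside a minimiser.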

Remove need for explicit loop over number of dimensions

Avoid doubly nested loop over number of dimensions.
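To illustrate what removing a loop over dimensions can look like, here is a toy example. It assumes a weighted least-squares functional of the form sum_ij w_ij (r_ij - x_i . x_j)^2 and a coords array with one row per dimension; both are assumptions made for illustration and are not taken from target.py.

```python
import numpy as np

rng = np.random.default_rng(2)
dim, n = 3, 50
coords = rng.random((dim, n))  # assumed layout: one row per dimension
rij = rng.random((n, n))       # synthetic pairwise values
wij = rng.random((n, n))       # synthetic weights


def functional_loop(coords, rij, wij):
    """Build the inner-product matrix one dimension at a time."""
    size = coords.shape[1]
    inner = np.zeros((size, size))
    for d in range(coords.shape[0]):
        inner += np.outer(coords[d], coords[d])
    return float(np.sum(wij * np.square(rij - inner)))


def functional_vectorised(coords, rij, wij):
    """Same quantity, with the loop replaced by one matrix product."""
    inner = coords.T @ coords  # (n, dim) @ (dim, n) -> (n, n)
    return float(np.sum(wij * np.square(rij - inner)))


f_loop = functional_loop(coords, rij, wij)
f_vec = functional_vectorised(coords, rij, wij)
assert np.isclose(f_loop, f_vec)
```

The single matrix product hands the summation over dimensions to NumPy's compiled routines rather than the Python interpreter.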

rjgildea · 2021-03-29T15:41:54Z

@dwpaley it would be interesting to hear what performance impact this has on your use case.

codecov · 2021-03-29T16:02:46Z

Codecov Report

Merging #1639 (4f9112a) into main (7cfb6ec) will decrease coverage by 0.01%.
The diff coverage is 100.00%.

❗ Current head 4f9112a differs from pull request most recent head f9b854d. Consider uploading reports for the commit f9b854d to get more accurate results

@@            Coverage Diff             @@
##             main    #1639      +/-   ##
==========================================
- Coverage   66.63%   66.62%   -0.02%     
==========================================
  Files         616      616              
  Lines       68949    68926      -23     
  Branches     9601     9593       -8     
==========================================
- Hits        45943    45920      -23     
  Misses      21070    21070              
  Partials     1936     1936

benjaminhwilliams

I haven't seen the algebra, so I can't check the consistency of the code with the desired arithmetic, but I presume that's not in question here. Changes look good and improve clarity, with a nice performance improvement. I have one small suggestion (affects two places). I haven't tested my suggestion though, so YMMV. Otherwise, LGTM.

algorithms/symmetry/cosym/target.py

Co-authored-by: Ben Williams <benjaminhwilliams@users.noreply.github.com>

elena-pascal · 2021-03-30T10:31:48Z

algorithms/symmetry/cosym/target.py

        for i in range(self.dim):
-            grad[i * NN : (i + 1) * NN] = np.matmul(wrij_matrix, coords[i])
+            grad[i] = np.matmul(wrij_matrix, coords[i])


matmul can also take stacks of matrices, and would interpret coords as such if it has more than two dimensions. I think
grad = np.matmul(wrij_matrix, coords) should also work.

Maybe something to consider in future.
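A small sketch of how the loop in the diff could be collapsed, under the assumption that coords has shape (dim, NN) and wrij_matrix has shape (NN, NN); the real shapes in target.py may differ. With those shapes, np.matmul(wrij_matrix, coords) is an ordinary 2-D product and the inner dimensions only match when dim == NN, so the sketch shows two equivalent forms instead: a transposed 2-D product, and the stacked form that matmul broadcasts over.

```python
import numpy as np

rng = np.random.default_rng(3)
dim, NN = 2, 5
wrij_matrix = rng.random((NN, NN))  # synthetic stand-in
coords = rng.random((dim, NN))      # assumed shape: (dim, NN)

# Reference: the per-dimension loop from the diff.
grad_loop = np.empty((dim, NN))
for i in range(dim):
    grad_loop[i] = np.matmul(wrij_matrix, coords[i])

# Option 1: a single 2-D product; row i equals wrij_matrix @ coords[i].
grad_2d = np.matmul(coords, wrij_matrix.T)

# Option 2: treat coords as a stack of column vectors, shape (dim, NN, 1).
# matmul broadcasts the 2-D wrij_matrix across the leading stack axis.
grad_stacked = np.matmul(wrij_matrix, coords[:, :, np.newaxis])[:, :, 0]

assert np.allclose(grad_loop, grad_2d)
assert np.allclose(grad_loop, grad_stacked)
```

Either form removes the Python-level loop; which reads better depends on how coords is stored elsewhere in the code.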

Features
--------

- ``dials.cosym``: Significantly faster via improved computation of functional, gradients and curvatures (#1639)
- ``dials.integrate``: Added parameter ``valid_foreground_threshold=``, to require a minimum fraction of valid pixels before profile fitting is attempted (#1640)

Bugfixes
--------

- ``dials.cosym``: Cache cases where Rij is undefined, rather than recalculating each time. This can have significant performance benefits when handling large numbers of sparse data sets. (#1634)
- ``dials.cosym``: Fix factor of 2 error when calculating target weights (#1635)
- ``dials.cosym``: Fix broken ``engine=scipy`` option (#1636)
- ``dials.integrate``: Reject reflections with a high number of invalid pixels, which were being integrated since 3.4.0. This restores better merging statistics, and prevents many reflections being incorrectly profiled as zero-intensity. (#1640)
- Fix rare crash in symmetry calculations when no resolution limit could be calculated (#1641)

rjgildea and others added 9 commits March 29, 2021 09:18

Use np.square(x) instead of np.power(x, 2)

e1fa251

This is much faster and gives a ~6-fold speedup to target.compute_functional() (based on an Rij matrix of size 400x400).

np.power(x, 2) -> np.square(x)

39a7023

Minor np vector computation optimisations

e992457

Remove the need for one tmp variable, pre-declare another and re-use by passing this as the `out=` parameter to np operations.

Slight simplification of gradient calculation

7f729c1

Simplify functional calculation

1e8e2ae

Remove need for explicit loop over number of dimensions

Much faster gradient calculations

d3e560b

Avoid doubly nested loop over number of dimensions.

Faster curvatures

2d08be6

news

42338db

Rename newsfragments/XXX.misc to newsfragments/1639.misc

4f9112a

ndevenish force-pushed the main branch from 90cdcbb to 7cfb6ec Compare March 29, 2021 16:38

benjaminhwilliams requested changes Mar 30, 2021

ndevenish and others added 2 commits March 30, 2021 11:11

Update and rename 1639.misc to 1639.feature

c639f97

Further simplify gradients calculation

f9b854d

Co-authored-by: Ben Williams <benjaminhwilliams@users.noreply.github.com>

benjaminhwilliams approved these changes Mar 30, 2021

rjgildea requested a review from benjaminhwilliams March 30, 2021 10:29

benjaminhwilliams approved these changes Mar 30, 2021

elena-pascal reviewed Mar 30, 2021

rjgildea merged commit 1c8b68c into main Mar 30, 2021

rjgildea deleted the cosym-target-performance branch March 30, 2021 11:24

DiamondLightSource-build-server mentioned this pull request Mar 31, 2021

DIALS 3.4.1 #1637

Merged
