Speed up fitting in HyperSpy #488

francisco-dlp · 2015-03-20T14:15:04Z

Fitting in HyperSpy is slow, or at least not as fast as it could be. For multidimensional datasets, parallelisation can help in some cases and is in the works (see #242). A complementary approach is to speed up function evaluation. One way to do this would be to use numexpr. Any other ideas?
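To make the numexpr suggestion concrete, here is a minimal sketch that evaluates the same 1D Gaussian with plain NumPy and with numexpr. It does not use any HyperSpy API. The function names and parameter values are made up for illustration, and numexpr is treated as optional, so the second function falls back to NumPy when it is not installed.

```python
import numpy as np

try:
    import numexpr as ne  # optional dependency; assumed, not required
except ImportError:
    ne = None


def gaussian_numpy(x, A, centre, sigma):
    """Plain NumPy Gaussian; each operation allocates a temporary array."""
    return A * np.exp(-((x - centre) ** 2) / (2 * sigma ** 2))


def gaussian_numexpr(x, A, centre, sigma):
    """Same Gaussian evaluated by numexpr, which avoids large temporaries."""
    if ne is None:
        # Fallback so the sketch still runs without numexpr.
        return gaussian_numpy(x, A, centre, sigma)
    return ne.evaluate(
        "A * exp(-((x - centre) ** 2) / (2 * sigma ** 2))",
        local_dict={"x": x, "A": A, "centre": centre, "sigma": sigma},
    )


x = np.linspace(-10.0, 10.0, 1_000_000)
y_np = gaussian_numpy(x, 2.0, 0.5, 1.5)
y_ne = gaussian_numexpr(x, 2.0, 0.5, 1.5)
print(np.allclose(y_np, y_ne))
```

Any speed-up depends on array size and on the number of cores available, so it is worth timing both versions on realistic data before drawing conclusions.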

dnjohnstone · 2015-07-10T15:10:45Z

This is a particular problem I'm facing right now while trying to fit many 2D Gaussians in a stack of images, i.e. ~40 2D Gaussians in ~5000 images.

I have some code written in C that parallelises over the individual peaks, and it is much faster than anything I can write in HyperSpy right now. I see two points here. First, we could perhaps achieve a speed-up by implementing some parts of the optimisation in C, but I guess this might go against some of the wider HyperSpy principles?

Second, an option to parallelise over components in the model may be of particular benefit in cases (like mine, and often atomic resolution images) where the peaks in each spectrum/image are quite well spaced. In short, fitting 4 variables 40 times is probably a better bet than trying to fit 160 in one go.

Any thoughts, or pointers on what needs doing, would be appreciated to guide that work!
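As a rough illustration of the "fit 4 variables 40 times" idea, the sketch below fits one isotropic 2D Gaussian per peak inside a small window around an initial guess, using `scipy.optimize.curve_fit` rather than HyperSpy. It is not the C code mentioned above. The function names, the 4-parameter isotropic model, the window size and the starting width are all assumptions made for the example, and it only makes sense when neighbouring peaks contribute negligibly inside each window.

```python
import numpy as np
from scipy.optimize import curve_fit


def gaussian2d(coords, A, x0, y0, sigma):
    """Isotropic 2D Gaussian with 4 free parameters (assumed model)."""
    x, y = coords
    return A * np.exp(-((x - x0) ** 2 + (y - y0) ** 2) / (2 * sigma ** 2))


def fit_peaks_independently(image, guesses, half_width=6):
    """Fit each peak in its own window; guesses is a list of integer (x, y)."""
    results = []
    for gx, gy in guesses:
        # Crop a window around the guess, clipped to the image bounds.
        r0, r1 = max(gy - half_width, 0), min(gy + half_width + 1, image.shape[0])
        c0, c1 = max(gx - half_width, 0), min(gx + half_width + 1, image.shape[1])
        window = image[r0:r1, c0:c1]
        yy, xx = np.mgrid[r0:r1, c0:c1]
        # Starting values: window maximum, guessed centre, arbitrary width.
        p0 = (window.max(), gx, gy, 2.0)
        popt, _ = curve_fit(
            gaussian2d, (xx.ravel(), yy.ravel()), window.ravel(), p0=p0
        )
        results.append(popt)
    return np.array(results)  # one row per peak: A, x0, y0, sigma


# Synthetic, noise-free test image with two well-separated peaks.
yy, xx = np.mgrid[0:40, 0:40]
image = gaussian2d((xx, yy), 5.0, 10.3, 12.7, 2.0) + gaussian2d(
    (xx, yy), 3.0, 28.6, 25.2, 2.0
)
fitted = fit_peaks_independently(image, [(10, 13), (29, 25)])
print(fitted)
```

Because each window is fitted independently, the loop over peaks could be handed to a process pool, but whether that pays off depends on the per-fit cost relative to the process overhead.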

francisco-dlp · 2015-07-10T15:46:15Z

#573 partially solves this issue since, when using numexpr, the speed should be very close to that of C code. However, a new Expression2D component will be needed for Model2D. Hopefully it will require only minor changes.

Regarding parallelization: since you have ~5000 images, I think that fitting the images in parallel (as implemented in #242), rather than parallelizing the fit within each image, should provide a similar or better speed boost without compromising accuracy. I might be wrong about this, but if parallelizing the fit within a single image requires assuming that the contribution of all other peaks is negligible, then you may not need to fit at all: estimating the parameters of the 2D Gaussians analytically might be good enough for the purpose.
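One way to read "estimating the parameters analytically" is to use image moments on a window around each peak. The sketch below is only one possible interpretation. The function name is hypothetical, subtracting the window minimum as the background is a crude assumption, and the estimate is only sensible when a single peak dominates the window and its tails fit inside it.

```python
import numpy as np


def estimate_gaussian2d_moments(window):
    """Estimate amplitude, centre and widths of one peak from image moments."""
    # Crude background removal (assumption): subtract the window minimum.
    w = window - window.min()
    total = w.sum()
    if total <= 0:
        raise ValueError("window contains no signal above its minimum")
    yy, xx = np.indices(w.shape)
    # First moments give the centre of mass.
    x0 = (xx * w).sum() / total
    y0 = (yy * w).sum() / total
    # Second central moments give the widths along each axis.
    sigma_x = np.sqrt((((xx - x0) ** 2) * w).sum() / total)
    sigma_y = np.sqrt((((yy - y0) ** 2) * w).sum() / total)
    return w.max(), x0, y0, sigma_x, sigma_y


# Synthetic, noise-free peak to check the estimate.
yy, xx = np.indices((41, 41))
peak = 4.0 * np.exp(-((xx - 20.4) ** 2 + (yy - 19.7) ** 2) / (2 * 3.0 ** 2))
A, x0, y0, sx, sy = estimate_gaussian2d_moments(peak)
print(x0, y0, sx, sy)
```

With noisy data the second moments are sensitive to the background level, so these values may be better suited as starting parameters for a fit than as final results.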

tjof2 · 2017-02-14T16:08:56Z

How much of this is fixed/solved with #1101?

francisco-dlp · 2017-02-14T16:33:34Z

#573, #1101 and #1321 should fully address this.

tjof2 · 2020-01-29T14:47:20Z

> #573, #1101 and #1321 should fully address this.

New Expression component #573 and NEW: SAMFire #1101 are merged
NEW: Linear fitting #1321 was closed in favour of ~~New: Linear fitting 2.0 #1462, which is still open~~ Linear fitting #2422

francisco-dlp added the type: proposal label Mar 20, 2015

francisco-dlp added this to the Wish list milestone Mar 20, 2015

francisco-dlp mentioned this issue Mar 20, 2015

Enh parallel multifit #242

Closed

francisco-dlp mentioned this issue Jun 12, 2015

New Expression component #573

Merged

tjof2 mentioned this issue Sep 9, 2020

Linear fitting #2422

Merged

tjof2 linked a pull request Sep 9, 2020 that will close this issue

Linear fitting #2422

Merged

jlaehne closed this as completed in #2422 Mar 30, 2022

ericpre modified the milestones: Wish list, v1.7 Mar 30, 2022
