FFT based M2L #73

isuruf · 2021-07-02T17:53:18Z

Replaces #36

For FFT based M2L, we first need to take the FFT of the source coeffs. This is given by
sumpy.expansion.local.LocalExpansionBase.m2l_preprocess_exprs and done
separately in a new loopy kernel produced by sumpy.e2e.E2EFromCSRWithFFTPreprocess.
This happens in a separate loopy kernel because the FFT of the inputs needs to happen
only once for each source box. Since the M2L is done for each target box, if we didn't do the
FFT separately, source boxes will be processed using FFT multiple times.

After that the derivatives which are precomputed for translation classes need to be
FFTed as well. This is done in the same loopy kernel that computes the derivatives.

M2L is then just the elementwise multiplication of the FFT of the source coeffs and
the FFT of the derivatives. After adding these for each target box, we do an inverse FFT
for each target box. This needs to happen outside the inner loop of iterating the source
boxes in M2L to avoid redundant calculations. The expressions involved are given by
sumpy.expansion.local.LocalExpansionBase.m2l_postprocess_exprs.

One other major change is that the loopy instructions depend on whether the input to
M2L is real or not. If it's real, FFT and then inverse FFT gives us complex expressions
with imaginary part close to zero. We need to truncate the imaginary part if the input was
real, but keep the imaginary part as it is if the input was complex. Therefore
E2EFromCSE.get_kernel gained an additional parameter result_dtype.

This reverts commit 29270d1.

isuruf · 2021-07-09T06:52:30Z

@inducer, this is ready.

Note that I have disabled FFT tests for H2DLocal and Y2DLocal because they are inaccurate. The scaling here is not working properly. I'll try to fix that in a follow-up PR if that's okay.

inducer

🎉 This is getting close!

sumpy/e2e.py

sumpy/expansion/local.py

sumpy/fmm.py

sumpy/tools.py

…ion involved

dtype and type are different things in numpy and even though they compare equal, they are hashed differently.

inducer · 2021-08-16T22:03:00Z

This looks good. Thanks for being patient with me and seeing this through to completion!

isuruf added 30 commits March 8, 2020 00:30

Remove passing a sac around

1bcf666

Add FFT

2e9e727

Generate Toeplitz matrix

ab662aa

Fix missing import and formatting

83e9214

Fix rscaling

a17c60f

use get_scaled_multipole

56682e0

Use UnevaluatedExpr for the ratio of rscales

0978271

Rewrite cosines and sines using cosines of angle in the first quadrant

320cc8a

Add a test to check for FFT

ef2731f

Fix formatting

5001688

Python2 has no notion of local variables

76ac6fc

Add a numeric test that M2L is toeplitz using simple translation

166a20d

Use 1e-10 for now

906abb3

Use _fft_uneval_expr

beec30c

Add a new derivative taker for Laplace

91d838f

Use the derivative taker everywhere

bb5a2b5

Get tests to pass

26cba1d

Make get_derivative_taker part of expansion and not kernel

f3d2949

Add support Laplace 2D

e96bc87

Use new derivativetaker with Laplace

3fdaa5f

Pass around a sac

e4f6379

Fix Laplace 2D mi=(2,0) case

0c3b5cf

Move efficient scaling to the deriv taker

cc68439

Use the sac

6189de9

Use the sac more

bcf5310

Fix typo

44376fa

Fix benchmark

4e0fed6

Increase the timeout a bit

e1bef31

Add RadialDerivativeTaker

a0907c0

Switch over Helmholtz and Biharmonic

688655b

isuruf added 4 commits July 8, 2021 23:53

debug H2D FFT failure

29270d1

Revert "debug H2D FFT failure"

7ad7901

This reverts commit 29270d1.

Simplify test generation

22e4930

Ban Fourier/Bessel expansions with FFT

6bc2a23

Fix precompute M2L CL event management

cd9a7ba

inducer reviewed Aug 10, 2021

View reviewed changes

isuruf added 12 commits August 13, 2021 12:33

Use getattr

452180f

dict() -> {}

9045aa6

get_translation_loopy_insns -> get_loopy_insns as there's no translat…

b12efdc

…ion involved

fix grammar

6ed5f6b

drop using latex in code comment

fb662f0

Use dictionary comprehension

dfd4d94

Use an else statement instead of returning in if

bc1e172

explain why FFT for Bessel based expansions is disabled

b5cdb03

Add a FIXME comment

66c1296

Make sure type is used in dictionary lookup

0c59fc4

dtype and type are different things in numpy and even though they compare equal, they are hashed differently.

separate E2EFromCSR and M2LUsingTranslationClassesDependentData

2fec1d0

fix bad quotes

d48ab2c

isuruf force-pushed the fft branch from ca7ea99 to d48ab2c Compare August 13, 2021 10:51

isuruf added 6 commits August 13, 2021 16:42

Fix import of M2LUsingTranslationClassesDependentData

1deb69a

Fix get_kernel signature for E2EFromCSR

2553507

add M2LUsingTranslationClassesDependentData to __all__

bda56e4

fix signature of get_translation_loopy_insns too

2e27e72

remove unneeded complex_dtype for E2EFromCSR

d6077f0

Merge branch 'main' into fft

2afbcb6

isuruf requested a review from inducer August 14, 2021 04:37

inducer merged commit a9a6806 into inducer:main Aug 16, 2021

isuruf deleted the fft branch August 17, 2021 04:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FFT based M2L #73

FFT based M2L #73

isuruf commented Jul 2, 2021 •

edited

isuruf commented Jul 9, 2021

inducer left a comment

inducer commented Aug 16, 2021

FFT based M2L #73

FFT based M2L #73

Conversation

isuruf commented Jul 2, 2021 • edited

isuruf commented Jul 9, 2021

inducer left a comment

Choose a reason for hiding this comment

inducer commented Aug 16, 2021

isuruf commented Jul 2, 2021 •

edited