dim_sum and sum returning a different result #126

mattiarighi · 2020-02-12T08:27:49Z

Describe the bug
I'm encountering a strange behaviour of the sum function, which returns a different result than dim_sum when applied to a 4 dimensional variable.

Provide the following:
The 4 D variable tmp which I'm processing looks like this:

Variable: tmp
Type: float
Total Size: 2388787200 bytes
            597196800 values
Number of Dimensions: 4
Dimensions and sizes:   [12] x [48] x [720] x [1440]
Coordinates: 
Number Of Attributes: 1
  _FillValue :  9.96921e+36

The two sum statements I'm comparing are as follows:

print(sum(tmp))
print(dim_sum(dim_sum(dim_sum(dim_sum(tmp)))))

and they return different values:

(0)     1.674379e+12
(0)     1.829703e+12

Note that the same operation with avg / dim_avg gives identical results.

Computing environment

Liinux
Linux version 2.6.32-754.14.2.el6.x86_64 (mockbuild@x86-029.build.eng.bos.redhat.com)
NCL 6.5.0 at DKRZ (same issue with NCL 6.6.2 via conda)

The text was updated successfully, but these errors were encountered:

mattiarighi · 2020-02-12T09:06:30Z

This problem can be reproduced using this simple script:

begin

  var = new((/12, 48, 720, 1440/), float)
  var = 0.5

  total1 = sum(var)
  total2 = dim_sum(dim_sum(dim_sum(dim_sum(var))))
  
  print("total1 = " + total1)
  print("total2 = " + total2)

end

which returns:

(0)     total1 = 8.38861e+06
(0)     total2 = 2.98598e+08

The correct result is total2, since 12x48x720x1440 x 0.5 = 2.98598e+08.

rbrownrigg · 2020-02-12T13:56:04Z

Hi, I can't explain it precisely, but it looks like the difference is due to floating-point arithmetic. Try this simple change to your script and note the results: var = new((/12, 48, 720, 1440/), double) var = 0.5d Rick

…

On Wed, Feb 12, 2020 at 2:06 AM Mattia Righi ***@***.***> wrote: This problem can be reproduced using this simple script: begin var = new((/12, 48, 720, 1440/), float) var = 0.5 total1 = sum(var) total2 = dim_sum(dim_sum(dim_sum(dim_sum(var)))) print("total1 = " + total1) print("total2 = " + total2) end which returns: (0) total1 = 8.38861e+06 (0) total2 = 2.98598e+08 The correct result is total2, since 12x48x720x1440 x 0.5 = 2.98598e+08. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#126?email_source=notifications&email_token=ADLWOXSZRY323D5RB2PIOPLRCO3ZRA5CNFSM4KTVAPK2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELP7SNA#issuecomment-585103668>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ADLWOXTKQGCQ4IWQZGG7QU3RCO3ZRANCNFSM4KTVAPKQ> .

mattiarighi · 2020-02-13T10:47:37Z

Hi @rbrownrigg, thanks for your answer.
Indeed, declaring var as double solves the problem.

However, the problem occurs also for relatively small variables, e.g.:

var = new((/5000, 5000/), float)

which is surprising, also given that the corresponding avg operation works fine.

pilotchute · 2020-05-26T19:07:42Z

Looks like the issue was floating point precision, and declaring var to be double solved the problem. So, I'm going to close this ticket.

If there is a more fundamental problem, please open another ticket.

mattiarighi mentioned this issue Feb 17, 2020

Issue with NCL "sum" function ESMValGroup/ESMValTool#1527

Closed

pilotchute closed this as completed May 26, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dim_sum and sum returning a different result #126

dim_sum and sum returning a different result #126

mattiarighi commented Feb 12, 2020

mattiarighi commented Feb 12, 2020

rbrownrigg commented Feb 12, 2020 via email

mattiarighi commented Feb 13, 2020

pilotchute commented May 26, 2020

dim_sum and sum returning a different result #126

dim_sum and sum returning a different result #126

Comments

mattiarighi commented Feb 12, 2020

mattiarighi commented Feb 12, 2020

rbrownrigg commented Feb 12, 2020 via email

mattiarighi commented Feb 13, 2020

pilotchute commented May 26, 2020