numpy.dot has undocumented dtype behavior, resulting in different values than (a * b).sum() #12344

itamarst · 2018-11-06T18:11:56Z

I would expect (a * b).sum() and np.dot(a, b) to be the same for 1D arrays. However, the former upscales(?) the dtype, and the latter doesn't, so they give different results. This may be expected behavior, but if so the np.dot docs should mention the dtype behavior.

Reproducing code example:

In [36]: a = np.array([254, 2, 200], dtype=np.uint8)

In [37]: b = np.array([True, False, True], dtype=np.bool)

In [38]: (a * b).sum()
Out[38]: 454

In [39]: np.dot(a, b)
Out[39]: 198

In [40]: np.dot(a, b).dtype
Out[40]: dtype('uint8')

Numpy/Python version information:

Numpy 1.15.0, Python 3.6.5

sturlamolden · 2018-11-28T01:25:11Z

Integer overflow: 198 + 256 = 454.

Should np.dot check for overflow before outputting an integer?
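The wraparound described above can be checked directly. The following is a minimal sketch that reuses the arrays from the report; the variable names `exact`, `wrapped`, and `widened` are illustrative, and `np.bool_` is used in place of `np.bool` only because it is available across NumPy versions.

```python
import numpy as np

a = np.array([254, 2, 200], dtype=np.uint8)
b = np.array([True, False, True], dtype=np.bool_)

# Exact sum of the selected elements, computed with Python ints (no overflow).
exact = sum(int(x) for x, keep in zip(a, b) if keep)

# np.dot keeps uint8 for these inputs, so the result is the exact sum modulo 2**8.
wrapped = np.dot(a, b)

# Casting one operand to a wider integer type before the call avoids the wraparound.
widened = np.dot(a.astype(np.int64), b)

print(exact, int(wrapped), int(widened))
```

Widening one operand is only one possible workaround; it trades memory for a result that cannot wrap at these magnitudes.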

mattip · 2018-12-23T09:19:44Z

The current design is to choose the output dtype without considering overflow. Overflow issues seem to trip up users frequently; perhaps an overall vision of how to deal with them is needed.
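As a rough illustration of that point, the sketch below compares the dtype of the `np.dot` result with what `np.result_type` reports for the same inputs. That the two coincide is an observation for this uint8/bool case, not a statement about how `np.dot` is implemented; the names `promoted`, `out`, and `limit` are illustrative.

```python
import numpy as np

a = np.array([254, 2, 200], dtype=np.uint8)
b = np.array([True, False, True], dtype=np.bool_)

# Dtype obtained from NumPy's type promotion of the two inputs.
promoted = np.result_type(a, b)

# Dtype actually carried by the dot product of the same inputs.
out = np.dot(a, b)

# Largest value the promoted dtype can hold; anything above it wraps around.
limit = np.iinfo(promoted).max

print(promoted, out.dtype, limit)
```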

itamarst · 2018-12-23T15:51:39Z

As mentioned above, a minimal approach would be to document the expected behavior better.

eric-wieser · 2018-12-23T21:28:48Z

If anything, I'd consider the behavior of sum to be more surprising here, and in need of more documentation
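A short sketch of what `sum` does here, again with the arrays from the report: the elementwise product stays uint8, and it is the reduction that widens the accumulator. The `dtype=` argument of `sum` is documented; the variable names are illustrative, and the exact widened dtype depends on the platform, so the code only inspects it rather than assuming a specific width.

```python
import numpy as np

a = np.array([254, 2, 200], dtype=np.uint8)
b = np.array([True, False, True], dtype=np.bool_)

# The elementwise product does not widen: it is still uint8.
product = a * b

# By default, sum accumulates small unsigned ints in a platform-sized unsigned integer.
default_total = product.sum()

# Forcing the accumulator to uint8 reproduces the wrapped np.dot result.
narrow_total = product.sum(dtype=np.uint8)

print(product.dtype, default_total.dtype, int(default_total), int(narrow_total))
```

Seen this way, the two results differ because of the accumulator dtype chosen by `sum`, not because of the multiplication.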
