dtype "f64" silently results in "float32" #5790

cdeil · 2015-04-23T15:08:19Z

I've recently been bitten by this:

>>> import numpy as np
>>> np.dtype('f64')
dtype('float32')
>>> np.__version__
'1.9.2'

Some proposals what to change:

Warning for 'f64' input?
Error for 'f64' input?
Make 'f64' result in 'float64'

I know change is difficult for numpy because of backwards-compatibility concerns.
Hopefully it's possible to be more strict here ... in my code I was getting slightly incorrect results for half a year because I was using 32-bit floats where I know I needed to use 64-bit floats, but for some reason wrote "f64" instead of "float64".

The text was updated successfully, but these errors were encountered:

rgommers · 2015-04-23T15:46:36Z

I would consider this a bug, so no need to worry about backwards compat. First thought: an error makes sense here.

jaimefrio · 2015-04-23T15:52:06Z

The correct code for float64 is 'f8', you have to give it the number of bytes, not bits.

But the behavior does seem to be wrong, it seems to be checking the letter, and if the number doesn't make sense, it just returns the default type:

>>> np.dtype('f')
dtype('float32')
>>> np.dtype('f8')
dtype('float64')
>>> np.dtype('f123')
dtype('float32')
>>> np.dtype('i')
dtype('int32')
>>> np.dtype('i8')
dtype('int64')
>>> np.dtype('i123')
dtype('int32')

I agree with Ralf that raising an error makes all the sense in the world here.

cdeil · 2015-04-23T15:54:47Z

+1 to raising an error.

If it's a simple change to make I could attach a commit here.
Is it? In which file / function?

jaimefrio · 2015-04-23T16:21:30Z

The relevant code is here. It seems there has been a deprecation warning in place since 1.7:

>>> import numpy as np
>>> import warnings
>>> warnings.simplefilter('always')
>>> np = np.dtype('f64')
__main__:1: DeprecationWarning: Specified size is invalid for this data type.
Size will be ignored in NumPy 1.7 but may throw an exception in future versions.

Has the time come to turn this into an error in 1.10? I guess discussing on the list would be in order. Can you send a message to the mailing list, Christoph?

charris · 2015-04-23T16:46:22Z

Raising an error looks like the right thing. Because it has been deprecated, I don't think list discussion is needed.

cdeil · 2015-04-23T16:52:16Z

Looking at

numpy/numpy/core/src/multiarray/conversion_utils.c

Line 1023 in 2e3778a

PyArray_TypestrConvert(int itemsize, int gentype)

it's not clear at all to me what needs to be done.

Could someone else here please take care of this change (or discussion if needed)?

Thanks!

jaimefrio · 2015-04-23T16:53:36Z

I'll look into it today or tomorrow.

cdeil · 2015-05-04T15:28:36Z

@jaimefrio - ping

jaimefrio · 2015-05-05T13:27:10Z

There's a tricky thing here, which has me swamped with test errors in trying to fix this... If you look at the code I linked above, one of the behaviors that is explicitly deprecated is using codes 'O4' or 'O8' for object arrays. Getting rid of the deprecation warning, including some special casing to handle pickling that should still accept, is not much of a problem. The issue is that, with current master:

In [5]: np.dtype('O').str
Out[5]: '|O8'

That is, we have deprecated, and want to turn into an error, a dtype string descriptor... that is the one dtype.str is producing!

I see 3 ways to move forward on this:

Leave the object dtypes as is, and turn into errors all other dtypes only.
Turn into errors all dtypes except object, change the string representation of object dtypes to remove the trailing size, but leave the object dtypes interpretation as is, still accepting the trailing size descriptor.
Turn all dtypes to errors and remove the trailing size from object dtype descriptors.

Doing 1 is just punting on the object behavior, not my favorite. Doing 3 may be a little too aggressive, although it is the state we wish to one day arrive at, and what I am leaning towards right now. Doing 2 now, and turning trailing sizes on object dtypes into errors in the next release may be the aristotelian golden mean.

Thoughts are welcome!

mhvk · 2015-05-05T13:48:47Z

FWIW: I'd split the solution into two, one implementing (1) and thus solving the present issue, the other going after the object dtypes (where I am not completely sure what the best solution would be; probably your (2) first, then (3), just to be sure, though maybe directly going to (3) is fine).

njsmith · 2015-05-05T22:44:14Z

Sounds to me like you've answered your question :-).

No reason to delay the deprecation->changes for non-O dtypes. For O we
definitely should stop emitting those integers regardless, since they are
truly meaningless. Given the situation, it would make sense to be a but
lenient and keep accepting O4 and O8 with warnings for now, so they'll take
another release cycle to catch up with the other dtypes. (As a small
refinement we could start erroring on all O sizes that aren't 4 or 8 now.)
But the main thing is that in a few releases we eventually stabilize on the
right thing, I.e. all meaningless size specifiers are errors.
On May 5, 2015 6:27 AM, "Jaime" notifications@github.com wrote:

There's a tricky thing here, which has me swamped with test errors in
trying to fix this... If you look at the code I linked above, one of the
behaviors that is explicitly deprecated is using codes 'O4' or 'O8' for
object arrays. Getting rid of the deprecation warning, including some
special casing to handle pickling that should still accept, is not much of
a problem. The issue is that, with current master:

In [5]: np.dtype('O').str
Out[5]: '|O8'

That is, we have deprecated, and want to turn into an error, a dtype
string descriptor... that is the one dtype.str is producing!

I see 3 ways to move forward on this:

Leave the object dtypes as is, and turn into errors all other
dtypes only.

Turn into errors all dtypes except object, change the string
representation of object dtypes to remove the trailing size, but leave the
object dtypes interpretation as is, still accepting the trailing size
descriptor.

Turn all dtypes to errors and remove the trailing size from object
dtype descriptors.

Doing 1 is just punting on the object behavior, not my favorite. Doing 3
may be a little too aggressive, although it is the state we wish to one day
arrive at, and what I am leaning towards right now. Doing 2 now, and
turning trailing sizes on object dtypes into errors in the next release may
be the aristotelian golden mean.

Thoughts are welcome!

—
Reply to this email directly or view it on GitHub
#5790 (comment).

Fixes numpy#5790

rgommers added 00 - Bug component: numpy._core labels Apr 23, 2015

jaimefrio added a commit to jaimefrio/numpy that referenced this issue May 6, 2015

MANT: Turn deprecated dtype string warnings into errors

45bbce7

Fixes numpy#5790

jaimefrio mentioned this issue May 6, 2015

MANT: Turn deprecated dtype string warnings into errors #5840

Merged

charris closed this as completed in #5840 May 6, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dtype "f64" silently results in "float32" #5790

dtype "f64" silently results in "float32" #5790

cdeil commented Apr 23, 2015

rgommers commented Apr 23, 2015

jaimefrio commented Apr 23, 2015

cdeil commented Apr 23, 2015

jaimefrio commented Apr 23, 2015

charris commented Apr 23, 2015

cdeil commented Apr 23, 2015

jaimefrio commented Apr 23, 2015

cdeil commented May 4, 2015

jaimefrio commented May 5, 2015

mhvk commented May 5, 2015

njsmith commented May 5, 2015

dtype "f64" silently results in "float32" #5790

dtype "f64" silently results in "float32" #5790

Comments

cdeil commented Apr 23, 2015

rgommers commented Apr 23, 2015

jaimefrio commented Apr 23, 2015

cdeil commented Apr 23, 2015

jaimefrio commented Apr 23, 2015

charris commented Apr 23, 2015

cdeil commented Apr 23, 2015

jaimefrio commented Apr 23, 2015

cdeil commented May 4, 2015

jaimefrio commented May 5, 2015

mhvk commented May 5, 2015

njsmith commented May 5, 2015