ENH: DOC: improving error messages for casting errors in ufuncs #14828 #14843

micha2718l · 2019-11-06T20:34:10Z

This PR adds functionality to create better error messages when casting
cannot be completed while applying a ufunc.
There are probably still some bugs or edge cases, as well as formatting issues,
but the functions seem to work and give messages such as:
UFuncTypeError: Output of ufunc add(uint64, int64, out=uint64) resolved to the add(float64, float64, out=float64) loop, but np.can_cast(np.float64, np.uint64, casting='same_kind') is False

Fixes #14828

mattip · 2019-11-07T00:25:52Z

In principle this looks like an improvement. Could you show a direct comparison of old-to-new? Also, is there a shorthand notation we could use to keep the error message length from getting out of hand? Perhaps dtype.str would be more compact.
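For reference, a small sketch of what the two spellings look like for the dtypes in this example. It only uses the public `name` and `str` attributes of `np.dtype`; the byte-order character at the start of `str` depends on the platform, so it is stripped before the compact forms are collected.

```python
import numpy as np

codes = ("u8", "i8", "f8")

for code in codes:
    dt = np.dtype(code)
    # name is the long form used in the proposed message, str is the compact form
    print(f"{dt.name:<8} {dt.str}")

names = [np.dtype(c).name for c in codes]
# Drop the leading byte-order marker ('<', '>', '=' or '|') from dtype.str
compact = [np.dtype(c).str.lstrip("<>=|") for c in codes]
```

The compact form is shorter, but it is arguably less readable for users who do not know the type codes.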

eric-wieser · 2019-11-07T00:32:46Z

numpy/core/src/umath/ufunc_type_resolution.c

+            PyTuple_SetItem(froms, j, (PyObject *)PyArray_DESCR(operands[j]));
+            PyTuple_SetItem(tos, j, (PyObject *)dtypes[j]);


Missing Py_INCREF(value) here - PyTuple_SetItem steals references.

Also, you might as well use PyTuple_SET_ITEM; it's faster and no more dangerous.

Thanks for all your helpful comments so far!
I included the Py_INCREF on the dtypes[j] object and it removed a segfault in an edge case I found. I don't think there needs to be one for the PyArray_DESCR(operands[j]) object because PyArray_DESCR returns a borrowed reference; correct me if I misunderstand something there.

Both are borrowed references, and this is exactly why you do need to incref them an extra time: when you call a function like PyTuple_SET_ITEM that steals a reference, you must make sure the reference is yours to give (i.e. not borrowed).

eric-wieser · 2019-11-07T00:35:34Z

These lines will likely need to change too:

numpy/numpy/core/_methods.py

Lines 83 to 96 in dcc1fc2

    # try to deal with broken casting rules
    try:
        return ufunc(*args, out=out, **kwargs)
    except _exceptions._UFuncOutputCastingError as e:
        # Numpy 1.17.0, 2019-02-24
        warnings.warn(
            "Converting the output of clip from {!r} to {!r} is deprecated. "
            "Pass `casting=\"unsafe\"` explicitly to silence this warning, or "
            "correct the type of the variables.".format(e.from_, e.to),
            DeprecationWarning,
            stacklevel=2
        )
        return ufunc(*args, out=out, casting="unsafe", **kwargs)

eric-wieser · 2019-11-07T00:39:40Z

I think it might be worth renaming the python class members to be something like:

@_display_as_base
class _UFuncCastingError(UFuncTypeError):
    def __init__(self, ufunc, casting, call_dtypes, loop_dtypes):
        super().__init__(ufunc)
        self.casting = casting
        self.call_dtypes = call_dtypes
        self.loop_dtypes = loop_dtypes

@_display_as_base
class _UFuncInputCastingError(_UFuncCastingError):
    # other members as before
    @property
    def from_(self):
        return self.call_dtypes[self.in_i]
    @property
    def to(self):
        return self.loop_dtypes[self.in_i]

@_display_as_base
class _UFuncOutputCastingError(_UFuncCastingError):
    # other members as before
    @property
    def from_(self):
        return self.loop_dtypes[self.ufunc.nin + self.out_i]
    @property
    def to(self):
        return self.call_dtypes[self.ufunc.nin + self.out_i]

This has the nice benefit of not changing the meaning of the to and from_ properties.
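A self-contained sketch of how the output-casting variant of that proposal would behave. Everything here is a simplified stand-in rather than the NumPy implementation: the class names without leading underscores, the `fake_add` object, and the plain strings used in place of dtype objects are all hypothetical, and `_display_as_base` is omitted.

```python
from types import SimpleNamespace


class UFuncTypeError(TypeError):
    """Stand-in for the NumPy base class; it only stores the ufunc."""

    def __init__(self, ufunc):
        self.ufunc = ufunc


class UFuncCastingError(UFuncTypeError):
    def __init__(self, ufunc, casting, call_dtypes, loop_dtypes):
        super().__init__(ufunc)
        self.casting = casting
        self.call_dtypes = call_dtypes  # dtypes the caller passed in
        self.loop_dtypes = loop_dtypes  # dtypes of the resolved loop


class UFuncOutputCastingError(UFuncCastingError):
    def __init__(self, ufunc, casting, call_dtypes, loop_dtypes, out_i):
        super().__init__(ufunc, casting, call_dtypes, loop_dtypes)
        self.out_i = out_i

    @property
    def from_(self):
        # The loop produces this dtype ...
        return self.loop_dtypes[self.ufunc.nin + self.out_i]

    @property
    def to(self):
        # ... and it has to be cast to the dtype of the caller's output.
        return self.call_dtypes[self.ufunc.nin + self.out_i]


# Hypothetical ufunc-like object with two inputs and one output
fake_add = SimpleNamespace(__name__="add", nin=2, nout=1)

err = UFuncOutputCastingError(
    fake_add,
    "same_kind",
    call_dtypes=("uint64", "int64", "uint64"),
    loop_dtypes=("float64", "float64", "float64"),
    out_i=0,
)
print(err.from_, "->", err.to)  # float64 -> uint64
```

Code that only reads `e.from_` and `e.to`, such as the clip deprecation path, would keep working while the full dtype tuples become available for the longer message.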

micha2718l

@mattip

For the original example (mentioned in issue #14828):

import numpy as np
x = np.arange(5, dtype='u8')
y = np.arange(5, dtype='i8')
x += y

The result for the current codebase is:
TypeError: Cannot cast ufunc add output from dtype('float64') to dtype('uint64') with casting rule 'same_kind'

The result from this change is:
UFuncTypeError: Output of ufunc add(uint64, int64, out=uint64) resolved to the add(float64, float64, out=float64) loop, but np.can_cast(np.float64, np.uint64, casting='same_kind') is False

I agree that a shorter message is better, but I opted for the np.* form because it allows the code from the message to be copied and pasted directly into a REPL to play with it. If brevity takes precedence, I can take the suggestion and make it less verbose.
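A runnable sketch of the reproduction that also checks the two facts the new message reports. It assumes that `np.result_type` agrees with the loop chosen for `add` in this particular case, which is not how loops are resolved in general, and the exact wording of the exception depends on the installed NumPy version.

```python
import numpy as np

x = np.arange(5, dtype="u8")
y = np.arange(5, dtype="i8")

# uint64 and int64 have no common integer type, so the result is float64
loop_dtype = np.result_type(x, y)

# The in-place add must cast that float64 result back into the uint64 output
castable = np.can_cast(loop_dtype, x.dtype, casting="same_kind")
print(loop_dtype, castable)

try:
    x += y
except TypeError as exc:
    # The casting error is a TypeError subclass
    message = str(exc)
    print(type(exc).__name__, message)
else:
    message = None
```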

micha2718l · 2019-11-10T22:35:09Z

numpy/core/src/umath/ufunc_type_resolution.c

+                PyTuple_SET_ITEM(froms, j, Py_None);
+            } else {
+                PyTuple_SET_ITEM(froms, j, (PyObject *)PyArray_DESCR(operands[j]));
+            }


This check seems necessary in some cases where the operands are not all available. Is there a chance that the dtypes will contain NULL as well?

Good question, I don't know the answer.

adeak · 2020-04-02T23:29:17Z

As only a regular user of numpy, the mixed error message (mixed with respect to the presence of np) seems a bit weird to me. The first two function calls and their arguments are written without np, and I think it would look both shorter and better to omit np altogether:

UFuncTypeError: Output of ufunc add(uint64, int64, out=uint64) resolved to the add(float64, float64, out=float64) loop, but can_cast(float64, uint64, casting='same_kind') is False

It's true that you can't directly throw the last call into a REPL if you only have numpy imported as np, but you can alternatively import those three names instead. Or trust the interpreter when it says the result is False :)

micha2718l · 2020-04-03T00:14:35Z

I totally see your point with the consistency. Any other opinions out there? If it were my choice I would want to have np on all parts, though it does create longer messages. I think there are valid points on both sides.

eric-wieser · 2020-04-03T03:36:47Z

numpy/core/_exceptions.py

-        i_str = "{} ".format(self.out_i) if self.ufunc.nout != 1 else ""
+        if self.ufunc.nout > 1:
+            from_outs = ", ".join(["out{}={}".format(i+1, f.name) for i, f in enumerate(self.from_) if i>=self.ufunc.nin])
+            to_outs = ", ".join(["out{}={}".format(i+1, t.name) for i, t in enumerate(self.to) if i>=self.ufunc.nin])


This doesn't produce a legal call, it should be:

Suggested change

-            to_outs = ", ".join(["out{}={}".format(i+1, t.name) for i, t in enumerate(self.to) if i>=self.ufunc.nin])
+            to_outs = "out=({})".format(", ".join(t.name for t in self.to[self.ufunc.nin:]))

(same for the other three lines)
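To illustrate why only the tuple form is a legal call, here is a short sketch using `np.divmod`, a ufunc with two outputs. The array names are arbitrary; the point is that ufuncs accept multiple outputs through a single `out=(...)` tuple and reject keywords such as `out1`.

```python
import numpy as np

a = np.arange(1, 6)
q = np.empty_like(a)
r = np.empty_like(a)

# Multiple outputs are passed as one tuple
np.divmod(a, 2, out=(q, r))
print(q, r)

try:
    # out1/out2 are not keywords that ufuncs understand
    np.divmod(a, 2, out1=q, out2=r)
except TypeError as exc:
    rejected = True
    print("rejected:", exc)
else:
    rejected = False
```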

eric-wieser · 2020-04-03T03:42:35Z

Any other opinions out there? If it were my choice I would want to have np on all parts

NumPy is not the only module capable of defining ufuncs, so prepending np. to add would most likely be incorrect for errors from, say, scipy ufuncs.
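A minimal sketch of that situation without needing scipy: `np.frompyfunc` builds a ufunc from an ordinary Python function, and the resulting object does not live in the `np` namespace. The function `clipped_sum` is a made-up example, not anything from this PR.

```python
import numpy as np


def clipped_sum(a, b):
    # Hypothetical user-defined operation
    return min(a + b, 10)


# A genuine ufunc object that is not part of numpy itself
my_ufunc = np.frompyfunc(clipped_sum, 2, 1)
print(type(my_ufunc).__name__, my_ufunc.__name__)

# Prefixing its name with "np." would point at something that does not exist
in_numpy_namespace = hasattr(np, my_ufunc.__name__)
result = my_ufunc(7, 8)
```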

eric-wieser · 2020-04-21T11:25:14Z

@micha2718l, do you think you'll have time to come back to this?

mattip · 2020-10-16T07:37:44Z

@micha2718l it would be nice to finish this up. We are nearing the cutoff for the next release.

micha2718l · 2020-10-27T20:34:43Z

@mattip @eric-wieser
I think I took care of the outstanding issues with this PR. Let me know if there are any other details to think about.

adeak · 2020-10-28T00:42:19Z

I only have the same trivial remark as earlier: the mixture of np.can_cast and np./non-np. types in that error message looks a bit weird to me :)

But this might be bikeshedding territory, and I don't have the perspective to tell whether my preference is actually better (but unless I missed it I don't think we got a clear core dev statement on the matter, beyond that np.add would definitely be off). I'm sorry if I just missed it.

micha2718l · 2020-10-28T02:37:10Z

I missed that comment when I returned to this (it had been a while, so I had forgotten about it). I still think there are valid points on either side; if any core devs have an opinion, it is welcome. I'm open to any of the three potential options here: all np, no np, or the hybrid as is, which balances brevity and usefulness IMHO.

eric-wieser · 2020-10-28T08:43:19Z

Thanks for coming back to this! I think this comment still applies: #14843 (comment)

eric-wieser · 2020-10-28T14:08:53Z

What are your thoughts on this comment: #14843 (comment)?

eric-wieser · 2020-10-28T14:13:11Z

numpy/core/_exceptions.py

+        else:
+            from_outs = "out={}".format(self.from_[-1])
+            to_outs = "out={}".format(self.to[-1])
+
        return (


Might be worth a helper function somewhere:

def _ufunc_call_sig(ufunc, arg_dtypes):
    # arg_dtypes holds the input dtypes followed by the output dtypes
    assert len(arg_dtypes) == ufunc.nin + ufunc.nout
    ins = ", ".join(d.name for d in arg_dtypes[:ufunc.nin])
    if ufunc.nout > 1:
        outs = "out=({})".format(", ".join(d.name for d in arg_dtypes[ufunc.nin:]))
    else:
        outs = "out={}".format(arg_dtypes[-1].name)
    return "{}({}, {})".format(ufunc.__name__, ins, outs)

which you can then call as _ufunc_call_sig(self.ufunc, self.from_) and _ufunc_call_sig(self.ufunc, self.to) in the error messages
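A sketch of how such a helper could feed the full message, assuming every entry is a real `np.dtype`. The names `ufunc_call_sig` and `output_casting_message` are hypothetical and only illustrate the idea; they are not the code merged into NumPy, and the message text follows the np-free variant discussed above.

```python
import numpy as np


def ufunc_call_sig(ufunc, arg_dtypes):
    """Format a ufunc call such as 'add(uint64, int64, out=uint64)'."""
    assert len(arg_dtypes) == ufunc.nin + ufunc.nout
    ins = ", ".join(d.name for d in arg_dtypes[:ufunc.nin])
    if ufunc.nout > 1:
        outs = "out=({})".format(", ".join(d.name for d in arg_dtypes[ufunc.nin:]))
    else:
        outs = "out={}".format(arg_dtypes[-1].name)
    return "{}({}, {})".format(ufunc.__name__, ins, outs)


def output_casting_message(ufunc, casting, call_dtypes, loop_dtypes, out_i=0):
    """Build the output-casting message from the call and loop dtypes."""
    i = ufunc.nin + out_i
    return (
        "Output of ufunc {} resolved to the {} loop, but "
        "can_cast({}, {}, casting={!r}) is False"
    ).format(
        ufunc_call_sig(ufunc, call_dtypes),
        ufunc_call_sig(ufunc, loop_dtypes),
        loop_dtypes[i].name,
        call_dtypes[i].name,
        casting,
    )


u8, i8, f8 = (np.dtype(c) for c in ("u8", "i8", "f8"))

msg = output_casting_message(np.add, "same_kind", [u8, i8, u8], [f8, f8, f8])
print(msg)

# A ufunc with two outputs gets the tuple form
divmod_sig = ufunc_call_sig(np.divmod, [i8, i8, i8, i8])
print(divmod_sig)
```

Keeping the signature formatting in one place means the input and output error classes cannot drift apart in how they spell the call.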

I do like this concept, I'm putting it together now to see how that works out. Helper function likely good, but either way something to have the meanings of from/to not switching around is a good thing.

I'm putting it together now to see how that works out.

Were you able to finish this up?

@eric-wieser @mattip Sorry about the 4-year delay; I had some time and was reminded of this. I believe the current state reflects the merged changes from main and has the helper function in place. I know there is a lot going on currently with the 2.0 release, but when you get a minute, please let me know if anything still looks missing or should change in this PR.

micha2718l added 7 commits November 5, 2019 16:07

Merge pull request #1 from numpy/master

ec1d648

catch up to numpy

pushing all args out of casting errors

0905541

format more explicit error messages for output

9dbbe6b

add input casting message, clean up

0cae4f0

from-to TO operands-dtypes

e3852d7

clean out code

938b913

Merge pull request #3 from numpy/master

38a9197

catch up to master

mattip added the 01 - Enhancement label Nov 6, 2019

eric-wieser reviewed Nov 7, 2019

micha2718l added 6 commits November 10, 2019 16:14

add check for NULL in froms/tos and add Py_INCREF due to PyTupleSET_ITEM

3e4b2f0

add check for missing operands

028686d

adding missing Py_INCREF

114c705

adjusting test to accommodate new cast error

ce25875

change return from NULL to -1

d71f6bb

fixing deprecation warning, show relevant dtypes

56b39ef

micha2718l commented Nov 13, 2019

eric-wieser reviewed Apr 3, 2020

mattip removed the 04 - Documentation label Sep 2, 2020

micha2718l added 5 commits October 18, 2020 14:02

Merge pull request #5 from numpy/master

bae2dbc

merge origin nump/numpy

[DOC] issue-14828 updating error message in guide

0327900

[ENH][DOC] cleaning up duplicated code

b14b380

[ENH][DOC] fixing ref counting in Py_BuildValue

58ba2a3

[ENH][DOC] clean up code around 'outs' portion of error message

c5a0106

eric-wieser reviewed Oct 28, 2020

micha2718l added 2 commits October 28, 2020 09:18

[ENH][DOC] fixing last from/to _outs strings and removing np from message

d55b23f

[ENH][DOC] updating quickstart to not include np in output error

be2d94f

eric-wieser reviewed Oct 28, 2020

kratsg mentioned this pull request Nov 14, 2020

json2xml breaks for a simple JSON scikit-hep/pyhf#1177

Closed

Base automatically changed from master to main March 4, 2021 02:04

InessaPawson added 52 - Inactive Pending author response and removed 61 - Stale labels Jun 16, 2022

micha2718l added 9 commits February 10, 2024 20:24

Merge remote-tracking branch 'upstream/main' into issue-14828

0233ced

fix raise_output_casting_error inputs

36c7c98

Merge branch 'main' into issue-14828

404cd3f

style

4e22d7a

style

69184e6

quickstart docs tests

75f9050

Merge branch 'main' into issue-14828

e0bd530

helper function for _ufunc_call_sig

c60d31e

Merge branch 'main' into issue-14828

034bc5d
