WIP: Deprecate undefined == comparisons #18856

nalimilan · 2016-10-09T20:44:39Z

This PR is a first stab at deprecating cross-type comparisons for which we previously fell back on === (#15983).

The first commit uses === where it makes sense and can be merged separately (#18853).

The second commit is more controversial, and could also be discussed in a separate PR: it changes all in and find* methods to use isequal consistently (until now only Dict did this for in). Relevant discussions are #9381 (about in), and #16269 and #18668 (about find). This change allows making == an error for arbitrary comparisons without breaking too much code. Indeed, it is relatively common to store heterogeneous non-comparable types in arrays or non-standard dicts (which use ==, contrary to Dict): in particular, strings and chars are stored together in the LineEdit/REPL code, but that's also the case of other high-level objects like Base.Multimedia.display. Note that this change is not strictly needed to make == throw an error for non-comparable types: the code could be made more careful instead. But I started with this minimally invasive strategy (in terms of lines of code to change) to be able to go to the next step without wasting time.
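To make the difference concrete, here is a minimal sketch of how == and isequal diverge in membership tests. It assumes Julia 1.x behaviour (where in on arrays still compares with == and Dict keys are matched with isequal), not the behaviour of this branch; the variable names are only for the example.

```julia
# Assumes Julia 1.x semantics; not taken from the PR itself.
xs = [NaN, -0.0]

nan_found_eq      = NaN in xs                      # `in` on arrays compares with ==
nan_found_isequal = any(y -> isequal(NaN, y), xs)  # what an isequal-based `in` would report

zero_found_eq      = 0.0 in xs
zero_found_isequal = any(y -> isequal(0.0, y), xs)

# Dict keys are already matched with isequal (through hash)
d = Dict(NaN => "not a number")
nan_is_key = haskey(d, NaN)

println((nan_found_eq, nan_found_isequal))    # (false, true)
println((zero_found_eq, zero_found_isequal))  # (true, false)
println(nan_is_key)                           # true
```

Switching in to isequal would fix the NaN case but flip the answer for 0.0 versus -0.0, which is the trade-off discussed in #9381.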

The third commit changes more comparisons to be robust against the change in behavior of ==. It doesn't make much sense in isolation, but it doesn't break the tests either.

The fourth commit is the most interesting and challenging. Indeed, isequal falls back to ==, yet we don't want the former to print deprecation warnings nor throw errors (after the next release). So we need a way for isequal to change the behavior of == for its needs. So far I've found two possible approaches:

Use a try... catch block to catch the NotComparableError thrown by ==. This is unacceptable for performance unless the compiler is able to determine statically that the exception is thrown reliably by a method. Also, it doesn't work to silence deprecation warnings (for the first phase).
Use a mechanism similar to @inbounds/@boundscheck to tell == that no exception should be thrown. isequal would set this flag when calling ==. This is the one I've retained here, temporarily abusing @inbounds; if we choose it, a separate macro would be warranted (e.g. @comparable).
Actually, this second approach is quite powerful, and it would offer an alternative behavior: instead of having == throw and isequal return false, both methods could throw by default, unless they are marked with @comparable. in and find* would use it by default to allow comparing arbitrary types. The advantage of this solution is that we wouldn't have to conflate two meanings into isequal: IEEE floating point semantics and arbitrary comparisons don't necessarily go together. The downside is that it could be more complex for users.
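The two approaches can be sketched roughly as follows. Everything here is hypothetical: stricteq, eq_trycatch and flageq are illustration names, Base has no NotComparableError, and the keyword flag only imitates the compiler-level @inbounds-style mechanism used in the branch. The code assumes Julia 1.x syntax.

```julia
# Hypothetical names throughout; Base has no NotComparableError.
struct NotComparableError <: Exception
    x
    y
end

# Stand-in for a strict ==: defined only for pairs that "make sense".
stricteq(x::Number, y::Number) = x == y
stricteq(x::AbstractString, y::AbstractString) = x == y
stricteq(x, y) = throw(NotComparableError(x, y))

# Approach 1: catch the error and report "not equal" instead.
function eq_trycatch(x, y)
    try
        return stricteq(x, y)
    catch err
        err isa NotComparableError || rethrow()
        return false
    end
end

# Approach 2 (rough analogue): the caller passes a flag saying whether
# the fallback may throw. A keyword argument only imitates the idea of
# a compiler-level flag.
flageq(x::Number, y::Number; strict::Bool=true) = x == y
flageq(x::AbstractString, y::AbstractString; strict::Bool=true) = x == y
flageq(x, y; strict::Bool=true) = strict ? throw(NotComparableError(x, y)) : false

println(eq_trycatch(1, "x"))           # false
println(flageq(1, "x"; strict=false))  # false
println(flageq(1, 1.0))                # true
```

In this sketch an isequal-like caller would use strict=false, while user code calling flageq directly gets the error for unrelated types.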

Anyway, this is obviously quite rough. Comments and advice welcome.

vtjnash · 2016-10-09T21:31:23Z

base/Enums.jl

@@ -9,6 +9,9 @@ abstract Enum

 Base.convert{T<:Integer}(::Type{T}, x::Enum) = convert(T, box(Int32, x))

+Base.:(==)(x::Enum, y::Integer) = Int32(x) == y
+Base.:(==)(x::Integer, y::Enum) = x == Int32(y)


I don't think these should exist. Enum and Integer aren't compatible types. Why was this needed?

I think I added this because of comparisons done with integer codes returned by C, e.g. this one. But we can tell people to call Cint before ==, like in most other places (e.g. here). Will fix.

Done. These lines were broken from the start, so that's the first bug uncovered by the new strict rules.
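As a small illustration of the "convert before ==" advice, assuming Julia 1.x (where comparing an Enum with an Integer simply returns false): the enum and the return code below are made up for the example.

```julia
# Assumes Julia 1.x; the enum and the return code are invented for illustration.
@enum Status ok=0 failed=1

c_return_code = Cint(1)   # pretend this came back from a ccall

direct    = failed == c_return_code        # Enum vs Integer: no conversion happens
converted = Cint(failed) == c_return_code  # convert explicitly, then compare

println(direct)     # false
println(converted)  # true
```

Under the stricter rules proposed in this PR, the direct comparison would raise an error rather than silently returning false.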

vtjnash · 2016-10-09T21:34:34Z

base/array.jl

@@ -992,7 +993,7 @@ julia> findnext(A,3)
 """
 function findnext(A, start::Integer)
    for i = start:length(A)
-        if A[i] != 0
+        if !isequal(A[i], 0)


There was a recent proposal to make this 'iszero(x)' or zero(T), since == 0 doesn't really work right for some custom types

Yes, that would fit well with this PR. Then we would need to have the fallback iszero call isequal (rather than ==). And since isequal(0, -0.0) == false, we should define iszero on floats so that iszero(-0.0) == true.
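For reference, this is roughly how the iszero idea behaves in Julia 1.x, where iszero and a predicate-taking findnext both exist (neither was settled at the time of this thread). The array is only an example.

```julia
# Assumes Julia 1.x, where iszero and predicate-taking findnext exist.
A = [0.0, -0.0, 3.0, 0.0]

neg_zero_isequal = isequal(0, -0.0)   # isequal distinguishes the sign of zero
neg_zero_iszero  = iszero(-0.0)       # iszero does not

# First index at or after 1 whose element is not zero
idx = findnext(!iszero, A, 1)

println(neg_zero_isequal)  # false
println(neg_zero_iszero)   # true
println(idx)               # 3
```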

TotalVerb · 2016-10-10T07:26:22Z

So the new semantics will be: == numeric equality, isequal general equality, and === egality?

nalimilan · 2016-10-10T07:32:23Z

So the new semantics will be: == numeric equality, isequal general equality, and === egality?

== will not be only for numeric values. I'd say it should be used to compare "comparable" values, i.e. it offers the additional safety that it will error if you try to compare inadvertently values from two types which can never be equal.

TotalVerb · 2016-10-10T07:48:14Z

That makes sense. Could we preserve a subset of useful comparisons between noncomparable types?

Expr and anything... useful for metaprogramming. === won't do here.
Void and anything. === works but not in all cases e.g. Union{String,Void}

I would prefer not to use isequal in these cases.

nalimilan · 2016-10-10T08:22:51Z

Yeah, these exceptions should definitely be considered. Void is very special already in that it's not really useful to compare nothing only with values of the same type. Expressions are also frequently used in contexts where other types can appear, but I can't tell what's the best behaviour for them.

JeffBezanson · 2016-10-10T21:50:24Z

I really want to avoid having a long complicated list saying when to use isequal vs. ==. With only three functions (isequal, ==, ===) trying to implement many distinctions, you get possibly-unwanted behaviors piggybacking on others. For example, if I have x == 0.0 and I want it to be type-permissive, I'll switch to isequal(x, 0.0), but then the behavior on -0.0 also changes.

+1 to the first commit though.

TotalVerb · 2016-10-10T22:09:29Z

Maybe we should add some more equality predicates. We have a countable list of operators ====, =====, ======, etc. available. Looking forward to writing x ============== y in the near future...

Who could've thought that equality was such a hard problem?

StefanKarpinski · 2016-10-11T05:16:39Z

Who could've thought that equality was such a hard problem?

Anyone who paid attention to decades of Lisp debates on the subject :)

StefanKarpinski · 2016-10-11T05:32:27Z

@JeffBezanson: I think there's something very telling about the fact that the rest of this change forces the part that you like – and that without the errors that a stricter == imposes, it seems hard to have the discipline to use === and == as appropriate. I still like that this change gives the three equalities three very clear roles:

===: can compare any kinds of values, captures programmatic distinguishability (egal)
isequal: can compare any kinds of values, captures equivalence classes of values
==: can only make comparisons that "make sense", captures intuitive equality

We've tried to "sweep isequal under the carpet", but it doesn't really work that well – you end up explaining it anyway and the fact that == and isequal are almost the same makes the explanation really confusing and makes it really hard to know which one to use. This distinction makes it easier. Are you comparing two completely arbitrary values? Then use === or isequal. Are you comparing two things that should "make sense to compare" or give an error? Then use ==.
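The three roles can be compared side by side with a short sketch. It assumes Julia 1.x semantics, in which == on unrelated types still returns false; compare3 is a helper name invented for the example.

```julia
# Assumes Julia 1.x semantics.
# Each result is (==, isequal, ===) applied to the same pair of values.
compare3(a, b) = (a == b, isequal(a, b), a === b)

println(compare3(1, 1.0))     # (true, true, false)
println(compare3(NaN, NaN))   # (false, true, true)
println(compare3(0.0, -0.0))  # (true, false, false)
println(compare3([1], [1]))   # (true, true, false)
println(compare3(1, "1"))     # (false, false, false)
```

Under the proposal in this PR, only the last row would change: the == entry would become an error instead of false.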

TotalVerb · 2016-10-11T07:41:53Z

Jeff's point about negative zeroes still worries me. Why is it that type-permissive equality comes with additional stuff, like NaN and zero behaviour? It seems that two orthogonal concepts are being combined.

nalimilan · 2016-10-11T08:38:37Z

Jeff's point about negative zeroes still worries me. Why is it that type-permissive equality comes with additional stuff, like NaN and zero behaviour? It seems that two orthogonal concepts are being combined.

They are theoretically orthogonal, but in practice there's one strong reason to have them linked: the fact that one wants to be able to use NaN as a dictionary key. I would be inclined to make isequal(0.0, -0.0) return true, since that's the source of all pains around in (cf. #9381). Then the meaning of isequal would be much clearer.

JeffBezanson · 2016-10-11T14:42:27Z

I think there's something very telling about the fact that the rest of this change forces the part that you like

Yes, but it also forces changes that I don't like, namely randomly changing several comparisons from == to isequal. The problem is that it's not so easy to know when you're comparing two arbitrary values in generic code. In general find and in might be looking for an arbitrary value in an arbitrary container, but somebody who writes 0.0 in A likely expects floating-point equality. There's an entire (inconclusive) issue on that topic, so it would be a bit odd to casually flip that switch now just to work around some newly-added error cases.

I agree that if == and isequal behaved the same on -0.0 and NaN then this change would be easier to accept, since type-permissiveness would be the only distinction to think about. I don't find it all that clear to say that == is for comparisons that "make sense", since then the debate is about which comparisons make sense. The list of exceptions discussed above, e.g. Union{T,Void} or Expr == Symbol, is exhibit A.

JeffBezanson · 2016-10-11T15:13:30Z

base/operators.jl

+isequal(::Void, ::Void) = true
+isequal(::Void, ::Any) = false
+isequal(::Any, ::Void) = false
+isequal(x::Union{Method, TypeName}, y::Union{Method, TypeName}) = x === y


Where did this come up? Would it be possible to address by changing some more calls to use ===?

Unfortunately, it's hard to track since the failure happens during bootstrap, and the backtrace doesn't mention the function name (even with julia-debug). Is there anything I can do to improve that?

StefanKarpinski · 2016-10-11T19:18:04Z

The problem is that it's not so easy to know when you're comparing two arbitrary values in generic code. In general find and in might be looking for an arbitrary value in an arbitrary container, but somebody who writes 0.0 in A likely expects floating-point equality.

I opened #9381 specifically because it's probably better to use isequal when computing x in A rather than == (and you seemed to agree, at least initially). I think that NaN in [NaN] being false is worse than 0.0 in [-0.0] being false. Of course, the latter could be fixed if we make isequal(-0.0, 0.0) true; another argument to consider in #9381.

StefanKarpinski · 2016-10-11T19:25:41Z

I don't find it all that clear to say that == is for comparisons that "make sense", since then the debate is about which comparisons make sense. The list of exceptions discussed above, e.g. Union{T,Void} or Expr == Symbol, is exhibit A.

There is some room for debate here, but it's clear that some comparisons don't make any sense like "foo" == 1.2. I'm not sure what the exact criterion to use should be but for dict keys we used isequal(x, convert(T, x)) to check if x was a sane value of type T for a while. This one is harder to define because of the desire for symmetry.
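The dict-key criterion mentioned here could be sketched along these lines. sane_value_of is a hypothetical helper name, and the try/catch wrapper is an assumption added so that failed conversions count as "not sane"; the code assumes Julia 1.x.

```julia
# Hypothetical helper; assumes Julia 1.x.
function sane_value_of(::Type{T}, x) where {T}
    try
        return isequal(x, convert(T, x))
    catch
        return false   # convert threw: x has no faithful representation as T
    end
end

println(sane_value_of(Int, 1.0))        # true
println(sane_value_of(Int, 1.5))        # false (convert raises InexactError)
println(sane_value_of(Float64, "foo"))  # false (no such convert method)
```

The check is one-directional (it asks whether x fits into T), which is why it does not carry over directly to a symmetric operator like ==.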

JeffBezanson · 2016-10-11T19:49:08Z

I thought of a more formal way to express what I'm thinking. This change breaks a type embedding property: when you move from domain A to a super-domain B (e.g. moving from real to complex), elements in A should behave the same. So moving from a comparison that works on Float64 to one that works on Union{Float64, X} should not compare Float64s differently.

StefanKarpinski · 2016-10-11T19:55:00Z

Are you talking about collections? If we make "foo" == 1.2 an error, that doesn't necessarily affect ["foo"] == [1.2] (although perhaps that's what this PR does – I didn't notice that).

JeffBezanson · 2016-10-11T20:10:53Z

I'm mostly talking about moving to wider types. For example I start with x::Float64 == y::Float64, and then change something such that x and y now might be strings, or nothings: x::Union{Float64,String} == y::Union{Float64,String}. Under this change that no longer works, but nor can I use isequal since it changes how Float64s compare, so I have to write my own equality function at that point.

Having a function that both expands the domain within which equality works, and also changes how some previously-comparable elements compare is a gotcha.

Of course, one solution is to use isequal as much as possible, e.g. for in and other places. Hey, I'd be fine with that --- even better, we could just define const (==) = isequal :) But the point is, further use of isequal further undermines the argument that == should be strict. I would expect somebody to be puzzled that 1 == "x" is an error, but 1 in ["x"] is fine.
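A hand-written equality of the kind described above might look like the following sketch: type-permissive, but keeping == semantics for floats. widened_eq is a hypothetical name, and the code assumes Julia 1.x, where Void has become Nothing.

```julia
# Hypothetical hand-written equality; assumes Julia 1.x (Nothing was Void in 2016).
widened_eq(x::Number, y::Number) = x == y                  # keep IEEE semantics
widened_eq(x::AbstractString, y::AbstractString) = x == y
widened_eq(::Nothing, ::Nothing) = true
widened_eq(x, y) = false                                   # unrelated types: never equal

println(widened_eq(0.0, -0.0))   # true, as with ==
println(widened_eq(NaN, NaN))    # false, as with ==
println(widened_eq(1.0, "1.0"))  # false instead of an error
println(isequal(0.0, -0.0))      # false: isequal changes the Float64 answer
```

This is the boilerplate that a strict == combined with an IEEE-insensitive isequal would push onto users who widen their types.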

davidanthoff · 2016-10-13T23:52:54Z

I'm not sure this is relevant here, so please feel free to ignore if this is off base. But for Query.jl it would be really key that x::T == x::Nullable{T} works.

TotalVerb · 2016-10-14T00:10:12Z

@davidanthoff But as far as I can tell, that doesn't do what is expected.

julia> 1 == Nullable(1)
false

davidanthoff · 2016-10-14T00:18:14Z

@TotalVerb Yes, my hope is that the definition will be changed in base so that 1 == Nullable(1) would return true.

Right now I overwrite that definition in the Query package, but that of course is terrible (type piracy and all of that).

nalimilan · 2016-10-14T12:34:28Z

The current fallback on === is indeed a bit surprising, and that's one of the cases where it could be better to raise an error rather than returning false. Indeed, for nullable-nullable comparisons, we currently have a == method which throws just to avoid the fallback on === (which can give misleading results for null values, cf. #16923).

But let's not add Nullable to this discussion, it's mostly unrelated. We just need to choose what == method we want to define. Experience shows that this question is controversial, so let's not let it derail this PR.

I'm mostly talking about moving to wider types. For example I start with x::Float64 == y::Float64, and then change something such that x and y now might be strings, or nothings: x::Union{Float64,String} == y::Union{Float64,String}. Under this change that no longer works, but nor can I use isequal since it changes how Float64s compare, so I have to write my own equality function at that point.

Having a function that both expands the domain within which equality works, and also changes how some previously-comparable elements compare is a gotcha.

@JeffBezanson I guess one can see it that way, but it can also indicate that we need == methods to allow comparing nothing with any other type. Indeed, your example wouldn't make a lot of sense if you replaced Void with String or Char. If you did that, then the fact that == throws an error can be seen as a feature: if code was written expecting numbers, writing x == '1' or x == "1" can only give useless (at best) or confusing (at worst) results. That's just the same as what would happen if you passed a type for which + isn't defined.

Of course, one solution is to use isequal as much as possible, e.g. for in and other places. Hey, I'd be fine with that --- even better, we could just define const (==) = isequal :) But the point is, further use of isequal further undermines the argument that == should be strict. I would expect somebody to be puzzled that 1 == "x" is an error, but 1 in ["x"] is fine.

I wonder whether the core of the issue is that we don't have an easy to type operator for isequal. Maybe we need another operator? (=== would have been a perfect fit. :-)

FWIW, Go panics when comparing "not comparable" values. Of course it's a static language so it's a bit different.

StefanKarpinski · 2016-10-14T16:43:25Z

I have to say that I don't find the x::Union{Float64,String} == y::Union{Float64,String} particularly convincing. That seems like bizarre code that should probably be refactored. I'd be interested in an example of this happening in the wild to see if it is actually as unnatural as it seems.

nalimilan · 2016-10-14T16:58:00Z

@StefanKarpinski Concrete cases of Union{T, Void} in Base are actually with String, Symbol, or Function. If you want to have a look, several changes in the first and fourth commits above are there to deal with that.

EDIT: perhaps more to the point, cases of Union{S, T} in Base other than Void were the mixing of Char and String in dictionaries in LineEdit/REPL code, which isn't a good idea AFAICT because the gain due to using a simpler type (Char) is most likely cancelled by the type instability. There are also comparisons between Symbol, Expr, QuoteNode and GlobalRef, but these are arguably special types.

JeffBezanson · 2016-10-14T17:05:17Z

Sure, you would not use Union{Float64,String}, but currently you don't even need to think about it, whereas here an ad-hoc list of special cases (e.g. Void) might be introduced. The real issues are

Coupling type-permissive comparison to -0.0 and NaN comparison is a gotcha. I can easily imagine switching between == and isequal for one of those reasons, while forgetting that you're also pulling in the other one.
I'm not convinced that it can simultaneously be important for 1 == "1" to give an error, yet allow 1 in ["1"].

I agree it would be great to have some sort of infix for isequal, but that doesn't fully address the in issue, since what the obvious operators == and in do out of the box is the most important thing.

JeffBezanson · 2016-10-14T17:10:30Z

Heh, I just noticed that x in (y,) is shorter than isequal(x,y), making it a passable infix for isequal. We could even add a specialization for 1-tuples so it has no cost over calling isequal directly. Of course x in [y] is even shorter but harder to optimize.

nalimilan · 2017-05-20T16:55:36Z

Doesn't look like this PR has a chance of being merged.

Use === instead of == where appropriate

4190701

nalimilan force-pushed the nl/notcomparable branch from 7abc029 to 930d683 Compare October 9, 2016 21:04

vtjnash reviewed Oct 9, 2016

nalimilan force-pushed the nl/notcomparable branch from 930d683 to 3a740f8 Compare October 10, 2016 07:27

nalimilan added 2 commits October 10, 2016 10:16

Do not compare enums with integers directly

73faf81

Without conversions, these comparisons always returned false.

Use isequal() instead of == for in() and find*()

2c69587

nalimilan force-pushed the nl/notcomparable branch from 3a740f8 to e062942 Compare October 10, 2016 08:17

nalimilan added 3 commits October 10, 2016 15:09

Make all comparisons safe for == behavior change

82fbd52

Use isequal() or type checks in all places where the types might not be comparable.

Deprecate ==(::Any, ::Any) fallback

fb0f333

Hack: don't force bounds checking when running tests

424d391

nalimilan force-pushed the nl/notcomparable branch from 9ec527e to 424d391 Compare October 10, 2016 13:12

JeffBezanson reviewed Oct 11, 2016

kshyatt added the deprecation label Jan 8, 2017

nalimilan closed this May 20, 2017

nalimilan deleted the nl/notcomparable branch May 20, 2017 16:55
