Rename None to Union() and Nothing to Void? #8423

johnmyleswhite · 2014-09-20T00:36:31Z

In the discussion of #8152, there was some concern about the potential existence of three "NULL"-like types in Julia:

None
Nothing
Nullable

One suggestion was to rename types to clarify their purpose. @JeffBezanson suggested renaming None to Union() and Nothing to Void to reflect their respective roles as the empty union of zero types and the result of functions that "do not return a value".

I personally think this would be a great change.

eschnett · 2014-09-20T01:11:15Z

In the spirit of "None -> Union()", one could also do "Nothing -> ()", i.e. use the empty tuple. Functions return multiple values as tuples, so a function returning nothing returns ().

johnmyleswhite · 2014-09-20T01:12:47Z

I like that idea, but believe it might require a change in semantics rather than a change in names.

nalimilan · 2014-09-20T09:38:21Z

And what about NA from DataArrays? What can it be replaced with, in the perspective of renaming DataArray to NullableArray and making it consistent with Nullable?

EDIT: for reference, https://github.com/johnmyleswhite/NullableTypes.jl/pull/3

quinnj · 2014-09-20T10:37:06Z

Would we really need a separate type for NullableArrays? Or would they just become Array{Nullable{T},1}?

StefanKarpinski · 2014-09-20T13:20:18Z

Arrays of Nullables would not have good performance characteristics or memory efficiency.

StefanKarpinski · 2014-09-20T13:23:47Z

I'm fine with renaming None to Union() but I think renaming Nothing to Void would be a mistake. The empty tuple is a completely valid and useful value – using it to indicate that there's nothing interesting to return is not a good idea.

JeffBezanson · 2014-09-20T16:40:12Z

Yes we cannot use () to mean "nothing". For example it is the size of a 0-dimensional array. In that case there is definitely a value there, representing 0 dimensions.

I think making Void === Nothing is effectively a bugfix. A Void ccall returns nothing in julia, so anything else is just bound to cause problems.

I would much prefer simply renaming Nothing to Void, but I'm willing to accept Nothing === Void in the interest of fixing the bug.
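
To make the distinction concrete, here is a minimal sketch. It assumes present-day Julia syntax rather than the 2014 syntax used in this thread, and the helper name `log_message` is hypothetical. It shows that `()` is an ordinary value (the size of a 0-dimensional array), while a function with nothing interesting to return yields the separate singleton `nothing`.

```julia
# A 0-dimensional array still holds exactly one element;
# its size is the empty tuple.
a = fill(1.5)
dims = size(a)

# Hypothetical helper that has nothing interesting to return.
function log_message(io::IO, msg::AbstractString)
    println(io, msg)
    return nothing
end

result = log_message(devnull, "hello")

println(dims === ())          # the empty tuple is a real value
println(ndims(a))             # number of dimensions of `a`
println(result === nothing)   # the "no value" singleton
println(() === nothing)       # the two are distinct objects
```

Because `()` already carries meaning as "zero dimensions", overloading it to also mean "no return value" would make the two cases indistinguishable.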

johnmyleswhite · 2014-09-20T17:30:58Z

@nalimilan: My plan is to remove NA as a concept from Julia completely because it has no coherent place in the type system. In R, NA is shorthand for what might be called NA_logical; that is, NA is a value of type logical. But the existence of multiple NA values for each of R's fundamental types gives rise to some paradoxical situations. See this gist for one example: https://gist.github.com/johnmyleswhite/fd6cbed2f691a9119cfe

In Julia, we started with an approach in which NA was a singleton object of a completely novel type, NAtype. This was convenient at the start, but problematic in the long run because it induced endemic type-instability in all code that interacted with any source of NA, since that code always produced Union(NAtype, T) as the inferred type. From the perspective of the current Julia compiler, we might as well have been producing Any everywhere as the output type -- we were actively sabotaging everything clever about the design of Julia's compiler.

One could improve support for union types in Julia substantially using techniques like polymorphic inline caching, but Jeff and others felt that this was not the best way to move forward. I've also come to feel that Julia doesn't use Union types in a way that's meaningfully similar to languages like ML or Haskell, where the Union(NAtype, T) pattern makes sense and is referred to as sum types. In those languages, the compiler forces one to decompose a sum type into separate cases for each possible type in the sum via explicit pattern matching. In other words, sum types are used to push branching into the type system's verifier for program correctness. Julia does not verify exhaustiveness when working with union types.

From my perspective, Julia's union types tend to be used only for writing out a catch-all case that expresses a generic statement about all types. Specialized cases are handled via multiple dispatch, rather than pattern matching with exhaustive case analysis. As such, I don't think Julia's union type will ever come to be used in the same way that functional languages use their sum types. This makes me think that the old Union(NAtype, T) pattern in Julia should be expunged from the language completely.
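
The type-instability point can be illustrated with a small sketch. It assumes present-day Julia syntax (`Union{...}` rather than the `Union(...)` written in this thread), and `NAType`, `NA_SENTINEL`, and `clamp_positive` are hypothetical names, not the DataArrays definitions. Later Julia compilers handle small unions much better than the 2014 compiler described above, so the sketch only shows what gets inferred, not what it costs.

```julia
# Hypothetical singleton type standing in for the old NAtype.
struct NAType end
const NA_SENTINEL = NAType()

# Returns either a Float64 or the sentinel, so the type of the result
# depends on the run-time value of x, not just on its type.
clamp_positive(x::Float64) = x > 0 ? x : NA_SENTINEL

# Ask the compiler what it infers for a Float64 argument.
inferred = Base.return_types(clamp_positive, (Float64,))[1]

println(inferred)
println(isconcretetype(inferred))   # false: callers cannot know which type they get
```

Any code that consumes the result of `clamp_positive` inherits the union, which is how a single source of NA spread through everything that touched it.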

johnmyleswhite · 2014-09-20T17:38:06Z

@quinnj: The plan is to maintain a separate NullableArray type rather than use Array{Nullable}. The arguments for and against that I see are as follows.

Pros:

NullableArray is more efficient because it uses bits to store Boolean information, rather than bytes.
NullableArray mimics the data structure of Nullable more closely by decomposing values and missingness masks. This memory layout means that the values component can be operated on by standard functions over Julia's Array{T} type.
By using a more standard memory layout, we can avoid redefining operations. We get, for example, FFTs on NullableArrays for free.
By reusing existing functions, we also get to avoid having to define elementary operations between two Nullable{T <: Number} objects. We don't, for example, have to define + between two Nullable objects in order to define linear algebra operations.

Cons:

People will surely create Array{Nullable} objects. It will take some cultural force to ensure that people learn why those objects aren't supported by most libraries.
We need to reinvent lots of functionality currently defined over AbstractArray, including things like map, reduce, etc. But we have to do that regardless, because those functions aren't able to cope with Nullable objects.
We have to do a lot more work to support an extra data structure.

I think we'll end up revisiting this question a few times, but I think the current plan is the best one we have so far.
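
The "values plus missingness mask" layout described in the pros above might look roughly like the following sketch. It assumes present-day Julia syntax, and `MaskedVector`, `setnull!`, and `sum_present` are hypothetical names for illustration, not the NullableArray implementation.

```julia
# Hypothetical sketch of the "values plus missingness mask" layout.
struct MaskedVector{T}
    values::Vector{T}   # plain storage, usable by ordinary Array functions
    isnull::BitVector   # one bit per element: true means "missing"
end

# Convenience constructor: start with every entry present.
MaskedVector(values::Vector{T}) where {T} =
    MaskedVector{T}(values, falses(length(values)))

# Mark entry i as missing; the stored value is simply ignored afterwards.
function setnull!(v::MaskedVector, i::Integer)
    v.isnull[i] = true
    return v
end

# Sum of the present entries, computed with ordinary Array operations.
sum_present(v::MaskedVector) = sum(v.values[.!v.isnull])

v = MaskedVector([1.0, 2.0, 3.0, 4.0])
setnull!(v, 2)

println(sum_present(v))    # 1.0 + 3.0 + 4.0
println(count(v.isnull))   # number of missing entries
```

The `values` field is an ordinary `Vector{T}`, which is why existing array functions can be reused on it, and the mask costs one bit per element rather than one byte.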

johnmyleswhite · 2014-09-20T17:38:41Z

@JeffBezanson and @StefanKarpinski: How about starting by making Void === Nothing and then considering a complete transition to Void over time?

JeffBezanson · 2014-09-20T17:41:09Z

Yes that's probably what we'd do anyway as a deprecation process.

If we had a general way to do the array-of-structs-to-struct-of-arrays transformation, then NullableArray could become redundant. But that is a significant challenge.

eschnett · 2014-09-20T20:30:54Z

Nullable{T} is in many ways similar to Array{T} with a size that is constrained to be either 0 or 1. We may want to introduce "map" or "reduce" or the iterator interface for nullable types. Similarly, the .+ kind of operators would make sense, if both nullable objects are either null or non-null.

StefanKarpinski · 2014-09-20T21:13:47Z

Can't we support both by just writing DataFrames et al. to work with any representation that takes indices and produces Nullables? That would include both Array{Nullable} and NullableArray.

johnmyleswhite · 2014-09-21T00:52:03Z

We can certainly support both in some cases, but it seems pointless to recreate the work we've done to support things like matrix multiplication for Array{Nullable}.

nalimilan · 2014-09-21T10:24:38Z

@johnmyleswhite Fine, but then how would you set an element of a NullableArray to be null, without a NULL object equivalent to the old NA? Will that require something like setnull(a, i) instead of a[i] = NULL?

johnmyleswhite · 2014-09-21T16:07:08Z

a[i] = Nullable{Int}()

StefanKarpinski · 2014-09-21T16:58:45Z

We can have some value of type Nullable{None}() and support conversion from that to any kind of Nullable, which should do the trick. I'm not sure what we want to call that, but NA or NULL would be reasonable.

johnmyleswhite · 2014-09-21T17:02:20Z

It is worth noting that we didn't opt into using const NULL = Nullable{None}() during our discussion in #8152.

There are some potential gains from having NULL, but I worry about negative consequences. In particular, I really don't want to wind up in a situation in which two distinct values that render as NULL behave in powerfully different ways, as occurs in R: https://gist.github.com/johnmyleswhite/fd6cbed2f691a9119cfe

nalimilan · 2014-09-21T17:15:03Z

@johnmyleswhite Two big differences from the R behavior illustrated in your gist are that 1) NULL in Julia would not be equal to Nullable{Bool}(), but to Nullable{None}(), and that 2) indexing with missing values fails in Julia (at least for now; even if it didn't, indexing with Nullable{None}() could still trigger an error).

johnmyleswhite · 2014-09-21T17:30:26Z

@nalimilan: That solves the specific issue in that gist, but doesn't solve the broader lesson: it's dangerous to teach people to think about values that have ill-defined positions in the type system. Even though Nullable{Int}() is more verbose, it's more clear. And it's something you'll probably write quite rarely.

StefanKarpinski · 2014-09-21T17:56:43Z

I think that having NULL as a shorthand for Nullable{None}() is pretty reasonable though. We have handy shorthands for a lot of things that behave generically and this seems to me no different.

johnmyleswhite · 2014-09-21T18:01:25Z

Abstractly, I agree that having NULL as shorthand for Nullable{None}() is reasonable. What worries me is how this NULL would be used.

I'm happy to allow a[i] = NULL. Indeed, that was my original idea for creating a NULL constant.

But I'm worried about people writing functions like:

function mean(na::NullableArray)
  s, n = 0.0, 0
  for i in 1:length(na)
    if isnull(na[i])
      return NULL
    else
      s += get(na[i])
      n += 1
    end
  end
  return Nullable(s / n)
end

I'm especially worried that the brevity of NULL will lead people to use it without understanding its position in the type system. That is to say: its strength is what makes it dangerous, because people will misuse things that are brief.
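
For contrast, here is a sketch of a variant in which every return path has the same type. It assumes present-day Julia syntax, and `Opt`, `is_null`, `unwrap`, and `stable_mean` are hypothetical stand-ins for `Nullable`, `isnull`, `get`, and `mean`. The function above returns the untyped NULL on one path and a Nullable wrapping a Float64 on the other. The version below returns a typed null instead.

```julia
# Hypothetical stand-in for the Nullable{T} type discussed in this thread.
struct Opt{T}
    hasvalue::Bool
    value::T
    Opt{T}() where {T} = new{T}(false)      # typed null; `value` is left unset
    Opt{T}(x) where {T} = new{T}(true, x)   # wraps a value
end

is_null(o::Opt) = !o.hasvalue
unwrap(o::Opt) = o.value   # only meaningful when hasvalue is true

# Every return path yields Opt{Float64}, so the result type does not
# depend on whether a null entry was encountered.
function stable_mean(xs::Vector{Opt{Float64}})
    s, n = 0.0, 0
    for x in xs
        is_null(x) && return Opt{Float64}()
        s += unwrap(x)
        n += 1
    end
    return Opt{Float64}(s / n)
end

full = stable_mean([Opt{Float64}(1.0), Opt{Float64}(3.0)])
holey = stable_mean([Opt{Float64}(1.0), Opt{Float64}()])

println(unwrap(full))
println(is_null(holey))
println(typeof(full) == typeof(holey))
```

The typed null is more verbose than NULL, but the caller always receives the same type, whichever branch was taken.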

JeffBezanson · 2014-09-21T18:09:51Z

It is also confusingly different from what NULL means in C and Java and all other languages that have it.

eschnett · 2014-09-21T18:13:01Z

This problem could also be avoided by allowing type annotations on functions -- similar to local variables -- and then automatically converting the returned value to this type.

johnmyleswhite · 2014-09-21T18:13:38Z

That's #1090.

johnmyleswhite · 2014-09-21T18:16:47Z

If people want shorter ways to create appropriate Nullable objects, I'd rather use Null(T) and NotNull(x::T).

StefanKarpinski · 2014-09-21T18:17:56Z

I'm also ok with that.

eschnett · 2014-09-21T18:28:04Z

Is the name Nullable too long?

I believe Swift uses a question mark after the type to indicate types that are nullable, e.g. Int?. Nullable types could also be useful for optional arguments, and then a very short syntax to create null objects as default values would also be quite handy.

johnmyleswhite · 2014-09-21T18:29:54Z

A bunch of languages use ? to indicate Nullable objects. I'm not super fond of it, but I also personally don't feel that Nullable is too long.

eschnett · 2014-09-21T19:07:11Z

I thought the discussion above was about introducing Null(T) as abbreviation of Nullable{T}().

johnmyleswhite · 2014-09-21T19:09:04Z

Yes, but I don't actually think we need to make any changes. If we do make changes to provide shorthand for typed nulls, I think Null(T) is the way to do that.

eschnett · 2014-09-21T19:29:21Z

Or null(T), since Null(T) looks as if it constructed a type Null.

nalimilan · 2014-09-21T19:55:18Z

OTOH there's 0 and zero(T), so patterns requiring people to (sometimes) carefully choose the type of variables and return values already exist in Julia. Having both NULL and Nullable(T) would follow this schema.

To me the strongest argument in favor of NULL is that a short string is needed to print missing values in NullableArrays. For this use case, including the type would be a waste of space; and it would be good that the value used for printing can be used in the code. But instead of NULL, null() without any type argument could be fine too.

johnmyleswhite · 2014-09-22T00:36:40Z

I think we can set up NullableArray objects to print out null entries as NULL without needing to define NULL. Think of NULL as the showcompact for Nullable.

johnmyleswhite added the "needs decision" (a decision on this change is needed) and "kind:breaking" (this change will break code) labels Sep 20, 2014

JeffBezanson added this to the 0.4 milestone Sep 20, 2014

JeffBezanson self-assigned this Sep 20, 2014

JeffBezanson closed this as completed in 2ef8d31 Sep 25, 2014

ivarne referenced this issue in JuliaGraphics/Gtk.jl Sep 29, 2014

Ptr{None} -> Ptr{Void}

f34005b

This was referenced Sep 29, 2014

Ptr{None} -> Ptr{Void} JuliaMath/GSL.jl#23

Merged

Ptr{None} -> Ptr{Void} JuliaAstro/FITSIO.jl#15

Merged

Ptr{None} -> Ptr{Void} JuliaCloud/AWS.jl#24

Merged

PythonNut mentioned this issue Oct 17, 2014

creation of BitArray using comprehension #3166

Closed

nalimilan mentioned this issue Dec 14, 2014

additional convert methods for Nullable #9351

Merged

ihnorton mentioned this issue Dec 15, 2014

RFC: Missing values by Sentinels #9363

Closed

amitmurthy mentioned this issue Dec 15, 2014

Less verbose way of initializing a Nullable field to null #9364

Closed

StefanKarpinski mentioned this issue Dec 18, 2017

rename Void => Nothing with alias Cvoid = Nothing #25082

Closed
