Argument order for rand! #8246

simonster · 2014-09-05T18:00:28Z

Is there a reason that the output array comes last instead of first as is the case for most other mutating functions? Right now we could change the argument order and add a deprecation, but we should decide whether we want to change this before #6003.

andreasnoack · 2014-09-05T18:38:59Z

This has been discussed somewhere before and I think @johnmyleswhite gave the answer. Personally, I don't like the rule that the mutated argument should be the first one. It could be awkward for some functions and for all the matrix multiplication and division functions I'd prefer that we followed the BLAS conventions strictly and that would mean that the mutating argument was last (as well as adding scalar α and β arguments). For some linear algebra functions you'd also want to give a workspace array as argument to avoid reallocation and such an argument would make more sense as the last argument.

johnmyleswhite · 2014-09-05T18:42:43Z

I'm happy with any rule personally. Whether the mutated arguments are first or last doesn't matter much to me, although the argument against putting mutated arguments last is that it breaks functions with varargs.

simonster · 2014-09-05T18:57:41Z

I don't have a preference on the rule, just that there should be one, or that we should get #249. But I also think at this point we don't want to break all code that uses mutating functions by moving the output last.

As far as matrix functions, maybe α, β, and workspace should be keyword arguments? Although right now that incurs kwsorter overhead.

iamed2 · 2014-09-05T19:09:10Z

Perhaps there could be a convention where the names of mutating arguments end in an exclamation point?

rand!{T}(r::Range{T},A::AbstractArray{T,N})

would become

rand!{T}(r::Range{T},A!::AbstractArray{T,N})

in docs and when calling methods().

timholy · 2014-09-05T19:16:06Z

I don't think there are any established conventions, here or elsewhere. While BLAS puts it last, more often than not libc puts them first (memcpy, fgets, strcpy). But even libc is inconsistent (sscanf, although there's a reason for that violation). I have a slight preference for first, but above all would prefer reasonable consistency (whatever that may be)

I would also conceptually distinguish output from mutating. mutating to me means it's OK to destroy it. output means just that, it's acting as a return value. As discussed elsewhere, there are sometimes performance advantages in passing in a dedicated output, even if you could just use the input to return the output. This is one of the things I definitely don't like about the BLAS/LAPACK API.

timholy · 2014-09-05T19:17:42Z

@iamed2, it's a good suggestion, but keyword arguments have an amusing gotcha on this: #8020.

timholy · 2014-09-05T19:19:33Z

Better link for the reason for separating output from input: #7513 (comment)

andreasnoack · 2014-09-06T20:19:04Z

@timholy I can't see the performance advantage for triangular solve on my machine

julia> minimum([@elapsed (for i = 1:100;MyTest.mysolve1(A, b, x);end) for j = 1:10])
0.306543869
julia> minimum([@elapsed (for i = 1:100;copy!(x,b);MyTest.mysolve2(A, x);end) for j = 1:10])
0.305641564

I'm not that surprised about the result. Efficiency seems to have been a pretty high priority in the development of BLAS.

The code is

module MyTest

    function mysolve1{T,S}(A::Triangular{T,S,:L,false}, b, x = b)
        n = size(A, 1)
        Ad = A.data
        @inbounds begin
            for j = 1:n
                xj = b[j]
                for i = 1:j-1
                    xj -= Ad[j,i]*x[i]
                end
                x[j] = xj/A[j,j]
            end
        end
        x
    end

    function mysolve2{T,S}(A::Triangular{T,S,:L,false}, b)
        n = size(A, 1)
        Ad = A.data
        @inbounds begin
            for j = 1:n
                bj = b[j]
                for i = 1:j-1
                    bj -= Ad[j,i]*b[i]
                end
                b[j] = bj/A[j,j]
            end
        end
        b
    end
end

andreasnoack · 2014-09-06T20:34:40Z

@simonster I really think it is a mistake that we haven't followed the BLAS convention completely and I also hope that we can still break code to make things right.

Actually, I began the work to change the convention some time ago, but the development stalled in discussion.

timholy · 2014-09-06T21:06:38Z

Sure, for this kind of algorithm it won't matter. Compare the running time to that of copy!ing x---for this algorithm it's access to the elements of A, not x, that matters.

But that doesn't mean that it's a good model to apply to other algorithms that may be cache-limited on the output.

rfourquet · 2014-09-15T05:51:13Z

All rand methods are of the form rand([rng], [out]), with:

rng is the the generator, and defaults to the global one
out is the type information of the output:
- first the type of randoms, which defaults to Float64
- and the dimensions of the generated array, or nothing for a scalar

Note that if the rng is given, the type of randoms can not be specified, it is assumed that it's the responsibility of rng.

So it makes sense that an output array A comes last in rand!. Using @timholy's terminology (#8246 (comment)), A can be seen as a "mutating" argument which provides both the type and dimensions (out above) to the rand! function. It also happens to be an "output" argument. Hence the signature: rand!([rng], out).

rfourquet · 2014-09-15T06:56:58Z

To insist on the "output" characteristic of A, why not A[:] = randgen([rng], T) where randgen([rng]]) an infinite iterator producing random T's (or maybe with a bit of magic in the relevant getindex to deduce automatically T=eltype(A)) ?

simonster · 2014-09-15T17:29:20Z

~~I suppose there is also some analogy to fill!, which takes the array last.~~

simonster · 2014-09-15T17:31:26Z

Actually that's wrong: the argument order for fill is fill(x, sz) or fill!(out, x), which supports the idea that the output of rand! should come first even if it means inverting the order of "generator" and "output" between rand and rand!.

simonster · 2014-09-15T17:32:48Z

It's also arguable that any variant of rand that takes an explicit generator object should be rand! since it modifies the state of the generator...

ivarne · 2014-09-16T08:40:20Z

@simonster The same argument was tried on IO functions, but I think it was decided that even though most of them modify a IO object, only those utilizing pre-allocated memory, should have the ! suffix.

rfourquet · 2014-09-16T09:36:13Z

And it would be less useful: the bang ! would loose a bit of its warning power as it would be often used in a harmless context, e.g. print!(io, 'c') or rand!(rng, Int).

It would be nice if rand! and fill! had the same rules, I prefer that of rand!.

timholy · 2014-09-16T10:07:42Z

Here's my personal list of advantages for either order. Please add to it. Since there is no convention among C libraries, I'm not including "BLAS does it that way" because one can equally well say "libc does it the other way".

Advantages of output first:

func!(out, A) looks more like out = func(A), and having the output next to the ! associates the two.
func!(out, input, [temporaryworkspacevariable]) clearly distinguishes (optional) mutating arguments from output arguments. It's slightly less clear where to stop with func!(input, out, [temporaryworkspacevariable]).
Multidimensional functions with a varargs input cannot take the output last. setindex! is the poster child for this behavior: you simply can't write this function as setindex!(val, indexes..., output). (Many people might also wish it were setindex!(output, indexes..., val) but that's also not possible, since indexes must come last.)

Advantages of input first:

Semantically, people almost always talk about input before they talk about output.
If you think about the output as an optional argument, it makes sense for it to be last. HOWEVER, this is a bit of a red herring, because we have two separate functions, fill and fill!, so there's really no sense in which an output argument is actually an optional argument.

To me, the varargs/setindex! example (which I only just noticed now, long after I had developed my vague preference for output first) basically settles the argument: outputs should be first. All the other points are wishy-washy subjective issues, but the varargs/setindex! is a clear technical constraint.

…s fixed

johnmyleswhite · 2014-09-16T17:11:11Z

Completely agree with @timholy: the varargs case strongly argues for putting all output arguments at the front.

timholy · 2014-09-16T17:28:35Z

Ah, and reading up I see you made that same point at the top. Could have saved myself some time...

…s fixed

ViralBShah · 2014-11-18T18:46:14Z

Given that we are on this path, I guess we just need to go ahead and change the calling sequences for rand! to be consistent with the outputs first rule.

johnmyleswhite · 2014-11-18T23:05:53Z

+1

ViralBShah · 2014-11-19T02:46:21Z

@rfourquet Since you have considerably improved the RNG codebase and are the one most familiar with it, would it be possible for you to create a PR for this change?

rfourquet · 2014-11-19T09:50:32Z

Yes I will do that. But now that objects can be call'ed, I just wanted to mention this alternative: change rand /rand! to call (kind of like in C++11) and use rand/rand! only for the global RNG, e.g.:

rng = MersenneTwister()
x = rng()
a = Array(Int, 10)
rng(a)
rand!(a) # similar to Base.Random.GLOBAL_RNG(a)
...

This would imply having the rng as first argument of call, but is mostly independant of this issue otherwise.

ViralBShah · 2014-11-19T14:00:12Z

I like the suggestion. Would love to hear what others have to say.

andreasnoack · 2014-11-19T14:10:49Z

I also like it. How would this fit into Distributions.jl?

ivarne · 2014-11-19T14:10:57Z

I don't like overloading call for this purpose (yet). It is a step away from the current Julian APIs with documented generic functions (like rand) that packages can extend for their own types.

ViralBShah · 2014-11-19T14:14:48Z

Cc: @dmbates @lindahua @johnmyleswhite @simonbyrne

simonbyrne · 2014-11-19T14:44:01Z

This could be interesting, but I'm slightly skeptical. What does rng(a) do in the above example?

rfourquet · 2014-11-19T14:51:57Z

Yes rng(a) looks ambiguous now (rand(rng, A) vs rand!(rng, A)), but this could probably be sorted out.

rfourquet · 2014-11-19T15:32:06Z

I asked if it would be possible to have the rng!(A) syntax for rand!(rng, A), but it wasn't. In C++, rand(rng, A) would be written as something like distribution(A)(rng) (cf. e.g. n3551.pdf), but this probably belongs to another discussion.

rfourquet · 2014-11-20T10:54:01Z

The current syntax to fill an array A with values from 1:9 is rand!([rng,] 1:9, A), with two possible ways to change the argument order: rand!([rng,] A, 1:9) and rand!(A, [rng,] 1:9).
I prefer the first one because rng is the state allowing rand! to work, so I see the pair (rand!, rng) as a "function object", and I would give rng the highest priority to be first argument of rand!; and like in fill!(A, 9), the output array still comes before the source of values 1:9.
What do you think?

johnmyleswhite · 2014-11-20T11:02:26Z

Given the proposed rule that outputs must always come first, I'd prefer rand!(A, [rng,] 1:9) unless there's a function rand!(A, 1:9) that produces a modified rng object as its output.

ViralBShah · 2014-11-20T17:28:04Z

While I like the consistency, Putting the state argument second doesn't appeal aesthetically. I would love to have rand!(rng, A, 1:9). I wish I could come up with a better justification.

toivoh · 2014-11-20T19:53:55Z

Well, the call does actually mutate both of the two first arguments :)
I also think that putting the rng first feels right. Maybe its by analogy
to println, where the optional io argument goes first.

StefanKarpinski · 2014-11-20T20:03:45Z

I feel like having the RNG anywhere but first or as a keyword arg looks really odd.

ivarne · 2014-11-20T20:35:43Z

If we look at this from a dot oriented language, it would clearly be written as rng.rand!(A, 1:10), not A.rand!(rng, 1:10). Therefore I think the "rng object first" trumps the "mutated array first" rule (in this case).

lindahua · 2014-11-21T09:35:31Z

rand is often used within tight loops. RNGs as keyword arguments would cause substantial performance hit, as we have RNGs of different types.

change argument order for rand! (fix #8246)

The API to fill randomly an array A is changed from rand!([rng], [::Range], A) to rand!([rng], A, [::AbstractArray]). Enabling [::AbstractArray] instead of only [::Range] depended on choosing first the argument order.

simonster added the decision label Sep 5, 2014

simonster mentioned this issue Sep 14, 2014

Make rand work with AbstractArray instead of only with Range #8309

Merged

rfourquet added a commit to rfourquet/julia that referenced this issue Sep 16, 2014

remove rand!(::AbstractArray, ::AbstractArray) until JuliaLang#8246 i…

e859850

…s fixed

rfourquet added a commit to rfourquet/julia that referenced this issue Sep 17, 2014

remove rand!(::AbstractArray, ::AbstractArray) until JuliaLang#8246 i…

78a1f14

…s fixed

rfourquet added a commit to rfourquet/julia that referenced this issue Sep 17, 2014

remove rand!(::AbstractArray, ::AbstractArray) until JuliaLang#8246 i…

91b58f7

…s fixed

rfourquet added a commit to rfourquet/julia that referenced this issue Sep 30, 2014

remove rand!(::AbstractArray, ::AbstractArray) until JuliaLang#8246 i…

1adbe3e

…s fixed

rfourquet added a commit to rfourquet/julia that referenced this issue Sep 30, 2014

remove rand!(::AbstractArray, ::AbstractArray) until JuliaLang#8246 i…

d9814ff

…s fixed

rfourquet mentioned this issue Nov 21, 2014

change argument order for rand! (fix #8246) #9092

Merged

rfourquet closed this as completed in 4045c52 Nov 21, 2014

ViralBShah pushed a commit that referenced this issue Nov 21, 2014

Merge pull request #9092 from JuliaLang/rf/randbang-arg-order

ca7f6d1

change argument order for rand! (fix #8246)

ViralBShah added the domain:randomness Random number generation and the Random stdlib label Nov 22, 2014

simonster mentioned this issue Jan 26, 2015

At_mul_B! has different methods for sparse and dense matrices #9930

Closed

Argument order for rand! #8246

Argument order for rand! #8246

Comments

simonster commented Sep 5, 2014

andreasnoack commented Sep 5, 2014

johnmyleswhite commented Sep 5, 2014

simonster commented Sep 5, 2014

iamed2 commented Sep 5, 2014

timholy commented Sep 5, 2014

timholy commented Sep 5, 2014

timholy commented Sep 5, 2014

andreasnoack commented Sep 6, 2014

andreasnoack commented Sep 6, 2014

timholy commented Sep 6, 2014

rfourquet commented Sep 15, 2014

rfourquet commented Sep 15, 2014

simonster commented Sep 15, 2014

simonster commented Sep 15, 2014

simonster commented Sep 15, 2014

ivarne commented Sep 16, 2014

rfourquet commented Sep 16, 2014

timholy commented Sep 16, 2014

johnmyleswhite commented Sep 16, 2014

timholy commented Sep 16, 2014

ViralBShah commented Nov 18, 2014

johnmyleswhite commented Nov 18, 2014

ViralBShah commented Nov 19, 2014

rfourquet commented Nov 19, 2014

ViralBShah commented Nov 19, 2014

andreasnoack commented Nov 19, 2014

ivarne commented Nov 19, 2014

ViralBShah commented Nov 19, 2014

simonbyrne commented Nov 19, 2014

rfourquet commented Nov 19, 2014

rfourquet commented Nov 19, 2014

rfourquet commented Nov 20, 2014

johnmyleswhite commented Nov 20, 2014

ViralBShah commented Nov 20, 2014

toivoh commented Nov 20, 2014

StefanKarpinski commented Nov 20, 2014

ivarne commented Nov 20, 2014

lindahua commented Nov 21, 2014