
Arbitrary dimension one-hot arrays #1448

Merged: 13 commits merged into FluxML:master from darsnack/arbitrary-one-hot on Jan 8, 2021

Conversation

@darsnack (Member) commented on Jan 1, 2021

This supersedes #1447. It should address the same issues:

This PR introduces a new one-hot N-dimensional array type, OneHotArray. Like #1447, this approach avoids the pointer allocations associated with OneHotMatrix being an array of OneHotVectors. It also lifts the "height" into the type parameter to avoid unnecessary allocation. Unlike #1447, this approach does not introduce a new primitive type. Instead, a "one-hot vector" is represented with a single subtype of Integer that is configurable by the user. By default, the exposed API will use UInt32.

Fundamentally, the primitive type is only necessary because wrapping a UInt32 in a OneHotVector struct incurs memory penalties when you create an Array{<:OneHotVector}. But if we begin by designing for N dimensions, then OneHotVector is just the specialized 1D case (similar to how Vector{T} = Array{T, 1}).
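
To make the layout concrete, here is a minimal sketch of the idea, using a hypothetical name (OneHotish) so it isn't confused with the actual implementation in src/onehot.jl, which differs in its parameters and constructors:

# Minimal sketch (hypothetical; not the actual Flux code). The hot index
# along the first dimension is stored as a plain integer array, and the
# height L lives in the type, so no per-element wrappers are allocated.
struct OneHotish{T<:Integer, L, N, M, I<:AbstractArray{T, N}} <: AbstractArray{Bool, M}
    indices::I
end

# N-dimensional index storage yields an (N+1)-dimensional Bool array.
OneHotish(indices::AbstractArray{T, N}, L::Integer) where {T<:Integer, N} =
    OneHotish{T, L, N, N + 1, typeof(indices)}(indices)

Base.size(x::OneHotish{<:Any, L}) where L = (L, size(x.indices)...)
Base.getindex(x::OneHotish, i::Integer, I::Integer...) = x.indices[I...] == i

With this, OneHotish(UInt32.(rand(1:10, 5)), 10) behaves as a 10×5 AbstractArray{Bool, 2} while storing only 5 integers.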

Performance

I compared against the same tests mentioned in #1447. Please suggest more if you want to.

  1. Huge performance difference between sparse and dense representation on GPU (#189)
# master
julia> x = Flux.onehotbatch(rand(1:100, 50), 1:100);

julia> W = rand(128, 100);

julia> @btime $W * $x;
  5.095 μs (13 allocations: 50.86 KiB)

julia> cW, cx = cu(W), cu(x);

julia> @btime $cW * $cx;
  24.948 μs (86 allocations: 3.11 KiB)

# #1447
julia> x = Flux.onehotbatch(rand(1:100, 50), 1:100);

julia> W = rand(128, 100);

julia> @btime $W * $x;
  5.312 μs (3 allocations: 50.36 KiB)

julia> cW, cx = cu(W), cu(x);

julia> @btime $cW * $cx;
  8.466 μs (61 allocations: 1.69 KiB)

# this PR
julia> x = Flux.onehotbatch(rand(1:100, 50), 1:100);

julia> W = rand(128, 100);

julia> @btime $W * $x;
  4.708 μs (3 allocations: 50.56 KiB)

julia> cW, cx = cu(W), cu(x);

julia> @btime $cW * $cx;
  8.576 μs (63 allocations: 1.73 KiB)

  2. onecold is very slow (#556)
# master
julia> valY = randn(1000, 128);

julia> @btime Flux.onecold($valY);
  365.712 μs (1131 allocations: 38.16 KiB)

julia> @btime Flux.onecold($(gpu(valY)));
┌ Warning: Performing scalar operations on GPU arrays: This is very slow, consider disallowing these operations with `allowscalar(false)`
└ @ GPUArrays ~/.julia/packages/GPUArrays/jhRU7/src/host/indexing.jl:43
  1.330 s (781248 allocations: 31.59 MiB)

# #1447
julia> valY = randn(1000, 128);

julia> @btime Flux.onecold($valY);
  524.767 μs (8 allocations: 4.00 KiB)

julia> @btime Flux.onecold($(gpu(valY)));
  27.563 μs (169 allocations: 5.56 KiB)

# this PR
julia> valY = randn(1000, 128);

julia> @btime Flux.onecold($valY);
  493.017 μs (8 allocations: 4.53 KiB)

julia> @btime Flux.onecold($(gpu(valY)));
  26.702 μs (171 allocations: 5.61 KiB)

Summary

This is essentially #1447, but simpler to maintain with fewer changes. Tests are passing, though I think we should add more tests for one-hot data (our current test set seems pretty sparse). Performance matches #1447 where I have tested, but please suggest more performance tests. In theory, any performance difference between #1447 and this PR should be recoverable.

PR Checklist

  • Tests are added
  • Entry in NEWS.md
  • Documentation, if applicable
  • Final review from @DhairyaLGandhi (for API changes).

cc @CarloLucibello @chengchingwen

@darsnack changed the title from "Darsnack/arbitrary one hot" to "Arbitrary dimension one-hot arrays" on Jan 1, 2021
@DhairyaLGandhi (Member) left a comment

I find generalising the type to be nicer overall, but I have comments around defining some of the methods, and the type seems to hold more information than necessary. That would induce dynamic dispatch, which would be nice to avoid.

Base.getindex(xs::OneHotMatrix, ::Colon, i::AbstractArray) = OneHotMatrix(xs.height, xs.data[i])
Base.getindex(xs::OneHotMatrix, ::Colon, ::Colon) = OneHotMatrix(xs.height, copy(xs.data))
_onehot_bool_type(x::OneHotArray{<:Any, <:Any, <:Any, N, <:OneHotIndex}) where N = Array{Bool, N}
_onehot_bool_type(x::OneHotArray{<:Any, <:Any, <:Any, N, <:CuArray}) where N = CuArray{Bool, N}
Member: We might want to simply return the type of the underlying array, if I understand correctly.

Member (Author): The underlying array is an integer array. This is just an internal convenience function I use when I want to convert the OneHotArray to a Bool array. It decides whether to convert to an Array{Bool} or a CuArray{Bool} depending on the underlying storage location.
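
For context, a sketch of how such a helper might be used (a hypothetical call site; the actual uses live in src/onehot.jl):

# Hypothetical helper: materialize a OneHotArray as a dense Bool array on
# the same device (CPU Array or GPU CuArray) as its underlying index storage.
densify(x::OneHotArray) = convert(_onehot_bool_type(x), x)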

Member: But why do we need this? The implementation for CuArray should fall straight out of assuming a regular array.

@darsnack mentioned this pull request on Jan 2, 2021
@darsnack (Member Author) commented on Jan 5, 2021:

The last remaining issue that needs to be addressed is that Base's reshape logic returns a lazy ReshapedArray for cases like reshape(x, 10, :). This means that if you collect the lazy iterator, you'll currently get an Array{Bool} when we want to return a OneHotArray (see the failing test cases).
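
For illustration, a sketch of the behavior being described (hypothetical REPL session; exact types depend on the Julia version):

x = Flux.onehotbatch(rand(1:10, 50), 1:10)  # 10×50 one-hot matrix

y = reshape(x, 10, :)  # takes Base's colon dispatch path, returning a lazy Base.ReshapedArray
collect(y)             # materializes as an Array{Bool}, not a OneHotArray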

@CarloLucibello (Member) commented:

What Base does with reshape is just crazy.
I think you have to intercept the following line
https://github.com/JuliaLang/julia/blob/788b2c77c10c2160f4794a4d4b6b81a95a90940c/base/reshapedarray.jl#L118

So maybe overload

Base._reshape(x::OneHotArray, dims::Dims)

?

@DhairyaLGandhi (Member) commented:

Please don't touch internal Julia functions

@darsnack (Member Author) commented on Jan 6, 2021:

Probably we just want to extend

reshape(x::OneHotArray, dims::Tuple{Vararg{Union{Int,Colon}}})

Only downside is we'd still rely on the internal Base._reshape_uncolon. Ideally, Base wouldn't have this singular odd dispatch path. I opened JuliaLang/julia#39123, so I think we should just define

reshape(x::OneHotArray, dims::Tuple{Vararg{Union{Int,Colon}}}) = reshape(x, Base._reshape_uncolon(x, dims))

Once JuliaLang/julia#39123 is addressed, we can remove this line (so temporary, minimal use of an internal function).

@darsnack (Member Author) commented on Jan 6, 2021:

Never mind, that leads to a method ambiguity that I think can only be resolved in Base. We can either overload _reshape like @CarloLucibello suggested, or we ship with these tests broken until this is addressed in Base.

@CarloLucibello (Member) commented:

> Never mind, that leads to a method ambiguity that I think can only be resolved in Base. We can either overload _reshape like @CarloLucibello suggested, or we ship with these tests broken until this is addressed in Base.

I wouldn't hold off on adding very basic functionality just because the situation is a bit weird in Base. I say we add _reshape; if things change in future Julia versions, we will adapt accordingly. The important thing is to test cases like reshape(x, 10, :) (see the test sketch below).

Most importantly, though, we have to decide whether we go with this or #1447.
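
A sketch of such a test, under the assumption that the colon form should round-trip to a OneHotArray (the real suite lives in test/onehot.jl):

using Test, Flux

x = Flux.onehotbatch(rand(1:10, 20), 1:10)

# reshape should preserve one-hotness instead of returning a lazy
# Base.ReshapedArray, for both explicit dims and the colon form.
@test reshape(x, 10, 4, 5) isa Flux.OneHotArray
@test reshape(x, 10, :) isa Flux.OneHotArray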

@DhairyaLGandhi (Member) commented:

Agreed on testing for the right cases, but not on overloading internal functions. We can catch the failing case as a separate method instead.

@darsnack (Member Author) commented on Jan 7, 2021:

> I wouldn't hold off on adding very basic functionality just because the situation is a bit weird in Base.

I agree.

> We can catch the failing case as a separate method instead.

That's what I tried. Due to the situation in Base, we are forced to catch the failing case with Base._reshape. There's no way to catch all the cases with Base.reshape alone.

> Most importantly, though, we have to decide whether we go with this or #1447.

If this is for the v0.12 milestone, then I suggest we go for this. Based on this comment, I think there is a slim set of models where the performance of this PR and #1447 can't be matched. We can always adopt #1447 if we need to in the future. Going back from a new primitive type will be harder.

#1447 and this PR are composable changes. I think all that's required to move to #1447 in the future is to replace the indices field with the primitive OneHot type.

@CarloLucibello previously approved these changes on Jan 8, 2021
@CarloLucibello (Member) commented:

@DhairyaLGandhi I have asked a few times already: could you lift the GitHub restrictions (I get "The base branch restricts merging to authorized users. Learn more about protected branches.") and clarify whether we have to use bors r+ now that we have Buildkite?

@darsnack we have a failing onecold test on GPU.

To me the plan sounds good; the simplicity of this approach is very compelling. We can merge this and revisit later if performance issues are reported for those corner cases, unless @chengchingwen has strong objections.

@CarloLucibello (Member) commented:

> @DhairyaLGandhi I have asked a few times already: could you lift the GitHub restrictions (I get "The base branch restricts merging to authorized users. Learn more about protected branches.") and clarify whether we have to use bors r+ now that we have Buildkite?

@DhairyaLGandhi bump

@DhairyaLGandhi (Member) commented:

Thanks, I've added a thought around making vcat throw an error, with a message pointing users to collect the OneHotArray if the result is no longer one-hot.

I'm inclined to go with this over #1447: they address the same concerns, but this one does so in a neater fashion.

We'll continue with bors for the time being.

@darsnack (Member Author) commented on Jan 8, 2021:

Okay, the constructors should now be backwards compatible, and vcat will throw an error.

@DhairyaLGandhi (Member) commented:

bors r+

bors bot (Contributor) commented on Jan 8, 2021:

Build succeeded.

bors bot merged commit ebd37d6 into FluxML:master on Jan 8, 2021
@darsnack deleted the darsnack/arbitrary-one-hot branch on January 8, 2021
Comment on lines +45 to +48
Base.reshape(x::OneHotArray{<:Any, L}, dims::Dims) where L =
  (first(dims) == L) ? OneHotArray(reshape(x.indices, dims[2:end]...), L) :
                       throw(ArgumentError("Cannot reshape OneHotArray if first(dims) != size(x, 1)"))

Base._reshape(x::OneHotArray, dims::Tuple{Vararg{Int}}) = reshape(x, dims)
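
For reference, a sketch of how the merged behavior plays out (sizes are illustrative):

x = Flux.onehotbatch(rand(1:10, 20), 1:10)  # 10×20 one-hot matrix

reshape(x, 10, 4, 5)  # OK: first(dims) == 10, so the result is still a OneHotArray
reshape(x, 10, :)     # OK: the colon resolves to (10, 20) and hits the same method
reshape(x, 4, 50)     # throws ArgumentError: would reshape away the one-hot dimension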
@simeonschaub (Member): What was the reason for adding this? It seems overly restrictive to require that first(dims) == L; in fact, this broke some of my code.

@simeonschaub (Member): The fallback worked fine for me before.

@darsnack (Member Author): The initial implementation converted to a Bool array for the else case. @CarloLucibello, it seems like we should add that back in?

@DhairyaLGandhi (Member): I think Simeon is referring to before we merged this? Did we convert to a Bool array then? I think it would be difficult to guarantee the return type of the function then. I agree the check seems pretty restrictive.

@darsnack (Member Author): Not sure what fallback @simeonschaub is referring to, but the original implementation of this PR did not throw an error; it converted to a Bool array.

@simeonschaub (Member): Sorry, I should have clarified: I meant falling back to the default definition in Base for AbstractArray, which produces a ReshapedArray. I think if we overload reshape here, we shouldn't make it error in cases where the fallback would work, since that makes it hard to use reshape in generic code.

@CarloLucibello (Member): I thought that reshaping the first dimension was something never done in practice, but since we broke @simeonschaub's code, maybe it is not so rare. It can be handled by reshape(collect(oh), ...), but I would be fine relaxing the dims check if people feel the need, although, as @DhairyaLGandhi said, this would make reshape type-unstable.

@darsnack (Member Author): I'll submit a quick fix PR.

@simeonschaub (Member) commented on Jan 9, 2021:

Just to explain: in my use case I was adding a singleton dimension in front of the rest for the purpose of broadcasting, i.e. something like

reshape(Flux.onehotbatch(Flux.onecold(ŷ, classes), classes), 1, 4, :) .=== reshape(outputs_onehot, 4, 1, :);

@cossio (Contributor) commented on Feb 25, 2021:

It would be nice to add docs for this. #1519

@cossio (Contributor) commented on Feb 25, 2021:

I think this PR hasn't been released yet. Why?

@darsnack (Member Author) commented:

Not 100% positive because I'm still hazy on semver, but I think the struct/constructor changes mean this is breaking, so it will have to wait until v0.12, even though the highest-level APIs like onehotbatch are unchanged.

Development

Successfully merging this pull request may close these issues:

  • Issues about OneHotVector/OneHotMatrix