Add scatter operations #255

yuehhua · 2020-12-26T15:22:09Z

I have generalized scatter operations to every dimensions, instead of excluding the first dimension.
Related to yuehhua/ScatterNNlib.jl#32

mcabbott · 2020-12-26T18:35:03Z

Could these easily be combined to something like scatter!(+, dst, indices, src)?

yuehhua · 2020-12-27T03:19:23Z

@mcabbott Sure! I have API like scatter!(:add, dst, indices, src) in my ScatterNNlib.jl. Should I move it here as well?

CarloLucibello · 2020-12-27T06:59:31Z

I have API like scatter!(:add, dst, indices, src) in my ScatterNNlib.jl. Should I move it here as well?

I think he is suggesting to directly pass the operator (not a symbol), the same way you would use it in

julia> reduce(+, ones(10))
10.0

We could have

julia> function scatter!(op, ys::Array{T}, us::Array{T}, xs::Array{<:IntOrTuple}) where {T<:Real}
           @simd for k = 1:length(xs)
               k = CartesianIndices(xs)[k]
               ys_v = view(ys, xs[k]...)
               us_v = view(us, k)
               @inbounds ys_v .= (op).(ys_v, us_v)
           end
           ys
       end

and a few specialized methods:

function scatter!(op::typeof(mean), ys::Array{T}, us::Array{T}, xs::Array{<:IntOrTuple}) where {T<:Real}
...

src/utils.jl

src/scatter.jl

src/utils.jl

src/scatter.jl

src/gather.jl

src/scatter.jl

src/gather.jl

chengchingwen · 2020-12-29T14:41:39Z

Maybe we should specified how our scatter/gather different from TF or Torch?

src/gather.jl

yuehhua · 2020-12-30T03:44:54Z

I discussed with @chengchingwen yesterday. We hope to introduce some new features to generalize scatter/gather operations. New features are listed in the following:

Add dims argument for specifying which dimension to start scattering and the dimensions will be contiguous block, like TensorFlow does.
The operational unit of scatter/gather could be a scalar or (at least) a vector. Both of them will be provided. The scalar unit will be designed by assigning dims=0, while the vector unit will be assigning with integer dims=1.
Following above, dims=1 will be a default option.

Scalar version: dst[i, j, k] = src[idx[i, j, k]...]
Array version:
- dims=1: dst[i, j, k] = src[:, idx[i, j, k]...]
- dims=2: dst[i, j, k] = src[:, :, idx[i, j, k]...]
- dims=3: dst[i, j, k] = src[:, :, :, idx[i, j, k]...]
- ...

src/scatter.jl

src/gather.jl

src/utils.jl

src/gather.jl

src/scatter.jl

Project.toml

src/scatter.jl

src/utils.jl

CarloLucibello · 2021-03-03T08:02:08Z

src/scatter.jl

+    dims = Nsrc - Nidx
+    dstsize = (size(src)[1:dims]..., maximum_dims(idx)...)
+    dst = similar(src, T, dstsize)
+    fill!(dst, typemax(T))


I wonder if typemax is the right thing to do here. The problem is if there are positions in dst which receive no contributions for src they will end up holding typemax, which doesn't seem meaningful. Maybe we should error out in such cases, but doing this check may have a performance impact

Yeah, I thought this issue before. Checking the position of dst is properly covered by idx is the way to avoid holding typemax. But still, it is necessary to check values in src is smaller than the value we assigned, either typemax or similar. similar gives the value existing in bare memory, so we have no idea knowing if the values are smaller enough.

What if we give the maximum of src? Thus, the value is at least smaller or equals to the maximum of src.

I think maximum(src) would be more surprising, in un-visited entries. typemax seems OK to me.

yuehhua · 2021-03-04T16:37:17Z

If it is ready to go, just let it go.

CarloLucibello · 2021-03-05T05:32:01Z

ok, I'll merge this so that work can proceed, I'll open a pr to revisit the docstrings

1516: add Embedding layer r=CarloLucibello a=CarloLucibello Basic implementation. Maybe could be improved when FluxML/NNlib.jl#255 lands ### PR Checklist - [x] Tests are added - [x] Entry in NEWS.md - [x] Documentation, if applicable - [ ] Final review from `@dhairyagandhi96` (for API changes). Co-authored-by: Carlo Lucibello <carlo.lucibello@gmail.com>

1516: add Embedding layer r=DhairyaLGandhi a=CarloLucibello Basic implementation. Maybe could be improved when FluxML/NNlib.jl#255 lands ### PR Checklist - [x] Tests are added - [x] Entry in NEWS.md - [x] Documentation, if applicable - [ ] Final review from `@dhairyagandhi96` (for API changes). Co-authored-by: Carlo Lucibello <carlo.lucibello@gmail.com> Co-authored-by: Carlo Lucibello <carlo.lucibello@unibocconi.it>

CarloLucibello reviewed Dec 27, 2020

View reviewed changes

src/utils.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Dec 27, 2020

View reviewed changes

src/utils.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Dec 27, 2020

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Dec 27, 2020

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Dec 27, 2020

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Dec 27, 2020

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Dec 27, 2020

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Dec 27, 2020

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Dec 27, 2020

View reviewed changes

src/utils.jl Outdated Show resolved Hide resolved

mcabbott reviewed Dec 27, 2020

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

yuehhua force-pushed the scatter branch 2 times, most recently from ab03446 to be4fa94 Compare December 28, 2020 12:29

chengchingwen reviewed Dec 29, 2020

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

chengchingwen reviewed Dec 29, 2020

View reviewed changes

src/gather.jl Outdated Show resolved Hide resolved

src/scatter.jl Outdated Show resolved Hide resolved

chengchingwen reviewed Dec 29, 2020

View reviewed changes

src/gather.jl Outdated Show resolved Hide resolved

chengchingwen reviewed Dec 29, 2020

View reviewed changes

src/gather.jl Outdated Show resolved Hide resolved

This was referenced Dec 31, 2020

new onehot implementation FluxML/Flux.jl#1447

Closed

Issues about OneHotVector/OneHotMatrix FluxML/Flux.jl#1445

Closed

chengchingwen reviewed Jan 2, 2021

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

chengchingwen reviewed Jan 2, 2021

View reviewed changes

src/gather.jl Outdated Show resolved Hide resolved

chengchingwen reviewed Jan 2, 2021

View reviewed changes

src/utils.jl Outdated Show resolved Hide resolved

mcabbott reviewed Jan 2, 2021

View reviewed changes

src/gather.jl Outdated Show resolved Hide resolved

mcabbott reviewed Jan 2, 2021

View reviewed changes

src/gather.jl Outdated Show resolved Hide resolved

mcabbott reviewed Jan 2, 2021

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Jan 3, 2021

View reviewed changes

Project.toml Outdated Show resolved Hide resolved

CarloLucibello reviewed Mar 1, 2021

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Mar 1, 2021

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Mar 1, 2021

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Mar 1, 2021

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

yuehhua added 3 commits March 1, 2021 14:27

remove restriction of numerical types

010af70

remove type promotion

66ded23

remove inbounds and simd annotations

dcc6710

CarloLucibello reviewed Mar 1, 2021

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

yuehhua added 2 commits March 1, 2021 15:15

update error message

8faca3d

remove @BoundsCheck

e7913b0

CarloLucibello reviewed Mar 1, 2021

View reviewed changes

src/scatter.jl Show resolved Hide resolved

CarloLucibello reviewed Mar 1, 2021

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

fix

e4f0c17

CarloLucibello reviewed Mar 1, 2021

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

yuehhua added 3 commits March 1, 2021 20:22

replace zeros and ones with more generic way

ec402d9

remove bound checks

c6213c6

optimize

4d5cbe8

CarloLucibello reviewed Mar 2, 2021

View reviewed changes

src/scatter.jl Outdated Show resolved Hide resolved

src/scatter.jl Outdated Show resolved Hide resolved

src/scatter.jl Outdated Show resolved Hide resolved

update docs

fc7360d

CarloLucibello reviewed Mar 3, 2021

View reviewed changes

src/utils.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Mar 3, 2021

View reviewed changes

remove not used utilities

6542ea9

CarloLucibello merged commit b59cf53 into FluxML:master Mar 5, 2021

DhairyaLGandhi mentioned this pull request Apr 8, 2021

scatter and gather support element type of idx to be CartesianIndex #308

Merged

yuehhua mentioned this pull request May 28, 2021

merge into NNlib and CUDA? yuehhua/ScatterNNlib.jl#32

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add scatter operations #255

Add scatter operations #255

yuehhua commented Dec 26, 2020

mcabbott commented Dec 26, 2020

yuehhua commented Dec 27, 2020

CarloLucibello commented Dec 27, 2020 •

edited

Loading

chengchingwen commented Dec 29, 2020

yuehhua commented Dec 30, 2020 •

edited

Loading

CarloLucibello Mar 3, 2021

yuehhua Mar 3, 2021

yuehhua Mar 3, 2021

mcabbott Mar 3, 2021

yuehhua commented Mar 4, 2021

CarloLucibello commented Mar 5, 2021

Add scatter operations #255

Add scatter operations #255

Conversation

yuehhua commented Dec 26, 2020

mcabbott commented Dec 26, 2020

yuehhua commented Dec 27, 2020

CarloLucibello commented Dec 27, 2020 • edited Loading

chengchingwen commented Dec 29, 2020

yuehhua commented Dec 30, 2020 • edited Loading

CarloLucibello Mar 3, 2021

Choose a reason for hiding this comment

yuehhua Mar 3, 2021

Choose a reason for hiding this comment

yuehhua Mar 3, 2021

Choose a reason for hiding this comment

mcabbott Mar 3, 2021

Choose a reason for hiding this comment

yuehhua commented Mar 4, 2021

CarloLucibello commented Mar 5, 2021

CarloLucibello commented Dec 27, 2020 •

edited

Loading

yuehhua commented Dec 30, 2020 •

edited

Loading