Implement `cudf::reduce` for `decimal32` and `decimal64` (part 1) #6814

codereport · 2020-11-20T00:52:52Z

This PR resolves a part of #3556.

This is part 1 of 2. The PR implements MIN, MAX, SUM & PRODUCT & NUNIQUE.

Reduction Ops:

  enum Kind {
    SUM,             ///< sum reduction
    PRODUCT,         ///< product reduction
    MIN,             ///< min reduction
    MAX,             ///< max reduction
    COUNT_VALID,     ///< count number of valid elements
    COUNT_ALL,       ///< count number of elements
    ANY,             ///< any reduction
    ALL,             ///< all reduction
    SUM_OF_SQUARES,  ///< sum of squares reduction
    MEAN,            ///< arithmetic mean reduction
    VARIANCE,        ///< groupwise variance
    STD,             ///< groupwise standard deviation
    MEDIAN,          ///< median reduction
    QUANTILE,        ///< compute specified quantile(s)
    ARGMAX,          ///< Index of max element
    ARGMIN,          ///< Index of min element
    NUNIQUE,         ///< count number of unique elements
    NTH_ELEMENT,     ///< get the nth element
    ROW_NUMBER,      ///< get row-number of element
    COLLECT,         ///< collect values into a list
    LEAD,            ///< window function, accesses row at specified offset following current row
    LAG,             ///< window function, accesses row at specified offset preceding current row
    PTX,             ///< PTX UDF based reduction
    CUDA             ///< CUDA UDf based reduction
  };

To Do List:

Operations that "fell out":

NUNIQUE
- Implementation
- Basic unit tests

GPUtester · 2020-11-20T00:53:24Z

Please update the changelog in order to start CI tests.

View the gpuCI docs here.

codecov · 2020-11-20T07:20:13Z

Codecov Report

Merging #6814 (8c994f9) into branch-0.18 (917759b) will increase coverage by 0.43%.
The diff coverage is n/a.

@@               Coverage Diff               @@
##           branch-0.18    #6814      +/-   ##
===============================================
+ Coverage        81.57%   82.01%   +0.43%     
===============================================
  Files               96       96              
  Lines            15912    16267     +355     
===============================================
+ Hits             12980    13341     +361     
+ Misses            2932     2926       -6

Impacted Files	Coverage Δ
python/cudf/cudf/io/feather.py	`100.00% <0.00%> (ø)`
python/cudf/cudf/comm/serialize.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/_fuzz_testing/io.py	`0.00% <0.00%> (ø)`
python/dask_cudf/dask_cudf/_version.py	`0.00% <0.00%> (ø)`
python/dask_cudf/dask_cudf/io/tests/test_csv.py	`100.00% <0.00%> (ø)`
python/dask_cudf/dask_cudf/io/tests/test_orc.py	`100.00% <0.00%> (ø)`
python/dask_cudf/dask_cudf/io/tests/test_json.py	`100.00% <0.00%> (ø)`
...ython/dask_cudf/dask_cudf/io/tests/test_parquet.py	`100.00% <0.00%> (ø)`
python/cudf/cudf/utils/applyutils.py	`98.74% <0.00%> (+0.02%)`	⬆️
python/cudf/cudf/core/join/join.py	`92.44% <0.00%> (+0.03%)`	⬆️
... and 35 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 917759b...8c994f9. Read the comment docs.

…p-reduce2

cpp/src/reductions/reductions.cpp

rgsl888prabhu

LGTM, small questions and changes.

cpp/src/reductions/simple.cuh

cpp/include/cudf/detail/reduction.cuh

cpp/include/cudf/scalar/scalar.hpp

This PR resolves a part of #3556. Supporting `cudf::reduce`: 1. Part 1 (`MIN`, `MAX`, `SUM` & `PRODUCT` & `NUNIQUE`) #6814 2. Part 2 (the rest) ◀️ **Reduction Ops:** **Done in Previous PR** ✔️ `SUM, ///< sum reduction` ✔️ `PRODUCT, ///< product reduction` ✔️ `MIN, ///< min reduction` ✔️ `MAX, ///< max reduction` ✔️ `NUNIQUE, ///< count number of unique elements` **Not supported by `cudf::reduce`:** * [x] `COUNT_VALID, ///< count number of valid elements` * [x] `COUNT_ALL, ///< count number of elements` * [x] `COLLECT, ///< collect values into a list` * [x] `LEAD, ///< window function, accesses row at specified offset following current row` * [x] `LAG, ///< window function, accesses row at specified offset preceding current row` * [x] `PTX, ///< PTX UDF based reduction` * [x] `CUDA ///< CUDA UDf based reduction` * [x] `ARGMAX, ///< Index of max element` * [x] `ARGMIN, ///< Index of min element` * [x] `ROW_NUMBER, ///< get row-number of element` **Won't be supported:** * [x] `ANY, ///< any reduction` * [x] `ALL, ///< all reduction` **To Do / Investigate:** * [x] `SUM_OF_SQUARES, ///< sum of squares reduction` * [x] `MEDIAN, ///< median reduction` * [x] `QUANTILE, ///< compute specified quantile(s)` * [x] `NTH_ELEMENT, ///< get the nth element` **Deferred until requested** * [x] `MEAN, ///< arithmetic mean reduction` * [x] `VARIANCE, ///< groupwise variance` * [x] `STD, ///< groupwise standard deviation` Authors: - Conor Hoekstra <codereport@outlook.com> Approvers: - null - Karthikeyan - David URL: #6980

codereport added 2 - In Progress Currently a work in progress libcudf Affects libcudf (C++/CUDA) code. labels Nov 20, 2020

codereport requested a review from a team as a code owner November 20, 2020 00:52

codereport added this to PR-WIP in v0.17 Release via automation Nov 20, 2020

codereport self-assigned this Nov 20, 2020

codereport requested review from trxcllnt, cwharris and davidwendt and removed request for cwharris November 20, 2020 00:52

codereport mentioned this pull request Nov 20, 2020

[FEA] Fixed-point Decimal type support #3556

Closed

45 tasks

codereport added 2 commits November 19, 2020 19:56

Mising initial (hacky) changes

560de99

Update CHANGELOG

1e7c9c1

codereport force-pushed the fp-reduce branch from 36ebd73 to 1e7c9c1 Compare November 20, 2020 00:57

codereport added 2 commits November 19, 2020 19:59

Merge branch 'branch-0.17' into fp-reduce

dc9df83

CI fix

ad21092

codereport added 3 commits November 21, 2020 16:32

Merge branch 'branch-0.17' into fp-reduce

99e04d7

New changes

2e414c6

Merge branch 'fp-reduce' of https://github.com/codereport/cudf into f…

442c8e1

…p-reduce2

codereport force-pushed the fp-reduce branch 2 times, most recently from 2b48661 to 442c8e1 Compare November 26, 2020 03:29

karthikeyann reviewed Nov 28, 2020

View reviewed changes

cpp/src/reductions/reductions.cpp Outdated Show resolved Hide resolved

codereport added 4 commits December 1, 2020 18:10

Clean up & enhance unit tests

5952e92

Revert reductions.cpp changes

1cab5ef

Reverting some changes

0fb2e51

Merge branch 'branch-0.17' into fp-reduce

93ee8ac

codereport added the non-breaking Non-breaking change label Dec 2, 2020

codereport added 2 commits December 1, 2020 21:34

Add back removed comment

c9f2c7c

More unit tests

0f2413b

codereport added 2 commits December 7, 2020 10:35

Clean up

c14b6ce

Clean up

cc1af7e

codereport changed the title ~~[WIP] Implement cudf::reduce for decimal32 and decimal64~~ Implement cudf::reduce for decimal32 and decimal64 (part 1) Dec 7, 2020

codereport added 3 - Ready for Review Ready for review by team 4 - Needs Review Waiting for reviewer to review or respond and removed 2 - In Progress Currently a work in progress labels Dec 7, 2020

rgsl888prabhu moved this from PR-WIP to PR-Needs review in v0.18 Release Dec 7, 2020

v0.18 Release automation moved this from PR-Needs review to PR-Reviewer approved Dec 7, 2020

rgsl888prabhu approved these changes Dec 7, 2020

View reviewed changes

cpp/src/reductions/simple.cuh Outdated Show resolved Hide resolved

cpp/src/reductions/simple.cuh Outdated Show resolved Hide resolved

karthikeyann requested changes Dec 7, 2020

View reviewed changes

cpp/include/cudf/detail/reduction.cuh Outdated Show resolved Hide resolved

v0.18 Release automation moved this from PR-Reviewer approved to PR-Needs review Dec 7, 2020

codereport added 2 commits December 7, 2020 15:01

Clean up / addressing PR comments

a3d371e

Remove mr parameter for string_view

a558f45

codereport requested a review from karthikeyann December 7, 2020 20:03

davidwendt requested changes Dec 7, 2020

View reviewed changes

cpp/include/cudf/scalar/scalar.hpp Outdated Show resolved Hide resolved

Remove unnecessary scale member

53d91d0

codereport requested a review from davidwendt December 8, 2020 05:58

Merge branch 'branch-0.18' into fp-reduce

8c994f9

davidwendt approved these changes Dec 8, 2020

View reviewed changes

codereport added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team 4 - Needs Review Waiting for reviewer to review or respond labels Dec 8, 2020

v0.18 Release automation moved this from PR-Needs review to PR-Reviewer approved Dec 8, 2020

karthikeyann approved these changes Dec 8, 2020

View reviewed changes

codereport merged commit 9120992 into rapidsai:branch-0.18 Dec 8, 2020

v0.18 Release automation moved this from PR-Reviewer approved to Done Dec 8, 2020

This was referenced Dec 9, 2020

fixed_point_value double-shifts in fixed_point construction #6950

Merged

Implement cudf::reduce for decimal32 and decimal64 (part 2) #6966

Closed

Implement cudf::reduce for decimal32 and decimal64 (part 2) #6980

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement `cudf::reduce` for `decimal32` and `decimal64` (part 1) #6814

Implement `cudf::reduce` for `decimal32` and `decimal64` (part 1) #6814

codereport commented Nov 20, 2020 •

edited

Loading

GPUtester commented Nov 20, 2020

codecov bot commented Nov 20, 2020 •

edited

Loading

rgsl888prabhu left a comment

Implement cudf::reduce for decimal32 and decimal64 (part 1) #6814

Implement cudf::reduce for decimal32 and decimal64 (part 1) #6814

Conversation

codereport commented Nov 20, 2020 • edited Loading

Operations that "fell out":

GPUtester commented Nov 20, 2020

codecov bot commented Nov 20, 2020 • edited Loading

Codecov Report

rgsl888prabhu left a comment

Choose a reason for hiding this comment

Implement `cudf::reduce` for `decimal32` and `decimal64` (part 1) #6814

Implement `cudf::reduce` for `decimal32` and `decimal64` (part 1) #6814

codereport commented Nov 20, 2020 •

edited

Loading

codecov bot commented Nov 20, 2020 •

edited

Loading