[gpuCI] Auto-merge branch-0.17 to branch-0.18 [skip ci] #6866

Merged 29 commits on Dec 1, 2020

Commits on Nov 5, 2020

  1. Commit 8015692
  2. updated changelog

    razajafri committed Nov 5, 2020
    Commit a98eac8

Commits on Nov 24, 2020

  1. fix uint32_t undefined errors

    rongou committed Nov 24, 2020
    Commit 6d9b139
  2. add to changelog

    rongou committed Nov 24, 2020
    Commit 490f01a

Commits on Nov 25, 2020

  1. Commit e9aedb2
  2. Split out cudf::distinct_count from drop_duplicates.cu (#6822)

    `drop_duplicates.cu` has one of our longest compile times, currently about 15 minutes. The file contains two cudf APIs, `cudf::drop_duplicates` and `cudf::distinct_count`. They share no code, so it is reasonable to move `distinct_count` to its own source file. After the split, each file compiles in about 7 minutes.
    davidwendt committed Nov 25, 2020
    Commit e1e3047
  3. Commit 4eff46f
  4. updated changelog

    razajafri committed Nov 25, 2020
    Commit 2bb8480
  5. addressed review comments

    razajafri committed Nov 25, 2020
    Commit 6e276be
  6. Commit 38dc99d
  7. reverted CMake changes

    razajafri committed Nov 25, 2020
    Commit d91ddaf
  8. Commit 6f198b4
  9. updated changelog

    razajafri committed Nov 25, 2020
    Commit 5d45e03
  10. Merge pull request #6848 from razajafri/parquet_writer_int96

    [REVIEW] Parquet writer int96 [skip ci]
    razajafri committed Nov 25, 2020
    Commit 250e405

Commits on Nov 26, 2020

  1. Avoid gather when copying strings view from start of strings column (#6837)
    
    Fixes #6803
    
    This optimizes string slice copying in the case where the string column view starts at offset 0. In that case the offset values do not need to be modified, and all the column buffers can be copied.
    jlowe committed Nov 26, 2020
    Commit f3b0e06
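
The offset-0 fast path described in the commit above can be illustrated on the host. This is a minimal sketch, not the cudf implementation: `StringsColumn` and `copy_slice` are hypothetical names, and a strings column is assumed to be one chars buffer plus an offsets array with one more entry than there are rows.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical host-side stand-in for a strings column.
struct StringsColumn {
  std::vector<std::int32_t> offsets;  // row i spans chars[offsets[i], offsets[i + 1])
  std::string chars;
};

// Copy rows [first, last) into a new, self-contained column.
StringsColumn copy_slice(const StringsColumn& col, std::size_t first, std::size_t last)
{
  assert(first <= last && last + 1 <= col.offsets.size());
  const std::int32_t begin = col.offsets[first];
  const std::int32_t end   = col.offsets[last];

  StringsColumn out;
  out.chars.assign(col.chars,
                   static_cast<std::size_t>(begin),
                   static_cast<std::size_t>(end - begin));
  out.offsets.assign(col.offsets.begin() + static_cast<std::ptrdiff_t>(first),
                     col.offsets.begin() + static_cast<std::ptrdiff_t>(last + 1));

  if (first != 0) {
    // General case: rebase every offset so the first one is zero.
    for (auto& o : out.offsets) { o -= begin; }
  }
  // When first == 0 the copied offsets are already valid and stay untouched.
  return out;
}
```

In the `first == 0` branch both buffers are plain copies, which is the situation the commit says needs no offset modification.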
  2. Add support for scatter() on lists-of-struct columns (#6817)

    Addendum to #6768, to support scatter on columns of type list<struct>. The hard part was the tests.
    mythrocks committed Nov 26, 2020
    Commit c34d9bf

Commits on Nov 27, 2020

  1. Commit a6331bf
  2. update Changelog

    GaryShen2008 committed Nov 27, 2020
    Commit ff4f6f0

Commits on Nov 28, 2020

  1. reduce HtoD copies in cudf::concatenate #6605

    Co-authored-by: Karthikeyan Natarajan <karthikeyann@users.noreply.github.com>
    Co-authored-by: Mark Harris <mharris@nvidia.com>
    
    closes #6465
    
    - Add utility `cudf::detail::align_ptr_for_type`
    - Add `contiguous_copy_column_device_views`
    - Reduce multiple HtoD copies in `cudf::concatenate` by adding `create_contiguous_device_views` for a list of `column_view`s
    karthikeyann committed Nov 28, 2020
    Commit 0b58244
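
The general idea of replacing many small host-to-device copies with one can be sketched without CUDA. The names below (`HostSpan`, `align_up`, `pack`) are hypothetical and are not the cudf utilities listed in the commit; the sketch only shows how several host arrays might be laid out in a single staging buffer at aligned offsets so that one transfer can carry all of them.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical descriptor for one host array that must reach the device.
struct HostSpan {
  const void* data;
  std::size_t bytes;
  std::size_t alignment;  // must be a power of two
};

// Round `offset` up to the next multiple of a power-of-two alignment.
inline std::size_t align_up(std::size_t offset, std::size_t alignment)
{
  assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
  return (offset + alignment - 1) & ~(alignment - 1);
}

// Pack all spans into `staging` and return the offset where each one landed.
// A single copy of `staging` could then replace one copy per span.
std::vector<std::size_t> pack(const std::vector<HostSpan>& spans,
                              std::vector<unsigned char>& staging)
{
  std::vector<std::size_t> offsets;
  std::size_t total = 0;
  for (const auto& s : spans) {
    total = align_up(total, s.alignment);
    offsets.push_back(total);
    total += s.bytes;
  }
  staging.assign(total, 0);
  for (std::size_t i = 0; i < spans.size(); ++i) {
    std::memcpy(staging.data() + offsets[i], spans[i].data, spans[i].bytes);
  }
  return offsets;
}
```

The offsets are relative to the start of the buffer, so they yield aligned addresses only if the destination base pointer is itself at least as aligned as the strictest span.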
  2. Merge pull request #6854 from GaryShen2008/fix-writeParquetBufferBegin

    Correct the param order of writeParquetBufferBegin [skip ci]
    razajafri committed Nov 28, 2020
    Commit 45bd967

Commits on Nov 29, 2020

  1. Commit f0f53c7
  2. Replace cuio macros with constexpr and inline functions (#6782)

    Fixes #521 - Replaces macros with constexpr variables and inline functions wherever applicable.
    kaatish committed Nov 29, 2020
    Commit 1771a8f
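
As an illustration of the kind of replacement this commit describes, the sketch below contrasts macro definitions (kept as comments) with `constexpr` and function equivalents. The names and values are hypothetical and are not taken from the cuio sources.

```cpp
// Macro style: untyped, unscoped, and arguments may be evaluated twice.
//   #define BLOCK_SIZE 512
//   #define DIV_UP(a, b) (((a) + (b) - 1) / (b))
//   #define MAX_OF(a, b) ((a) > (b) ? (a) : (b))

namespace sketch {

// Typed and scoped constant instead of an object-like macro.
constexpr int block_size = 512;

// Function instead of a function-like macro: each argument is evaluated once.
template <typename T>
constexpr T div_round_up(T a, T b)
{
  return (a + b - 1) / b;
}

template <typename T>
constexpr T max_of(T a, T b)
{
  return a > b ? a : b;
}

// Still usable in constant expressions, as the macros were.
static_assert(div_round_up(1000, block_size) == 2, "two blocks cover 1000 items");

}  // namespace sketch
```

With the macro form, `MAX_OF(i++, 3)` would increment `i` twice whenever `i` is the larger value; the function form evaluates `i++` exactly once.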

Commits on Nov 30, 2020

  1. Commit 0492519
  2. Move template param to member var to improve compile of hash/groupby.cu (#6835)
    
    This change reduces the compile time for hash/groupby.cu from 16 minutes to 9 minutes.
    The `skip_rows_with_nulls` template parameter is set to avoid calling (and inlining) `cudf::bit_is_set()`. This function is minimal compared to the `cudf::detail::aggregate_row` function that must be inlined twice to accommodate this template parameter. Simply changing this to a member variable means we still do not incur an extra call to `cudf::bit_is_set()` when appropriate but also means we generate half as much device code for this specific function. The `cudf::detail::aggregate_row` code is quite significant.
    davidwendt committed Nov 30, 2020
    Commit 76799a1
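
The trade-off in the commit above can be sketched on the host. Everything here is a simplified assumption: `aggregate_row` and `bit_is_set` are small placeholders that only borrow the names from the commit message, and the two functor types are hypothetical. Variant A is instantiated once per flag value, so the large routine is inlined twice; variant B has a single definition and decides at run time.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Placeholder for a large routine that gets inlined at every call site.
inline void aggregate_row(int& sum, int value) { sum += value; }

// Placeholder for a cheap validity check on a bitmask.
inline bool bit_is_set(const std::vector<std::uint32_t>& mask, std::size_t i)
{
  return ((mask[i / 32] >> (i % 32)) & 1u) != 0;
}

// Variant A: compile-time flag. Each of the two instantiations carries
// its own inlined copy of aggregate_row.
template <bool skip_rows_with_nulls>
struct reduce_fn_templated {
  const std::vector<std::uint32_t>& mask;
  int& sum;
  void operator()(std::size_t row, int value) const
  {
    if (!skip_rows_with_nulls || bit_is_set(mask, row)) { aggregate_row(sum, value); }
  }
};

// Variant B: runtime flag. One definition, one inlined copy of aggregate_row.
// Short-circuit evaluation still skips bit_is_set when the flag is false.
struct reduce_fn_member {
  const std::vector<std::uint32_t>& mask;
  int& sum;
  bool skip_rows_with_nulls;
  void operator()(std::size_t row, int value) const
  {
    if (!skip_rows_with_nulls || bit_is_set(mask, row)) { aggregate_row(sum, value); }
  }
};
```

Both variants produce the same results; the difference is only in how much code the compiler has to generate.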
  3. Fix contiguous split of null string columns (#6853)

    This PR attempts to address issue #6842, which may have more than one cause. One confirmed cause is contiguous splitting of null (fully invalid) string columns, which contain only a single child column (the offsets column). This PR fixes that case.
    sperlingxx committed Nov 30, 2020
    Commit f3c9322
  4. Merge pull request #6844 from rongou/fix-cstdint

    [REVIEW] fix uint32_t undefined errors
    rongou committed Nov 30, 2020
    Commit b8e1ca6

Commits on Dec 1, 2020

  1. Move align_ptr_for_type() from cuda.cuh to alignment.hpp (#6859)

    Closes #6857 
    
    This function contains only C++ STL code and no CUDA code. Also, including it in `column_device_view.cuh` introduced a new dependency on `cub/cub.cuh` (via `cuda.cuh`), as found when building blazingSQL.
    
    Authors:
      - davidwendt <dwendt@nvidia.com>
    
    Approvers:
      - Ram (Ramakrishna Prabhu)
      - Mark Harris
      - Ray Douglass
    
    URL: #6859
    davidwendt committed Dec 1, 2020
    Commit cdc53b7
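
To show why such a helper needs no CUDA headers, here is a minimal sketch built only on `std::align`. The name `align_for` and its signature are assumptions for illustration and may differ from the actual `align_ptr_for_type` in cudf.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>

// Return the first address at or after `p` that is suitably aligned for T.
// Illustrative only; the caller must guarantee the buffer is large enough.
template <typename T>
T* align_for(void* p)
{
  // Worst case adjustment is alignof(T) - 1 bytes, so this much space
  // always lets std::align succeed.
  std::size_t space = sizeof(T) + alignof(T);
  void* aligned     = std::align(alignof(T), sizeof(T), p, space);
  assert(aligned != nullptr);
  return static_cast<T*>(aligned);
}
```

Because it depends only on `<memory>`, a header containing a function like this can be included from host-only code without pulling in device libraries.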
  2. Fix compile error in type_dispatch_benchmark.cu (#6861)

    Removed unused stream parameter from type-dispatcher benchmark source
    davidwendt committed Dec 1, 2020
    Commit 83d2146
  3. Add dictionary support to cudf::reduce (#6666)

    Reference #5963 
    
    This PR adds dictionary column type support to the set of `cudf::reduce` functions.
    This PR depends on utilities added in PR #6651 
    
    Here are the reduce operations that will be included in this PR.
    - [x] all
    - [x] any
    - [x] max
    - [x] mean
    - [x] median
    - [x] min
    - [x] nth_element
    - [x] product
    - [x] quantile
    - [x] std
    - [x] sum_of_squares
    - [x] sum
    - [x] unique_count
    - [x] var
    
    Authors:
      - davidwendt <dwendt@nvidia.com>
    
    Approvers:
      - Mike Wendt
      - AJ Schmidt
      - Ram (Ramakrishna Prabhu)
      - Karthikeyan
    
    URL: #6666
    davidwendt committed Dec 1, 2020
    Commit 1c81827
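
What reducing a dictionary column involves can be sketched on the host. This is not the cudf implementation: `DictionaryColumn`, `reduce_sum`, and `reduce_min` are hypothetical names, nulls are ignored, and the keys are assumed to be unique and sorted ascending, so that the smallest index identifies the smallest value.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <numeric>
#include <vector>

// Hypothetical host-side stand-in for a dictionary-encoded column.
struct DictionaryColumn {
  std::vector<double> keys;            // unique values, assumed sorted ascending
  std::vector<std::uint32_t> indices;  // one entry per row, pointing into keys
};

// Sum: look each row's value up through its index.
double reduce_sum(const DictionaryColumn& col)
{
  return std::accumulate(col.indices.begin(), col.indices.end(), 0.0,
                         [&col](double acc, std::uint32_t i) { return acc + col.keys[i]; });
}

// Min: with sorted keys, the smallest index maps to the smallest value,
// so only the integer indices need to be compared.
double reduce_min(const DictionaryColumn& col)
{
  assert(!col.indices.empty());
  const std::uint32_t smallest = *std::min_element(col.indices.begin(), col.indices.end());
  return col.keys[smallest];
}
```

Order-based reductions such as min and max can work on indices alone under the sorted-keys assumption, while arithmetic reductions such as sum need the key lookup.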