Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extract Dremel encoding code from Parquet #11461

Merged
merged 70 commits into from
Aug 4, 2022

Conversation

vyasr
Copy link
Contributor

@vyasr vyasr commented Aug 4, 2022

Description

This PR removes the Dremel encoding logic from Parquet-specific files and places it into a separate set of files for consumption by non-Parquet code. This PR also includes a minor rename of utilities/column.hpp->utilities/linked_column.hpp to more accurately reflect the contents of that file.

These changes were split out from #11129 to minimize future conflicts with Parquet development (which is very active at present) and to allow further refactoring and other improvements on this Dremel code to proceed independently of the list lexicographic comparator.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Many iterations already happened. I just realized late that I should commit
early return and remove unnecessary statements
@vyasr vyasr added 3 - Ready for Review Ready for review by team libcudf Affects libcudf (C++/CUDA) code. improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Aug 4, 2022
@vyasr vyasr requested a review from a team as a code owner August 4, 2022 00:26
@vyasr vyasr added this to PR-WIP in v22.10 Release via automation Aug 4, 2022
@vyasr vyasr self-assigned this Aug 4, 2022
@vyasr vyasr requested review from a team as code owners August 4, 2022 00:26
@github-actions github-actions bot added CMake CMake build issue conda labels Aug 4, 2022
Copy link
Contributor

@bdice bdice left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approving this as it is a move and slight refactor to namespaces. I have already reviewed this code in #11129 and identified some areas for improvement. We need to keep momentum while avoiding conflicts between the lexicographic comparator work and Parquet work.

@codecov
Copy link

codecov bot commented Aug 4, 2022

Codecov Report

❗ No coverage uploaded for pull request base (branch-22.10@d86bb39). Click here to learn what that means.
The diff coverage is n/a.

@@               Coverage Diff               @@
##             branch-22.10   #11461   +/-   ##
===============================================
  Coverage                ?   86.47%           
===============================================
  Files                   ?      144           
  Lines                   ?    22856           
  Branches                ?        0           
===============================================
  Hits                    ?    19765           
  Misses                  ?     3091           
  Partials                ?        0           

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

v22.10 Release automation moved this from PR-WIP to PR-Reviewer approved Aug 4, 2022
@vyasr
Copy link
Contributor Author

vyasr commented Aug 4, 2022

@gpucibot merge

@rapids-bot rapids-bot bot merged commit d8c25a1 into rapidsai:branch-22.10 Aug 4, 2022
v22.10 Release automation moved this from PR-Reviewer approved to Done Aug 4, 2022
@vyasr vyasr deleted the refactor/dremel branch August 4, 2022 20:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3 - Ready for Review Ready for review by team CMake CMake build issue improvement Improvement / enhancement to an existing function libcudf Affects libcudf (C++/CUDA) code. non-breaking Non-breaking change
Projects
No open projects
Development

Successfully merging this pull request may close these issues.

None yet

5 participants