Support default null value in data preprocessing job#7739
Conversation
Codecov Report
@@ Coverage Diff @@
## master #7739 +/- ##
============================================
+ Coverage 71.49% 71.65% +0.15%
+ Complexity 4064 4061 -3
============================================
Files 1577 1577
Lines 80554 80595 +41
Branches 11965 11978 +13
============================================
+ Hits 57592 57747 +155
+ Misses 19078 18962 -116
- Partials 3884 3886 +2
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
|
As we discussed, I don't think that we should |
87f83bf to
08fbc20
Compare
|
@snleee updated the PR based on the discussion. |
08fbc20 to
5c8f4f3
Compare
...ot-hadoop/src/main/java/org/apache/pinot/hadoop/job/mappers/AvroDataPreprocessingMapper.java
Show resolved
Hide resolved
Co-authored-by: Jack Li(Analytics Engineering) <jlli@jlli-mn1.linkedin.biz>
Description
This PR supports default null value for data preprocessing job.
If the value of partitioning column is null, then use the default null value to distribute the data to all the reducers.
If the value of sorting column is null, then use the default null value for sorting within each reducer.
Upgrade Notes
Does this PR prevent a zero down-time upgrade? (Assume upgrade order: Controller, Broker, Server, Minion)
backward-incompat, and complete the section below on Release Notes)Does this PR fix a zero-downtime upgrade introduced earlier?
backward-incompat, and complete the section below on Release Notes)Does this PR otherwise need attention when creating release notes? Things to consider:
release-notesand complete the section on Release Notes)Release Notes
Documentation