-
Notifications
You must be signed in to change notification settings - Fork 704
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CARBONDATA-2136] Fixed bug related to data load for bad_record_action as REDIRECT or IGNORE and sort scope as NO_SORT #1942
Conversation
Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2320/ |
Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3557/ |
06de5ea
to
48117f7
Compare
Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3558/ |
Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2321/ |
48117f7
to
413a48a
Compare
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3563/ |
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2326/ |
SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3399/ |
SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3400/ |
SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3406/ |
@geetikagupta16 pls update the PR template |
@sraghunandan I have updated the PR template please check. |
while (rowBatch.hasNext()) { | ||
CarbonRow convertRow = localConverter.convert(rowBatch.next()); | ||
rowBatch.setPreviousRow(convertRow); | ||
if (convertRow == null && (badRecordsAction.equals(LoggerAction.IGNORE.toString()) || | ||
badRecordsAction.equals(LoggerAction.REDIRECT.toString()))) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please remove check (badRecordsAction.equals(LoggerAction.IGNORE.toString()) ||
badRecordsAction.equals(LoggerAction.REDIRECT.toString())
BadRecordsAction check is not required, only null check over convertRow is enough.
The convert method return's null only in case of bad_records_action is either IGNORE or REDIRECT.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@geetikagupta16 and @QiangCai
In case of bucketing and Stream Record writer also the same exist.
Please correct there as well.
For your reference:
- For bucketing.
org/apache/carbondata/processing/loading/steps/DataConverterProcessorWithBucketingStepImpl.java:128 - For Stream Record Writer
org/apache/carbondata/hadoop/streaming/CarbonStreamRecordWriter.java:191
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3748/ |
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2508/ |
@mohammadshahidkhan I have made the changes for bucketing. Please review |
@geetikagupta16 |
e94e287
to
74b42f0
Compare
Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3842/ |
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2597/ |
@mohammadshahidkhan I have added test cases for bucketing flow. Will be creating a new JIRA to handle streaming flow as it is throwing ClassCastException when data loading is performed with bad_record_action as IGNORE. |
retest this please |
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2601/ |
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3846/ |
SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3626/ |
74b42f0
to
915f6e3
Compare
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3894/ |
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2649/ |
SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3671/ |
+1 The streaming issue is handled in PR #2014 |
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/3038/ |
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/4282/ |
…n as REDIRECT or IGNORE and sort scope as NO_SORT 2. Added related test cases
915f6e3
to
dad2e4a
Compare
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/3513/ |
Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/4740/ |
retest this please |
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/4746/ |
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/3519/ |
SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/4246/ |
SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/4247/ |
Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/5175/ |
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/3965/ |
SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/4459/ |
retest this please |
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/4610/ |
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/5769/ |
retest this please |
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/4839/ |
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/5998/ |
LGTM |
1 similar comment
LGTM |
…n as REDIRECT or IGNORE and sort scope as NO_SORT Problem: When data loading is performed with bad_record_action as REDIRECT or IGNORE and with sort_scope option as NO_SORT, it was throwing an error as our row batch was getting filled with null. Solution: Refactored code for creating carbon row batch for bad_record_action as REDIRECT or IGNORE and sort scope as NO_SORT This closes apache#1942
Problem: When data loading is performed with bad_record_action as REDIRECT or IGNORE and with sort_scope option as NO_SORT, it was throwing an error as our row batch was getting filled with null.
Solution: Refactored code for creating carbon row batch for bad_record_action as REDIRECT or IGNORE and sort scope as NO_SORT
Be sure to do all of the following checklist to help us incorporate
your contribution quickly and easily:
Any interfaces changed? No
Any backward compatibility impacted? No
Document update required? No
Testing done
Please provide details on
- Whether new unit test cases have been added or why no new tests are required?
- How it is tested? Please attach test report.
- Is it a performance related change? Please attach the performance test report.
- Any additional information to help reviewers in testing this change.
For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.