[TRAFODION-3171] Refactor Hive sequence file reading to use the new i… #1674
Conversation
Check Test Started: https://jenkins.esgyn.com/job/Check-PR-master/2911/
Test Passed. https://jenkins.esgyn.com/job/Check-PR-master/2911/
+1 Change looks great.
if (readLen <= lenRemain) {
    buf_.put(byteArray, 0, readLen);
    buf_.put(recDelimiter_);
Should we be worried that the 1-byte delimiter will put us past the end of the buffer, as suggested in the comment on line 290? (when readLen == lenRemain)
Yes, I need to account for the extra byte added for the delimiter. Good catch, will fix it.
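The off-by-one concern above can be sketched as follows. This is a minimal, hypothetical illustration (not Trafodion code; class and method names are invented) of why the bounds check must reserve room for the 1-byte delimiter before copying: with the original `readLen <= lenRemain` test, the case `readLen == lenRemain` writes one byte past the available space.

```java
import java.nio.ByteBuffer;

public class DelimiterBoundsCheck {
    static final byte REC_DELIMITER = '\n';

    // Copies readLen bytes of row plus a 1-byte delimiter into buf.
    // The "+ 1" reserves room for the delimiter; without it,
    // readLen == lenRemain would overflow the buffer by one byte.
    static boolean putRow(ByteBuffer buf, byte[] row, int readLen) {
        int lenRemain = buf.remaining();
        if (readLen + 1 <= lenRemain) {
            buf.put(row, 0, readLen);
            buf.put(REC_DELIMITER);
            return true;
        }
        return false; // row plus delimiter does not fit
    }

    public static void main(String[] args) {
        ByteBuffer buf = ByteBuffer.allocate(4);
        byte[] row = {'a', 'b', 'c', 'd'};
        // readLen == lenRemain: rejected, since the delimiter would not fit.
        System.out.println(putRow(buf, row, 4)); // false
        // readLen == lenRemain - 1: row and delimiter both fit.
        System.out.println(putRow(buf, row, 3)); // true
    }
}
```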
@@ -571,6 +569,7 @@ ExWorkProcRetcode ExHdfsScanTcb::work()
    hdfsScan_ = HdfsScan::newInstance((NAHeap *)getHeap(), hdfsScanBuf_, hdfsScanBufMaxSize_,
                   hdfsScanTdb().hdfsIoByteArraySizeInKB_,
                   &hdfsFileInfoListAsArray_, beginRangeNum_, numRanges_, hdfsScanTdb().rangeTailIOSize_,
+                  isSequenceFile(), hdfsScanTdb().recordDelimiter_,
Since we did not pass in recordDelimiter_ previously, do we know whether the previous code worked correctly for text format when the record delimiter was something other than \n? This PR does not seem to change anything for text-format reads, so if there was an issue previously, it might still exist.
There is no need to pass the record delimiter for text format because it is a raw read of a text-formatted table: the data should already contain the record delimiter per the Hive metadata. In the case of sequence files, however, the reader.next API converts (or copies) the raw data into a row without the record delimiter.
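The distinction described above can be sketched as follows. This is an illustrative simulation (not Trafodion code; names are invented): a raw text-format read already carries the delimiters in the byte stream, while a record-oriented reader (like SequenceFile's reader.next) hands back each row without one, so the consumer must re-append the delimiter itself to produce an equivalent stream.

```java
import java.nio.ByteBuffer;
import java.util.List;

public class DelimiterContrast {
    static final byte REC_DELIMITER = '\n';

    // Rows as a record-oriented reader would return them: no delimiters.
    // The consumer re-inserts the delimiter after each row.
    static ByteBuffer assembleFromRecords(List<byte[]> rows, int capacity) {
        ByteBuffer buf = ByteBuffer.allocate(capacity);
        for (byte[] row : rows) {
            buf.put(row);
            buf.put(REC_DELIMITER);
        }
        return buf;
    }

    public static void main(String[] args) {
        // Text format: a raw read already contains the delimiters.
        byte[] rawText = "r1\nr2\n".getBytes();

        // Sequence-file style: rows come back bare, delimiters re-added.
        ByteBuffer assembled = assembleFromRecords(
                List.of("r1".getBytes(), "r2".getBytes()), 16);
        assembled.flip();
        byte[] out = new byte[assembled.remaining()];
        assembled.get(out);

        // Both paths yield the same delimited byte stream.
        System.out.println(new String(out).equals(new String(rawText))); // true
    }
}
```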
Thank you for explaining
…mplementation Fix to resolve the issue highlighted in the review comment
New Check Test Started: https://jenkins.esgyn.com/job/Check-PR-master/2918/
Test Passed. https://jenkins.esgyn.com/job/Check-PR-master/2918/
+1
…mplementation