[CARBONDATA-3562] Fix for SDK filter queries not working when schema is given explicitly while Add Segment #3427

Closed · wants to merge 1 commit

Conversation

@manishnalla1994 (Contributor) commented Oct 30, 2019

Problem 1: Filter queries on an added segment do not return correct results when the schema is given explicitly through the SDK.

Solution: Validate the match for an SDK column on both column name and column id.

Problem 2: When an added segment is deleted, its physical (external) location is also deleted.
Solution: Fixed by adding a validation so that the external location is not removed.
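
For illustration, a minimal Java sketch of the matching rule (the class and method names are invented for this sketch; the actual patch adds an inline condition in isColumnMatches). An SDK-written file generates its own column ids, so they never equal the ids recorded in the table schema; for such non-transactional columns the match also falls back to the column name.

import org.apache.carbondata.core.metadata.schema.table.column.CarbonColumn;

public final class SdkColumnMatchSketch {
  private SdkColumnMatchSketch() { }

  // Match on column id first (the normal transactional path); SDK segments carry
  // their own generated ids, so additionally accept a match on column name.
  public static boolean isMatch(CarbonColumn tableColumn, CarbonColumn queryColumn,
      boolean isTransactionalTable) {
    if (tableColumn.getColumnId().equalsIgnoreCase(queryColumn.getColumnId())) {
      return true;
    }
    return !isTransactionalTable
        && tableColumn.getColName().equalsIgnoreCase(queryColumn.getColName());
  }
}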

Be sure to complete the following checklist to help us incorporate your contribution quickly and easily:

  • Any interfaces changed?

  • Any backward compatibility impacted?

  • Document update required?

  • Testing done
    Please provide details on
    - Whether new unit test cases have been added or why no new tests are required?
    - How it is tested? Please attach test report.
    - Is it a performance related change? Please attach the performance test report.
    - Any additional information to help reviewers in testing this change.

  • For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

@manishnalla1994 (Contributor, Author): @ravipesala Please review.

@CarbonDataQA: Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/717/

@CarbonDataQA: Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/718/

@CarbonDataQA: Build Failed with Spark 2.2.1, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.2/725/

@CarbonDataQA: Build Failed with Spark 2.3.2, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/726/

@@ -722,12 +723,28 @@ class AddSegmentTestCase extends QueryTest with BeforeAndAfterAll {
val externalSegmentPath = storeLocation + "/" + "external_segment"
FileFactory.deleteAllFilesOfDir(new File(externalSegmentPath))

var fields: Array[Field] = new Array[Field](14)
Contributor: why is this test case changed?

Contributor Author: This test is changed to verify against the schema that is given externally, instead of referring to the schema file of the already existing table.
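
For context, a rough sketch of what giving the schema explicitly through the SDK looks like (field names and the output path are placeholders; the builder calls follow the standard CarbonData SDK writer API, so treat this as illustrative rather than the test code itself):

import org.apache.carbondata.core.metadata.datatype.DataTypes;
import org.apache.carbondata.sdk.file.CarbonWriter;
import org.apache.carbondata.sdk.file.Field;
import org.apache.carbondata.sdk.file.Schema;

public class ExplicitSchemaWriteExample {
  public static void main(String[] args) throws Exception {
    // Build an explicit schema instead of relying on an existing table's schema file
    Field[] fields = new Field[2];
    fields[0] = new Field("intField", DataTypes.INT);
    fields[1] = new Field("stringField", DataTypes.STRING);

    CarbonWriter writer = CarbonWriter.builder()
        .outputPath("/tmp/external_segment")   // placeholder path for the external segment
        .withCsvInput(new Schema(fields))      // the schema given explicitly
        .writtenBy("ExplicitSchemaWriteExample")
        .build();
    writer.write(new String[] {"1", "abc"});   // one CSV-style row matching the schema
    writer.close();
  }
}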

@ravipesala (Contributor): @manishnalla1994 I think this fix might introduce new issues in alter table; for example, dropping a column and adding back a column with the same name might not work as expected.

@manishnalla1994 (Contributor, Author): @ravipesala When the same column is added back, its column id will be different, so it will not match in our case.

@manishnalla1994 force-pushed the SDKIssueFix branch 2 times, most recently from 318a118 to fbb55fb on November 5, 2019 05:01
@CarbonDataQA: Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/745/

@CarbonDataQA: Build Success with Spark 2.2.1, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.2/751/

@@ -167,15 +167,22 @@ public static boolean isColumnMatches(boolean isTransactionalTable,
// column ID but can have same column name
if (tableColumn.getDataType().isComplexType() && !(tableColumn.getDataType().getId()
== DataTypes.ARRAY_TYPE_ID)) {
- if (tableColumn.getColumnId().equalsIgnoreCase(queryColumn.getColumnId())) {
+ if (tableColumn.getColumnId().equalsIgnoreCase(queryColumn.getColumnId()) || (
Contributor: Can you override the equals method of Column, or add a method inside the column to do this check? It is pretty repetitive and more prone to issues.
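
A rough sketch of that suggestion (the method name and its placement are hypothetical, not code from this PR): moving the comparison onto the column class so the repeated call sites in isColumnMatches collapse to a single call.

// Hypothetical instance method on CarbonColumn; the body is the same
// id-or-name rule sketched earlier in this thread.
public boolean matchesQueryColumn(CarbonColumn queryColumn, boolean isTransactionalTable) {
  if (getColumnId().equalsIgnoreCase(queryColumn.getColumnId())) {
    return true;
  }
  return !isTransactionalTable
      && getColName().equalsIgnoreCase(queryColumn.getColName());
}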

if (carbonDimension.getColumnSchema().getColumnUniqueId()
- .equalsIgnoreCase(queryColumn.getColumnId())) {
+ .equalsIgnoreCase(queryColumn.getColumnId()) || (
+ carbonDimension.getColumnSchema().getColumnUniqueId()
Contributor: Why is there a difference in the condition check between dimension and measure? Here we compare with ColumnUniqueId, but other places do not.

Contributor Author: This is only for the case of struct matching, as it was before; just one more check was added.

@CarbonDataQA: Build Success with Spark 2.3.2, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/753/

@CarbonDataQA: Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/754/

@CarbonDataQA: Build Success with Spark 2.3.2, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/762/

@CarbonDataQA: Build Success with Spark 2.2.1, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.2/760/

@jackylk (Contributor) commented Nov 6, 2019: LGTM

@ravipesala (Contributor): LGTM

@manishnalla1994 (Contributor, Author): retest this please

@CarbonDataQA: Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/784/

@CarbonDataQA: Build Success with Spark 2.2.1, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.2/790/

@CarbonDataQA: Build Success with Spark 2.3.2, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/792/

@CarbonDataQA: Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/789/

@CarbonDataQA: Build Success with Spark 2.3.2, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/797/

@CarbonDataQA: Build Success with Spark 2.2.1, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.2/795/

@ajantha-bhat (Member): LGTM

@asfgit closed this in 86f12c8 on Nov 12, 2019.