[CARBONDATA-657]added support for shared dictionary columns in spark 2.1 #570

anubhav100 · 2017-01-25T06:04:16Z

In Spark 1.6, shared columns work fine:

0: jdbc:hive2://localhost:10000> CREATE TABLE uniq_shared_dictionary (CUST_ID int,CUST_NAME String,ACTIVE_EMUI_VERSION string, DOB timestamp, DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 decimal(30,10), DECIMAL_COLUMN2 decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1 int) STORED BY 'org.apache.carbondata.format' TBLPROPERTIES('DICTIONARY_INCLUDE'='CUST_ID,Double_COLUMN2,DECIMAL_COLUMN2','columnproperties.CUST_ID.shared_column'='shared.CUST_ID','columnproperties.decimal_column2.shared_column'='shared.decimal_column2');
+---------+--+
| Result |
+---------+--+
+---------+--+

But in Spark 2.1 the same statement throws an exception:

0: jdbc:hive2://hadoop-master:10000> CREATE TABLE uniq_shared_dictionary (CUST_ID int,CUST_NAME String,ACTIVE_EMUI_VERSION string, DOB timestamp, DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 decimal(30,10), DECIMAL_COLUMN2 decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1 int) STORED BY 'org.apache.carbondata.format' TBLPROPERTIES('DICTIONARY_INCLUDE'='CUST_ID,Double_COLUMN2,DECIMAL_COLUMN2','columnproperties.CUST_ID.shared_column'='shared.CUST_ID','columnproperties.decimal_column2.shared_column'='shared.decimal_column2');
Error: org.apache.carbondata.spark.exception.MalformedCarbonCommandException: Invalid table properties columnproperties.cust_id.shared_column (state=,code=0)
LOGS
ERROR 18-01 13:31:18,147 - Error executing query, currentState RUNNING,
org.apache.carbondata.spark.exception.MalformedCarbonCommandException: Invalid table properties columnproperties.cust_id.shared_column

But if the column names are given in lower case, it works fine:

CREATE TABLE uniq_shared_dictionary (cust_id int,CUST_NAME String,ACTIVE_EMUI_VERSION string, DOB timestamp, DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 decimal(30,10), decimal_column2 decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1 int) STORED BY 'org.apache.carbondata.format' TBLPROPERTIES('DICTIONARY_INCLUDE'='CUST_ID,Double_COLUMN2,DECIMAL_COLUMN2','columnproperties.cust_id.shared_column'='shared.cust_id','columnproperties.decimal_column2.shared_column'='shared.decimal_column2');
+---------+--+
| Result |
+---------+--+
+---------+--+
No rows selected (2.644 seconds)

With this PR, users can create shared columns in Spark 2.1 whether the column names are written in upper or lower case.
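To illustrate the idea, here is a minimal sketch of case-insensitive matching for `columnproperties.<column>.shared_column` keys. It is not the CarbonData code from this patch: the function name `validate_shared_columns` and its arguments are hypothetical, and it only assumes that the failure comes from comparing a lower-cased property key against column names that kept their original case.

```python
# Hypothetical sketch; not CarbonData's actual validation code.
PREFIX = "columnproperties."
SUFFIX = ".shared_column"


def validate_shared_columns(columns, table_properties):
    """Return {declared column name: shared dictionary name}.

    Property keys and column names are compared case-insensitively,
    so CUST_ID and cust_id refer to the same column.
    """
    # Map lower-cased column names back to the names as declared in the DDL.
    declared = {name.lower(): name for name in columns}
    shared = {}
    for key, value in table_properties.items():
        lowered = key.lower()
        if not (lowered.startswith(PREFIX) and lowered.endswith(SUFFIX)):
            continue  # not a shared-column property, ignore it here
        column = lowered[len(PREFIX):-len(SUFFIX)]
        if column not in declared:
            # Same message shape as the error reported above.
            raise ValueError(f"Invalid table properties {key}")
        shared[declared[column]] = value
    return shared


columns = ["CUST_ID", "CUST_NAME", "DECIMAL_COLUMN2"]
props = {
    "DICTIONARY_INCLUDE": "CUST_ID,DECIMAL_COLUMN2",
    "columnproperties.CUST_ID.shared_column": "shared.CUST_ID",
    "columnproperties.decimal_column2.shared_column": "shared.decimal_column2",
}
result = validate_shared_columns(columns, props)
```

With this kind of normalization, both the upper-case and the lower-case spelling of a column in the property key resolve to the same declared column, and only a key naming a column that does not exist is rejected.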

CarbonDataQA · 2017-01-25T06:14:21Z

Build Success with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/748/

CarbonDataQA · 2017-01-30T06:00:08Z

Build Success with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/779/

asfbot · 2017-05-28T12:53:46Z

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/carbondata-pr-spark-1.6/76/
Test FAILed.

asfbot · 2017-05-28T13:17:45Z

Refer to this link for build results (access rights to CI server needed):
https://builds.apache.org/job/carbondata-pr-spark-2.1/227/

Build result: ABORTED

[...truncated 635.88 KB...]
[INFO]
[INFO] Apache CarbonData :: Parent ........................ SUCCESS [ 6.463 s]
[INFO] Apache CarbonData :: Common ........................ SUCCESS [ 8.324 s]
[INFO] Apache CarbonData :: Core .......................... SUCCESS [01:49 min]
[INFO] Apache CarbonData :: Processing .................... SUCCESS [ 21.782 s]
[INFO] Apache CarbonData :: Hadoop ........................ SUCCESS [ 23.117 s]
[INFO] Apache CarbonData :: Spark Common .................. SUCCESS [ 34.875 s]
[INFO] Apache CarbonData :: Spark2 ........................ SUCCESS [01:33 min]
[INFO] Apache CarbonData :: Spark Common Test ............. SUCCESS [07:15 min]
[INFO] Apache CarbonData :: Assembly ...................... SUCCESS [ 24.990 s]
[INFO] Apache CarbonData :: Spark2 Examples ............... FAILURE [ 21.502 s]
[INFO] ------------------------------------------------------------------------
[INFO] BUILD FAILURE
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 13:57 min
[INFO] Finished at: 2017-05-28T13:14:45+00:00
[INFO] Final Memory: 107M/1203M
[INFO] ------------------------------------------------------------------------
Waiting for Jenkins to finish collecting data
Build was aborted
Aborted by chenliang613
channel stopped
Setting status of fc1fa0a to FAILURE with url https://builds.apache.org/job/carbondata-pr-spark-2.1/227/ and message: 'Build finished. '
Using context: Jenkins (Spark 2.1): Maven clean install
Test FAILed.

CarbonDataQA · 2017-08-02T08:51:40Z

SDV Build Failed with Spark 2.1, Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/63/

asfgit · 2017-08-02T08:51:41Z

Can one of the admins verify this patch?

asfgit · 2017-08-02T08:51:42Z

Can one of the admins verify this patch?

CarbonDataQA · 2017-09-13T08:14:37Z

Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/95/

CarbonDataQA · 2017-10-23T22:39:08Z

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/632/

ravipesala · 2017-10-23T22:49:10Z

SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/1263/

anubhav100 changed the title ~~added support for shard columns for spark 2.1~~ added support for shared dictionary columns in spark 2.1 Jan 25, 2017

anubhav100 changed the title ~~added support for shared dictionary columns in spark 2.1~~ [CARBONDATA-657]added support for shared dictionary columns in spark 2.1 Jan 25, 2017

added support for shard columns for spark 2.1 when column name is in …

fc1fa0a

…cptial letters

anubhav100 force-pushed the CARBONDATA-657 branch from 59cc503 to fc1fa0a on January 30, 2017 05:50

anubhav100 closed this Nov 15, 2017
