[HUDI-68] Pom cleanup & demo automation #846

vinothchandar · 2019-08-22T05:46:03Z

No description provided.

vinothchandar · 2019-08-22T05:46:29Z

@thesuperzapper this is now up, with full demo automation.. FYI

vinothchandar · 2019-08-22T16:36:33Z

packaging/hudi-spark-bundle/pom.xml

@@ -127,6 +127,8 @@
                  <pattern>parquet.schema</pattern>
                  <shadedPattern>org.apache.hudi.parquet.schema</shadedPattern>
                </relocation>
+                <!-- TODO: Revisit GH ISSUE #533 & PR#633-->


@bvaradar this needs your special attention.. Had to do this to get hive sync as a part of spark datasource working.. some background in those issues and prs

Ack. We would have to check with other certified combinations of Hive and Spark

vinothchandar · 2019-08-22T21:11:07Z

CI has passed 4 times in a row now. I think its stable enough to merge

bvaradar

Looks overall good. Really like the optimizations around batching beeline commands. Should keep integration tests faster

bvaradar · 2019-08-22T21:27:05Z

packaging/hudi-spark-bundle/pom.xml

@@ -127,6 +127,8 @@
                  <pattern>parquet.schema</pattern>
                  <shadedPattern>org.apache.hudi.parquet.schema</shadedPattern>
                </relocation>
+                <!-- TODO: Revisit GH ISSUE #533 & PR#633-->


Ack. We would have to check with other certified combinations of Hive and Spark

- Fix ordering of dependencies in poms, to enable better resolution - Idea is to place more specific ones at the top - And place dependencies which use them below them

- Move hive queries from hive cli to beeline - Standardize on taking query input from text command files - Deltastreamer ingest, also does hive sync in a single step - Spark Incremental Query materialized as a derived Hive table using datasource - Fix flakiness in HDFS spin up and output comparison - Code cleanup around streamlining and loc reduction - Also fixed pom to not shade some hive classs in spark, to enable hive sync

- [HUDI-172] Cleanup Maven POM/Classpath - Fix ordering of dependencies in poms, to enable better resolution - Idea is to place more specific ones at the top - And place dependencies which use them below them - [HUDI-68] : Automate demo steps on docker setup - Move hive queries from hive cli to beeline - Standardize on taking query input from text command files - Deltastreamer ingest, also does hive sync in a single step - Spark Incremental Query materialized as a derived Hive table using datasource - Fix flakiness in HDFS spin up and output comparison - Code cleanup around streamlining and loc reduction - Also fixed pom to not shade some hive classs in spark, to enable hive sync

vinothchandar requested review from n3nash and bvaradar August 22, 2019 05:46

vinothchandar changed the title ~~Pom cleanup & demo automation~~ [HUDI-62] Pom cleanup & demo automation Aug 22, 2019

vinothchandar commented Aug 22, 2019

View reviewed changes

vinothchandar changed the title ~~[HUDI-62] Pom cleanup & demo automation~~ [HUDI-68] Pom cleanup & demo automation Aug 22, 2019

bvaradar approved these changes Aug 22, 2019

View reviewed changes

thesuperzapper and others added 3 commits August 22, 2019 19:45

[HUDI-172] Cleanup Maven POM/Classpath

e160404

- Fix ordering of dependencies in poms, to enable better resolution - Idea is to place more specific ones at the top - And place dependencies which use them below them

Demo automation

e3b9822

vinothchandar force-pushed the pom-cleanup-demo-automation branch from 0c103db to e23112a Compare August 23, 2019 02:46

vinothchandar merged commit 6edf0b9 into apache:master Aug 23, 2019

thesuperzapper mentioned this pull request Aug 28, 2019

[HUDI-12] Upgrade to Spark 2.4, Avro 1.8.2, Parquet 1.10.0... #638

Closed

yanghua mentioned this pull request Jan 27, 2020

[MINOR] Remove junit-dep dependency #1280

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[HUDI-68] Pom cleanup & demo automation #846

[HUDI-68] Pom cleanup & demo automation #846

vinothchandar commented Aug 22, 2019

vinothchandar commented Aug 22, 2019

vinothchandar Aug 22, 2019

bvaradar Aug 22, 2019

vinothchandar commented Aug 22, 2019

bvaradar left a comment

bvaradar Aug 22, 2019

[HUDI-68] Pom cleanup & demo automation #846

[HUDI-68] Pom cleanup & demo automation #846

Conversation

vinothchandar commented Aug 22, 2019

vinothchandar commented Aug 22, 2019

vinothchandar Aug 22, 2019

Choose a reason for hiding this comment

bvaradar Aug 22, 2019

Choose a reason for hiding this comment

vinothchandar commented Aug 22, 2019

bvaradar left a comment

Choose a reason for hiding this comment

bvaradar Aug 22, 2019

Choose a reason for hiding this comment