[HUDI-68] Pom cleanup & demo automation #846
@thesuperzapper this is now up, with full demo automation. FYI
@@ -127,6 +127,8 @@
<pattern>parquet.schema</pattern>
<shadedPattern>org.apache.hudi.parquet.schema</shadedPattern>
</relocation>
<!-- TODO: Revisit GH ISSUE #533 & PR#633-->
@bvaradar this needs your special attention. I had to do this to get Hive sync working as part of the Spark datasource. There is some background in those issues and PRs.
Ack. We would have to check with other certified combinations of Hive and Spark
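For readers following the shading discussion, here is a minimal sketch of how a maven-shade-plugin relocation can leave selected classes unshaded through `<excludes>`. The package names are illustrative assumptions, and the hunk above does not show whether this PR uses this mechanism or simply drops a relocation.

```xml
<!-- Hypothetical sketch: the patterns below are assumptions, not the PR's actual change -->
<relocation>
  <pattern>org.apache.hive</pattern>
  <shadedPattern>org.apache.hudi.org.apache.hive</shadedPattern>
  <excludes>
    <!-- Classes matching this pattern keep their original package name -->
    <exclude>org.apache.hive.jdbc.*</exclude>
  </excludes>
</relocation>
```

Excluded classes stay under their original names, so they resolve against whatever Hive version the Spark runtime provides, which is why the certified Hive and Spark combinations matter here.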
CI has passed 4 times in a row now. I think it's stable enough to merge.
Looks good overall. Really like the optimizations around batching beeline commands. They should keep the integration tests fast.
- Fix ordering of dependencies in poms, to enable better resolution
  - Idea is to place more specific ones at the top
  - And place dependencies which use them below them
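Declaration order matters because Maven's dependency mediation picks the nearest definition of a conflicting artifact and, when two candidates sit at the same depth, the one declared first. The fragment below is a minimal sketch of the ordering idea; the `com.example` coordinates and the `specific.version` and `consumer.version` properties are hypothetical placeholders, not taken from the Hudi poms.

```xml
<dependencies>
  <!-- Hypothetical: the more specific dependency goes first, so its
       transitive versions win ties at equal depth -->
  <dependency>
    <groupId>com.example</groupId>
    <artifactId>specific-lib</artifactId>
    <version>${specific.version}</version>
  </dependency>
  <!-- Hypothetical: a dependency that uses specific-lib is declared below it -->
  <dependency>
    <groupId>com.example</groupId>
    <artifactId>consumer-lib</artifactId>
    <version>${consumer.version}</version>
  </dependency>
</dependencies>
```

Running `mvn dependency:tree` before and after a reorder shows which versions were actually selected.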
- Move hive queries from hive cli to beeline
- Standardize on taking query input from text command files
- Deltastreamer ingest, also does hive sync in a single step
- Spark Incremental Query materialized as a derived Hive table using datasource
- Fix flakiness in HDFS spin up and output comparison
- Code cleanup around streamlining and loc reduction
- Also fixed pom to not shade some hive classes in spark, to enable hive sync
0c103db to e23112a
- [HUDI-172] Cleanup Maven POM/Classpath
  - Fix ordering of dependencies in poms, to enable better resolution
  - Idea is to place more specific ones at the top
  - And place dependencies which use them below them
- [HUDI-68] : Automate demo steps on docker setup
  - Move hive queries from hive cli to beeline
  - Standardize on taking query input from text command files
  - Deltastreamer ingest, also does hive sync in a single step
  - Spark Incremental Query materialized as a derived Hive table using datasource
  - Fix flakiness in HDFS spin up and output comparison
  - Code cleanup around streamlining and loc reduction
  - Also fixed pom to not shade some hive classes in spark, to enable hive sync