Flink: add flink-runtime module#1423
Flink: add flink-runtime module#1423rdblue merged 5 commits intoapache:masterfrom openinx:flink-runtime
Conversation
|
I used this patch to copy all the dependencies from Here are the dependencies: ➜ dependencies git:(flink-runtime) ✗ ls -atlr
total 72120
drwxr-xr-x 6 openinx staff 192 Sep 4 17:35 ..
-rw-r--r-- 1 openinx staff 70648 Sep 4 17:35 iceberg-data-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 80259 Sep 4 17:35 iceberg-orc-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 269140 Sep 4 17:35 iceberg-parquet-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 31033 Sep 4 17:35 iceberg-hive-metastore-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 723066 Sep 4 17:35 iceberg-core-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 341213 Sep 4 17:35 iceberg-api-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 31339 Sep 4 17:35 iceberg-common-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 587956 Sep 4 17:35 avro-1.9.2.jar
-rw-r--r-- 1 openinx staff 1781264 Sep 4 17:35 orc-core-1.6.3-nohive.jar
-rw-r--r-- 1 openinx staff 28953 Sep 4 17:35 orc-shims-1.6.3.jar
-rw-r--r-- 1 openinx staff 19512529 Sep 4 17:35 parquet-avro-1.11.0.jar
-rw-r--r-- 1 openinx staff 437637 Sep 4 17:35 parquet-hadoop-1.11.0.jar
-rw-r--r-- 1 openinx staff 1619439 Sep 4 17:35 parquet-column-1.11.0.jar
-rw-r--r-- 1 openinx staff 848718 Sep 4 17:35 parquet-encoding-1.11.0.jar
-rw-r--r-- 1 openinx staff 96197 Sep 4 17:35 parquet-common-1.11.0.jar
-rw-r--r-- 1 openinx staff 700667 Sep 4 17:35 parquet-format-structures-1.11.0.jar
-rw-r--r-- 1 openinx staff 41203 Sep 4 17:35 slf4j-api-1.7.25.jar
-rw-r--r-- 1 openinx staff 15322 Sep 4 17:35 findbugs-annotations-1.3.9-1.jar
-rw-r--r-- 1 openinx staff 1634977 Sep 4 17:35 iceberg-bundled-guava-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 1403755 Sep 4 17:35 jackson-databind-2.10.2.jar
-rw-r--r-- 1 openinx staff 349074 Sep 4 17:35 jackson-core-2.10.2.jar
-rw-r--r-- 1 openinx staff 846706 Sep 4 17:35 caffeine-2.7.0.jar
-rw-r--r-- 1 openinx staff 615064 Sep 4 17:35 commons-compress-1.19.jar
-rw-r--r-- 1 openinx staff 193908 Sep 4 17:35 checker-qual-2.6.0.jar
-rw-r--r-- 1 openinx staff 13161 Sep 4 17:35 error_prone_annotations-2.3.3.jar
-rw-r--r-- 1 openinx staff 181098 Sep 4 17:35 aircompressor-0.15.jar
-rw-r--r-- 1 openinx staff 102244 Sep 4 17:35 jaxb-api-2.2.11.jar
-rw-r--r-- 1 openinx staff 19014 Sep 4 17:35 annotations-17.0.0.jar
-rw-r--r-- 1 openinx staff 233745 Sep 4 17:35 threeten-extra-1.5.0.jar
-rw-r--r-- 1 openinx staff 68080 Sep 4 17:35 jackson-annotations-2.10.2.jar
-rw-r--r-- 1 openinx staff 1786250 Sep 4 17:35 parquet-jackson-1.11.0.jar
-rw-r--r-- 1 openinx staff 2021167 Sep 4 17:35 snappy-java-1.1.7.3.jar
-rw-r--r-- 1 openinx staff 111119 Sep 4 17:35 commons-pool-1.6.jar
-rw-r--r-- 1 openinx staff 26586 Sep 4 17:35 javax.annotation-api-1.3.2.jar
drwxr-xr-x 37 openinx staff 1184 Sep 4 17:35 .
-rw-r--r-- 1 openinx staff 20924 Sep 4 17:35 audience-annotations-0.11.0.jar |
|
Do we expect the Flink cluster has hive execution environment like Spark cluster? In our Flink cluster, we don't have a hive environment so that we require users to upload the application jar along with a hive-exec jar. The spark cluster doesn't need this since it has built-in hive jars. Thoughts? |
|
I'm not quite sure whether we need to pack the hive-exec jar within the flink-runtime jar now. Seems it's a fat jar and include an old version guava dependency ( just notice the comment in build.gradle), I will try the demo under hive environment. |
|
I read the apache flink hive document here, It's recommended to use the bundled So I think we don't need to include the |
|
Thanks @openinx! I wasn't aware Flink has this bundle jar. |
|
@kbendick, should this be labelled |
Hmm yes it definitely should be. I have |
|
Thanks, @openinx! Looks good now. |
This patch will create a separate flink runtime module named
flink-runtime, it will shade the common dependency jars and archive all flink connector related classes into a jar. Now I have the basic verification under my localhost as the following:wget https://www.apache.org/dyn/closer.lua/flink/flink-1.11.1/flink-1.11.1-bin-scala_2.12.tgz tar xzvf flink-1.11.1-bin-scala_2.12.tgz cd flink-1.11.1We will see the runtime jar located like:
It will need still more work to fill the LICENSE and NOTICE in
flink-runtimemodule, and I will test more cases to confirm whether it works well.