Fix missing service files in ES-Hadoop jars #1265

jbaiera · 2019-03-15T22:31:25Z

When creating the jar files for ES-Hadoop, each integration copies the contents of the MR jar into itself since the MR jar contains all the core code. Once each jar is built, they all contribute their contents to the top level elasticsearch-hadoop jar (ignoring duplicate code files). A problem occurs during these jar transitions: The contents of META-INF/services are not copied along. This previously would manifest as not being able to create a Spark SQL dataframe using the short name "es" when using the elasticsearch-hadoop-x.x.x.jar. Creating the dataframe using the short name would work fine when using the elasticsearch-spark-yy_zz-x.x.x.jar because it contains the appropriate service file, which is never copied up to the root jar.

Now that we have Kerberos integrated, there are several items in different projects services directories that all need to be copied around in order for different Kerberos features in Hadoop and Spark to function normally.

We did not encounter these problems because we make use of a separate hadoop testing jar, which is created directly from the sources of the projects instead of from the jar files, and which includes all the test and integration test sources.

This PR ensures that the contents of the mr project's META-INF/services directory are copied into the hive, pig, spark, and storm jars, and that the contents of all of integrations META-INF/services directories are copied into the root elasticsearch-hadoop jar.

jakelandis

LGTM

This PR ensures that the contents of the mr project's META-INF/services directory are copied into the hive, pig, spark, and storm jars, and that the contents of all of integrations META-INF/services directories are copied into the root elasticsearch-hadoop jar.

jbaiera added 3 commits March 15, 2019 18:17

Update project build to work with git worktrees

3d1905d

Include META-INF/services in root project jar

bf38f18

Include the contents of mr's META-INF/services in all integrations

d87a81f

jbaiera added bug :Build labels Mar 15, 2019

jbaiera requested a review from jakelandis March 15, 2019 22:33

jakelandis approved these changes Mar 18, 2019

View reviewed changes

jbaiera merged commit f61227c into elastic:master Mar 19, 2019

jbaiera deleted the fix-missing-service-files branch March 19, 2019 15:57

jbaiera added v7.0.0 v6.7.0 v8.0.0-alpha1 v7.2.0 labels Mar 19, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix missing service files in ES-Hadoop jars #1265

Fix missing service files in ES-Hadoop jars #1265

jbaiera commented Mar 15, 2019

jakelandis left a comment

Fix missing service files in ES-Hadoop jars #1265

Fix missing service files in ES-Hadoop jars #1265

Conversation

jbaiera commented Mar 15, 2019

jakelandis left a comment

Choose a reason for hiding this comment