New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tests fail on spark-1.6.X branch with pyspark 1.6.3 #5
Comments
I've tested JPMML-SparkML-Package with Apache Spark 1.6.0, 1.6.1 and 1.6.2, but not with 1.6.3. Must be the case that the Google Guava dependency has been relocated between the 1.6.2 and 1.6.3 versions. You can bypass tests like this:
During runtime, simply add Google Guava dependency ( Will investigate potential fixes. Could introduce a build profile, which builds a "fat" JAR (includes Guava) for Apache Spark version 1.6.3, and a "thin" JAR (excludes Guava) for all earlier versions. |
I removed Apache Spark 1.6.3 and installed 1.6.0 and again the Maven build succeeds but the nosetests do not, with the same traceback. |
Believe it or not, but everything works as advertised in my computer:
End of the output:
|
Maybe it's some Apache Spark packaging issue? What is the name of your distribution, is it "with Hadoop" or "without Hadoop" edition? |
Spark 1.6.0 Pre-built for Apache Hadoop 2.6, tarball downloaded from spark.apache.org/downloads.html; I have the same result with Spark 1.6.2 Pre-built for Apache Hadoop 2.6. Using 2.0.0 Pre-built for Apache Hadoop 2.7 leads to successfully passing the tests on both branches. Could it have to do with the jpmml-sparkml Maven JAR? Which Spark distribution do you use? |
Classpath misery, for the win. I've just been trying to help Robert understand what's going on here. I must confess I'm a little lost:
So I'm a bit perplexed about where the runtime Guava dependency should be coming from? |
In your application project directory, execute Apache Maven command The availability of Guava depends on Apache Spark version (1.6.X vs 2.0.X), and packaging ("with hadoop" or "without hadoop"). In Robert's application environment (#5 (comment)) there is no Guava dependency available (as indicated by <dependency>
<groupId>com.google.guava</groupId>
<artifactId>guava</artifactId>
<version>19.0</version>
</dependency> |
With your suggested addition to the |
Hello,
The maven build as you outline in the README goes fine but the suggested test fails with relevant output:
In case it's relevant, I'm using py4j-0.10.5 which was released after the most recent branch commit.
The text was updated successfully, but these errors were encountered: