[SPARK-1876] Windows fixes to deal with latest distribution layout changes #819

mateiz · 2014-05-19T00:25:55Z

Look for JARs in the right place
Launch examples the same way as on Unix
Load datanucleus JARs if they exist
Don't attempt to parse local paths as URIs in SparkSubmit, since paths with C:\ are not valid URIs
Also fixed POM exclusion rules for datanucleus (it wasn't properly excluding it, whereas SBT was)

Also fixed an issue where SparkSubmit was trying to parse local files as URLs, which fails on Windows because the paths contain backslashes. We didn't need to treat those as URLs to check whether a file exists.
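To illustrate the failure mode described above, here is a minimal Java sketch. It is not the actual SparkSubmit code: the class name `LocalPathCheck`, its two helper methods, and the sample paths are all hypothetical. It only shows that `java.net.URI` rejects a string containing backslashes, while `java.io.File` can check for existence without any URI parsing.

```java
import java.io.File;
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper for illustration only; not taken from SparkSubmit.
class LocalPathCheck {

    // Returns true if the string can be parsed by java.net.URI.
    static boolean parsesAsUri(String s) {
        try {
            new URI(s);
            return true;
        } catch (URISyntaxException e) {
            // Backslashes are not legal URI characters, so a raw
            // Windows path such as C:\dir\app.jar ends up here.
            return false;
        }
    }

    // Checks for a local file without going through URI parsing at all.
    static boolean localFileExists(String path) {
        return new File(path).exists();
    }

    public static void main(String[] args) {
        String windowsPath = "C:\\Users\\me\\app.jar"; // made-up example path
        System.out.println("parses as URI: " + parsesAsUri(windowsPath));
        System.out.println("file URI parses: " + parsesAsUri("file:/C:/Users/me/app.jar"));
        System.out.println("exists on disk: " + localFileExists(windowsPath));
    }
}
```

The design point is the same one made above: an existence check only needs the path as a plain string, so skipping URI parsing avoids the Windows-specific failure entirely.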

The datanucleus files are excluded in SBT, but the rule added in Maven didn't actually remove them from the JAR. The JARs built still worked despite this, but it's better to remove them than to have two copies on the classpath.

AmplabJenkins · 2014-05-19T00:27:57Z

Merged build triggered.

AmplabJenkins · 2014-05-19T00:28:07Z

Merged build started.

AmplabJenkins · 2014-05-19T01:08:11Z

Merged build finished. All automated tests passed.

AmplabJenkins · 2014-05-19T01:08:12Z

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15072/

andrewor14 · 2014-05-19T04:03:50Z

bin/run-example

+  shift
+else
+  echo "Usage: ./bin/run-example <example-class> [example-args]"
+  echo "  - set MASTER=XX to use a specific master"


Isn't this deprecated? I thought in general we want people to use --master, since this goes through Spark submit

Ah, I guess that was already there before

andrewor14 · 2014-05-19T05:29:56Z

Did you mean to also remove the hive check here? https://github.com/apache/spark/blob/master/bin/compute-classpath.sh#L93

mateiz · 2014-05-19T05:58:03Z

Yes, I don't want to rely on "jar" being installed. It's not installed by default when you grab a JRE (as far as I can tell). I'd like to eventually do that on Unix too but it's okay to do it in a later release.

mateiz · 2014-05-19T05:58:49Z

My worry with Windows is people downloading pre-built Spark and getting bizarre behavior. I'm assuming most people will work with pre-built Spark (since you'd mostly use Windows for local development), so those who build it by hand can handle a bit more complexity.

andrewor14 · 2014-05-19T06:15:23Z

bin/compute-classpath.cmd

+)
+set "datanucleus_jars="
+for %%d in ("%datanucleus_dir%\datanucleus-*.jar") do (
+  set datanucleus_jars=!datanucleus_jars!;%%d


Hey @mateiz, I just tried this on Windows 7 and my classpath includes the literal string !datanucleus_jars!. It should probably be %datanucleus_jars% instead?

(or did you mean to also setlocal enabledelayedexpansion here?)

Ah, how did you try it? Did you just run compute-classpath? I set this in spark-class.cmd but not in compute-classpath, but I guess both need to work.

AmplabJenkins · 2014-05-19T07:52:58Z

Merged build triggered.

AmplabJenkins · 2014-05-19T07:53:04Z

Merged build started.

mateiz · 2014-05-19T07:53:49Z

Thanks for the review, @andrewor14! I think I've dealt with all the comments (modulo a few I replied to above). The enabledelayedexpansion thing was very weird; if you call compute-classpath from spark-shell2, you should not set it again in compute-classpath, otherwise its changes to variables will not propagate out. But if you run compute-classpath by itself in a new process (as we do to launch executors), you should set it.

AmplabJenkins · 2014-05-19T08:07:57Z

Merged build triggered.

AmplabJenkins · 2014-05-19T08:08:04Z

Merged build started.

AmplabJenkins · 2014-05-19T08:33:36Z

Merged build finished. All automated tests passed.

AmplabJenkins · 2014-05-19T08:33:36Z

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15077/

AmplabJenkins · 2014-05-19T08:43:33Z

Merged build finished. All automated tests passed.

AmplabJenkins · 2014-05-19T08:43:33Z

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/15078/

andrewor14 · 2014-05-19T20:14:36Z

I tested this again on Windows building with and without hive and I can verify that it works as we expect. I think this is ready to go.

tdas · 2014-05-19T21:39:43Z

@mateiz Is this good for merging?

mateiz · 2014-05-19T22:01:10Z

@tdas sure, go for it.

tdas · 2014-05-19T22:01:55Z

Great! Thanks! Merging it.

…anges

- Look for JARs in the right place
- Launch examples the same way as on Unix
- Load datanucleus JARs if they exist
- Don't attempt to parse local paths as URIs in SparkSubmit, since paths with C:\ are not valid URIs
- Also fixed POM exclusion rules for datanucleus (it wasn't properly excluding it, whereas SBT was)

Author: Matei Zaharia <matei@databricks.com>

Closes #819 from mateiz/win-fixes and squashes the following commits:

d558f96 [Matei Zaharia] Fix comment
228577b [Matei Zaharia] Review comments
d3b71c7 [Matei Zaharia] Properly exclude datanucleus files in Maven assembly
144af84 [Matei Zaharia] Update Windows scripts to match latest binary package layout

(cherry picked from commit 7b70a70)
Signed-off-by: Tathagata Das <tathagata.das1565@gmail.com>


mateiz added 2 commits May 18, 2014 16:44

Update Windows scripts to match latest binary package layout

144af84

Also fixed an issue where SparkSubmit was trying to parse local files as URLs, which fails on Windows because the paths contain backslashes. We didn't need to treat those as URLs to check whether a file exists.

Properly exclude datanucleus files in Maven assembly

d3b71c7

They are excluded in SBT, but the rule added in Maven didn't actually remove the files from the JAR. The JARs built still worked despite this, but it's better to remove them than have 2 copies on the classpath.

andrewor14 reviewed May 19, 2014

Review comments

228577b

Fix comment

d558f96

asfgit closed this in 7b70a70 May 19, 2014

Agirish pushed a commit to HPEEzmeral/apache-spark that referenced this pull request May 5, 2022

MapR [SPARK-877] Update Jenkins file to build Spark-3.x from MEP-8.0.…

07c907e

…0 private package branch (apache#819) Co-authored-by: Egor Krivokon <>

udaynpusa pushed a commit to mapr/spark that referenced this pull request Jan 30, 2024

MapR [SPARK-877] Update Jenkins file to build Spark-3.x from MEP-8.0.…

1e51dd8

…0 private package branch (apache#819) Co-authored-by: Egor Krivokon <>
