Add a new option for an alternate mirror for spark binaries #104

rmessner · 2016-04-06T09:20:02Z

Fixes #101.

nchammas · 2016-04-06T14:41:16Z

@rmessner thank you for tackling this. As you experienced, it is useful in times when, for whatever reason, the default Spark packages on S3 that Flintrock uses are corrupt.

We will probably want to use this pattern here to also solve #71. As such, I'd like the option name and URL template to be consistent with what's proposed there. Is that OK with you?

nchammas · 2016-04-06T14:43:19Z

flintrock/config.yaml.template

@@ -1,6 +1,7 @@
 services:
  spark:
    version: 1.6.0
+    # preferred-mirror: # optional; default to 'https://s3.amazonaws.com/spark-related-packages/${file}'


As proposed in #71, I prefer {v} or even {version} instead of ${file}, since the latter's exact meaning to the user is not clear.

OK, so to be clear, I will use these variables in the default value in the script:
spark_version
hadoop_distribution

If you're talking about the config template, I think we just need a {version} variable. It seems unnecessary to say spark_... inside the Spark config.

As for the Hadoop distribution, we currently don't support specifying that, unless you are also intending to tackle #88.
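
To make the placeholder discussion concrete, here is a minimal Python sketch of how a `{v}`-style URL template could be expanded, falling back to a default when the option is omitted. It is not Flintrock's actual code: `DEFAULT_DOWNLOAD_SOURCE`, `resolve_download_url`, and the `spark-{v}-bin-hadoop2.6.tgz` file name are assumptions made for illustration. Only the S3 base URL and the `{v}` placeholder come from this thread.

```python
from typing import Optional

# Hypothetical default. The base URL is the one shown in the diff above;
# the file name pattern is an assumption made for this sketch.
DEFAULT_DOWNLOAD_SOURCE = (
    "https://s3.amazonaws.com/spark-related-packages/"
    "spark-{v}-bin-hadoop2.6.tgz"
)


def resolve_download_url(version: str, download_source: Optional[str] = None) -> str:
    """Expand the {v} placeholder in a download template.

    `download_source` stands in for a user-supplied template from the
    config file; when it is None or empty, the default is used.
    """
    template = download_source or DEFAULT_DOWNLOAD_SOURCE
    # str.format leaves a template without {v} unchanged, so a fixed URL
    # with no placeholder also works.
    return template.format(v=version)


if __name__ == "__main__":
    print(resolve_download_url("1.6.0"))
    print(resolve_download_url("1.6.0", "https://example.com/spark/spark-{v}.tgz"))
```

A single short placeholder such as `{v}` tells the user exactly which value gets substituted, which is the concern raised above about `${file}`.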

rmessner · 2016-04-06T14:55:31Z

Okay, I will use mirror instead of preferred_mirror to be consistent across Flintrock.

nchammas · 2016-04-06T15:03:00Z

#71 uses download-source, so I'd prefer that over mirror. I think it's clearer, though more verbose.

rmessner · 2016-04-06T15:06:13Z

Okay, I'll use download-source instead of mirror.

For the variable names, is that okay with you as well?

nchammas · 2016-04-06T15:12:54Z

> For the variable names, is that okay with you as well?

Not sure what you're referring to. I think the proposal laid out in #71 for Hadoop should give you a good template for what to name things.

If you're still not sure, it might be easier to just update the PR and I'll comment on the line items as necessary.

nchammas · 2016-04-09T23:27:30Z

flintrock/config.yaml.template

@@ -1,6 +1,8 @@
 services:
  spark:
    version: 1.6.0
+    # distribution: # optional; default to '2.6'


Style nitpick: Two spaces before the #; "defaults" and not "default"

Hmm, can we leave out the ability to specify distribution for now? I'm not sure about how best to name this option (e.g. there are non-Hadoop distributions like CDH, but we are assuming Hadoop) and, more importantly, I haven't fully considered the implications of supporting user-specified distributions.

BenFradet · 2016-05-27T20:14:31Z

@rmessner are you still working on this?

Raphael MESSNER added 2 commits April 6, 2016 11:17

Add a new option for an alternate mirror for spark binaries

911b5d1

Fix pep8 compliance

40f6d19

rmessner mentioned this pull request Apr 6, 2016

Support launching clusters into private VPCs #14

Closed

nchammas reviewed Apr 6, 2016

rmessner mentioned this pull request Apr 6, 2016

Add option to download Spark from a custom URL #101

Closed

Fix terms consistency AND hadoop distribution within spark

4243801

nchammas reviewed Apr 9, 2016

rmessner mentioned this pull request Apr 10, 2016

Add the option to install hdfs from custom source #109

Closed

BenFradet mentioned this pull request Jun 22, 2016

Add option to download Spark from a custom URL #125

Merged

nchammas closed this in #125 Jun 29, 2016
