
[SPARK-6363][BUILD] Make Scala 2.11 the default Scala version #10608

Closed
wants to merge 10 commits

Conversation

JoshRosen
Contributor

This patch changes Spark's build to make Scala 2.11 the default Scala version. To be clear, this does not mean that Spark will stop supporting Scala 2.10: users will still be able to compile Spark for Scala 2.10 by following the instructions on the "Building Spark" page; however, it does mean that Scala 2.11 will be the default Scala version used by our CI builds (including pull request builds).

The Scala 2.11 compiler is faster than 2.10, so I think we'll be able to look forward to a slight speedup in our CI builds (it looks like it's about 2X faster for the Maven compile-only builds, for instance).

After this patch is merged, I'll update Jenkins to add new compile-only jobs to ensure that Scala 2.10 compilation doesn't break.

@JoshRosen
Contributor Author

Note: I'm happy to defer merging this pull request for a while. I just happened to have the changes ready locally and figured that it would be nice to test them on Jenkins.

@SparkQA

SparkQA commented Jan 6, 2016

Test build #48796 has finished for PR 10608 at commit 623a929.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@JoshRosen JoshRosen closed this Jan 6, 2016
@JoshRosen JoshRosen reopened this Jan 27, 2016
@SparkQA

SparkQA commented Jan 27, 2016

Test build #50218 has finished for PR 10608 at commit 18c5223.

  • This patch fails to build.
  • This patch merges cleanly.
  • This patch adds no public classes.

JoshRosen referenced this pull request Jan 27, 2016
The high level idea is that instead of having the executors send both accumulator updates and TaskMetrics, we should have them send only accumulator updates. This eliminates the need to maintain both code paths since one can be implemented in terms of the other. This effort is split into two parts:

**SPARK-12895: Implement TaskMetrics using accumulators.** TaskMetrics is basically just a bunch of accumulable fields. This patch makes TaskMetrics a syntactic wrapper around a collection of accumulators so we don't need to send TaskMetrics from the executors to the driver.

**SPARK-12896: Send only accumulator updates to the driver.** Now that TaskMetrics are expressed in terms of accumulators, we can capture all TaskMetrics values if we just send accumulator updates from the executors to the driver. This completes the parent issue SPARK-10620.

While an effort has been made to preserve as much of the public API as possible, there were a few known breaking DeveloperApi changes that would be very awkward to maintain. I will gather the full list shortly and post it here.

Note: This was once part of #10717. This patch is split out into its own patch from there to make it easier for others to review. Other smaller pieces have already been merged into master.

Author: Andrew Or <andrew@databricks.com>

Closes #10835 from andrewor14/task-metrics-use-accums.
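
As a rough illustration of the "TaskMetrics as a wrapper around accumulators" idea described in the referenced patch above, here is a toy Scala sketch. It is hypothetical (the class and metric names are invented, and it is not Spark's actual TaskMetrics code); it only shows why, once every metric is backed by a named accumulator, shipping name/value accumulator updates from the executors is enough for the driver to reconstruct the metrics.

```scala
// Toy sketch only -- hypothetical names, not Spark's actual TaskMetrics code.
final class LongAccum(val name: String) {
  private var _value = 0L
  def add(v: Long): Unit = _value += v
  def value: Long = _value
}

final class ToyTaskMetrics {
  val bytesRead   = new LongAccum("internal.metrics.input.bytesRead")
  val recordsRead = new LongAccum("internal.metrics.input.recordsRead")

  // The metrics object is just a thin view over its accumulators, so only
  // name -> value pairs need to travel from the executors to the driver.
  def accumulatorUpdates: Map[String, Long] =
    Seq(bytesRead, recordsRead).map(a => a.name -> a.value).toMap
}
```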
@JoshRosen
Contributor Author

/cc @srowen @pwendell @yhuai for review

@srowen
Member

srowen commented Jan 27, 2016

That looks like what I'd expect -- roughly what you get from running the change-version script and updating the list of dependencies?

@@ -127,13 +127,4 @@
</plugin>
</plugins>
</build>
<profiles>
<!-- Quasiquotes are merged into scala reflect from scala 2.11 onwards. -->
<profile>
Member


Is this profile still necessary when 2.10 is enabled?

Contributor Author


Nah, I think it's no longer needed. This was from the days when Spark SQL depended on the Quasiquote compiler for its code generation, but I think this profile became redundant once we started using Janino and removed the old codegen.
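
(For context, quasiquotes are the q"..." tree-construction interpolators. Under Scala 2.10 they lived in a separate org.scalamacros artifact, which is presumably what this profile pulled in; from 2.11 onwards they ship inside scala-reflect, so an illustrative snippet like the following, which is not from Spark, compiles with no extra dependency or Maven profile.)

```scala
import scala.reflect.runtime.universe._

object QuasiquoteDemo extends App {
  // In Scala 2.11+ the q"..." interpolator comes with scala-reflect itself,
  // so no separate quasiquotes dependency (and no build profile) is required.
  val tree: Tree = q"def add(a: Int, b: Int): Int = a + b"
  println(showCode(tree))
}
```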

@JoshRosen
Contributor Author

Jenkins, retest this please.

@JoshRosen
Contributor Author

@srowen, yep, this is pretty much just the result of running the version-change and dependency-update scripts, followed by a grep through the documentation and build files to invert all of the 2.10/2.11 directions and build scripts.

@SparkQA

SparkQA commented Jan 27, 2016

Test build #50236 has finished for PR 10608 at commit 18c5223.

  • This patch fails MiMa tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rxin
Contributor

rxin commented Jan 28, 2016

Want to fix the mima issue?

@SparkQA

SparkQA commented Jan 29, 2016

Test build #50321 has finished for PR 10608 at commit 8dd36ab.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@JoshRosen
Contributor Author

Jenkins, retest this please.

@SparkQA

SparkQA commented Jan 29, 2016

Test build #50343 has finished for PR 10608 at commit 8dd36ab.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@JoshRosen
Contributor Author

These look like legitimate test failures. It's a little tricky to reason about which ones are caused by this patch versus longstanding 2.11 compatibility issues, because it turns out that we don't actually run the entire test suite against Scala 2.11 today; we only test compilation.

A lot of the failures seem to occur in HiveThriftBinaryServerSuite; it seems that we time out while waiting for the server to start. I wonder whether there's a transitive dependency change in that subproject which might have caused this.

@JoshRosen
Contributor Author

Looks like the problem might be related to #9816: I think that the REPL log level is being set to WARN, preventing the suite from finding the log messages which indicate that the ThriftServer started up.

@JoshRosen
Contributor Author

I think I found the problem: looks like Utils.isInterp() was returning true even in non-interpreter code because of a small difference in how the 2.11 REPL works.
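
A heavily simplified sketch of the kind of check involved is below. It is hypothetical (the class name and the lookup strategy are assumptions, not the real Utils code); the point is only that "am I running inside the REPL?" tends to be answered by probing for the REPL's entry point, and small differences in how the 2.10 and 2.11 REPLs expose that entry point can make such a check return the wrong answer.

```scala
// Hypothetical illustration only -- not the real Utils.isInterp implementation.
object ReplDetect {
  def isInInterpreter: Boolean =
    try {
      // Naive check: assume the REPL entry point lives at this class name and
      // treat "the class is loadable" as "we are running inside the REPL".
      // That conflation is exactly the kind of thing that can break when the
      // REPL's internals change between Scala 2.10 and 2.11.
      Class.forName("org.apache.spark.repl.Main")
      true
    } catch {
      case _: ClassNotFoundException => false
    }
}
```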

@SparkQA

SparkQA commented Jan 29, 2016

Test build #50367 has finished for PR 10608 at commit 2c901e5.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Jan 29, 2016

Test build #50401 has finished for PR 10608 at commit d24c31d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@JoshRosen
Contributor Author

Okay, I think this should now be ready for final review and sign-off.

@rxin
Contributor

rxin commented Jan 30, 2016

It's a bit hard to know whether the repl changes make sense or not, but I think we just need to try it out and see if problems come up.

LGTM.

@rxin
Contributor

rxin commented Jan 30, 2016

Merging this in master. Hopefully compilation will be faster.

@asfgit asfgit closed this in 289373b Jan 30, 2016
@JoshRosen JoshRosen deleted the SPARK-6363 branch January 30, 2016 08:25
@@ -165,7 +165,7 @@
<!-- managed up from 3.2.1 for SPARK-11652 -->
<commons.collections.version>3.2.2</commons.collections.version>
<scala.version>2.10.5</scala.version>
Contributor


@JoshRosen, why is this version still 2.10.5? Does it need to be modified?
