SPARK-2519. Eliminate pattern-matching on Tuple2 in performance-critical... #1435

sryza · 2014-07-16T07:41:49Z

... aggregation code

…cal aggregation code

SparkQA · 2014-07-16T07:47:54Z

QA tests have started for PR 1435. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16719/consoleFull

rxin · 2014-07-16T07:48:40Z

Nice. LGTM.

SparkQA · 2014-07-16T09:27:48Z

QA results for PR 1435:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16719/consoleFull

markhamstra · 2014-07-16T14:37:40Z

Hmmm... not sure that I would go so far as to call it "nice". This does make the code slightly more difficult to read and understand, so can we hope that you've got some relative performance numbers that justify this compromise, @sryza ?

sryza · 2014-07-16T16:54:09Z

I'm going off of @mateiz 's report on SPARK-2048 that "we found [this] to be much slower than accessing fields directly".

markhamstra · 2014-07-16T16:59:31Z

Got it. Thanks. That also helps to put some bound (for now) on where we will make such performance optimizations.

rxin · 2014-07-16T18:07:09Z

Merging in master. Thanks.

…cal... ... aggregation code Author: Sandy Ryza <sandy@cloudera.com> Closes apache#1435 from sryza/sandy-spark-2519 and squashes the following commits: 640706a [Sandy Ryza] SPARK-2519. Eliminate pattern-matching on Tuple2 in performance-critical aggregation code

…n in default logging configurations (apache#1435) ### What changes were proposed in this pull request? This PR addresses a performance problem in Log4J 2 related to exception logging: in certain scenarios I observed that Log4J2's default exception stacktrace logging can be ~10x slower than Log4J 1. The problem stems from a new log pattern format in Log4J2 called ["extended exception"](https://logging.apache.org/log4j/2.x/manual/layouts.html#PatternExtendedException), which enriches the regular stacktrace string with information on the name of the JAR files that contained the classes in each stack frame. Log4J queries the classloader to determine the source JAR for each class. This isn't cheap, but this information is cached and reused in future exception logging calls. In certain scenarios involving runtime-generated classes, this lookup will fail and the failed lookup result will _not_ be cached. As a result, expensive classloading operations will be performed every time such an exception is logged. In addition to being very slow, these operations take out a lock on the classloader and thus can cause severe lock contention if multiple threads are logging errors. This issue is described in more detail in [a comment on a Log4J2 JIRA](https://issues.apache.org/jira/browse/LOG4J2-2391?focusedCommentId=16667140&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-16667140) and in a linked blogpost. Spark frequently uses generated classes and lambdas and thus Spark executor logs will almost always trigger this edge-case and suffer from poor performance. By default, if you do not specify an explicit exception format in your logging pattern then Log4J2 will add this "extended exception" pattern (see PatternLayout's alwaysWriteExceptions flag in Log4J's documentation, plus [the code implementing that flag](https://github.com/apache/logging-log4j2/blob/d6c8ab0863c551cdf0f8a5b1966ab45e3cddf572/log4j-core/src/main/java/org/apache/logging/log4j/core/pattern/PatternParser.java#L206-L209) in Log4J2). In this PR, I have updated Spark's default Log4J2 configurations so that each pattern layout includes an explicit %ex so that it uses the normal (non-extended) exception logging format. This is the workaround that is currently recommended on the Log4J JIRA. ### Why are the changes needed? Avoid performance regressions in Spark programs which use Spark's default Log4J 2 configuration and log many exceptions. Although it's true that any program logging exceptions at a high rate should probably just fix the source of the exceptions, I think it's still a good idea for us to try to fix this out-of-the-box performance difference so that users' existing workloads do not regress when upgrading to 3.3.0. ### Does this PR introduce _any_ user-facing change? Yes: it changes the default exception logging format so that it matches Log4J 1's default rather than Log4J 2's. The new format is consistent with behavior in previous Spark versions, but is different than the behavior in the current Spark 3.3.0-rc3. ### How was this patch tested? Existing tests. Closes apache#36747 from JoshRosen/disable-log4j2-extended-exception-pattern. Authored-by: Josh Rosen <joshrosen@databricks.com> Signed-off-by: Josh Rosen <joshrosen@databricks.com> Co-authored-by: Josh Rosen <joshrosen@databricks.com>

SPARK-2519. Eliminate pattern-matching on Tuple2 in performance-criti…

640706a

…cal aggregation code

asfgit closed this in fc7edc9 Jul 16, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SPARK-2519. Eliminate pattern-matching on Tuple2 in performance-critical... #1435

SPARK-2519. Eliminate pattern-matching on Tuple2 in performance-critical... #1435

sryza commented Jul 16, 2014

SparkQA commented Jul 16, 2014

rxin commented Jul 16, 2014

SparkQA commented Jul 16, 2014

markhamstra commented Jul 16, 2014

sryza commented Jul 16, 2014

markhamstra commented Jul 16, 2014

rxin commented Jul 16, 2014

SPARK-2519. Eliminate pattern-matching on Tuple2 in performance-critical... #1435

SPARK-2519. Eliminate pattern-matching on Tuple2 in performance-critical... #1435

Conversation

sryza commented Jul 16, 2014

SparkQA commented Jul 16, 2014

rxin commented Jul 16, 2014

SparkQA commented Jul 16, 2014

markhamstra commented Jul 16, 2014

sryza commented Jul 16, 2014

markhamstra commented Jul 16, 2014

rxin commented Jul 16, 2014