[SPARK-13962][ML] spark.ml Evaluators should support other numeric types for label by BenFradet · Pull Request #12500 · apache/spark

BenFradet · 2016-04-19T16:35:02Z

What changes were proposed in this pull request?

Made BinaryClassificationEvaluator, MulticlassClassificationEvaluator and RegressionEvaluator accept all numeric types for label

How was this patch tested?

Unit tests

…label

SparkQA · 2016-04-19T16:38:45Z

Test build #56235 has finished for PR 12500 at commit 24e45e3.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-04-19T21:02:56Z

Test build #56259 has finished for PR 12500 at commit 2bec69e.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

BenFradet · 2016-04-20T11:42:29Z

pinging @MLnick

MLnick · 2016-04-21T15:29:34Z

mllib/src/main/scala/org/apache/spark/ml/evaluation/RegressionEvaluator.scala

-        (prediction, label)
+      .rdd
+      .map {
+        case Row(prediction: Double, label: Double) => (prediction, label)


minor but can this fit on one line? or have .map { case Row ... }

MLnick · 2016-04-21T15:33:16Z

LGTM. Will leave open a little while in case anyone else wants to take a look. @sethah?

sethah · 2016-04-21T15:58:57Z

mllib/src/test/scala/org/apache/spark/ml/tree/impl/TreeTests.scala

   * @return DataFrame with metadata
   */
-  def setMetadata(data: DataFrame, numClasses: Int, labelColName: String): DataFrame = {
+  def setMetadata(data: DataFrame,


Follow Spark style here:

def setMetadata( data: DataFrame, numClasses: Int, labelColName: String, featuresColName: String): DataFrame = {

BenFradet · 2016-04-21T16:11:22Z

@MLnick @sethah thanks for the reviews, will fix.

sethah · 2016-04-21T16:11:30Z

A couple minor syntax comments, other than that LGTM.

SparkQA · 2016-04-21T17:18:09Z

Test build #56553 has finished for PR 12500 at commit ec54d74.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

MLnick · 2016-04-25T18:57:12Z

mllib/src/test/scala/org/apache/spark/ml/util/MLTestingUtils.scala

+    val thrown = intercept[IllegalArgumentException] {
+      evaluator.evaluate(dfWithStringLabels)
+    }
+    assert(thrown.getMessage contains


Minor style issue, but Spark code style prefers not to use infix notation (see https://cwiki.apache.org/confluence/display/SPARK/Spark+Code+Style+Guide). Could you do thrown.getMessage.contains(...) instead? And also change the occurrence above in L57.

MLnick · 2016-04-25T19:04:03Z

@BenFradet just one more minor style comment, then I think this is ready to merge.

BenFradet · 2016-04-25T19:33:32Z

@MLnick will do

SparkQA · 2016-04-25T20:28:33Z

Test build #56916 has finished for PR 12500 at commit 6c66068.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

MLnick · 2016-04-26T06:56:48Z

@BenFradet Thanks! Merged to master. Thanks @sethah for the review.

BenFradet added 5 commits April 19, 2016 17:01

added features column name parameter to TreeTests.setMetadata

4723508

binary classification evaluator now accepts all numeric types as label

f8fe189

added method to check that an evaluator accepts all numeric types as …

963805c

…label

multiclass classification evaluator now accepts all numeric types as …

0dd9ef6

…label

regression evaluator now accepts all numeric types as label

24e45e3

fixed scalastyle

2bec69e

MLnick reviewed Apr 21, 2016
View reviewed changes

sethah reviewed Apr 21, 2016
View reviewed changes

formatting

ec54d74

MLnick reviewed Apr 25, 2016
View reviewed changes

style issue

6c66068

asfgit closed this in 2a5c930 Apr 26, 2016

Conversation

BenFradet commented Apr 19, 2016

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

SparkQA commented Apr 19, 2016

Uh oh!

SparkQA commented Apr 19, 2016

Uh oh!

BenFradet commented Apr 20, 2016

Uh oh!

MLnick Apr 21, 2016

Choose a reason for hiding this comment

Uh oh!

MLnick commented Apr 21, 2016

Uh oh!

sethah Apr 21, 2016

Choose a reason for hiding this comment

Uh oh!

BenFradet commented Apr 21, 2016

Uh oh!

sethah commented Apr 21, 2016

Uh oh!

SparkQA commented Apr 21, 2016

Uh oh!

MLnick Apr 25, 2016

Choose a reason for hiding this comment

Uh oh!

MLnick commented Apr 25, 2016

Uh oh!

BenFradet commented Apr 25, 2016

Uh oh!

SparkQA commented Apr 25, 2016

Uh oh!

MLnick commented Apr 26, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants