[SPARK-9911] [DOC] [ML] Update Userguide for Evaluator #8304

MechCoder · 2015-08-19T06:13:22Z

I added a small note about the different types of evaluator and the metrics used.

MechCoder · 2015-08-19T06:14:29Z

@mengxr

I thought it was unnecessary to add a separate guide for evaluators, so I added a note within the existing example.

SparkQA · 2015-08-19T06:42:05Z

Test build #41221 has finished for PR 8304 at commit 4a4d7a6.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

feynmanliang · 2015-08-19T17:04:10Z

docs/ml-guide.md

+
+The `Evaluator` can be a [`RegressionEvaluator`](api/scala/index.html#org.apache.spark.ml.RegressionEvaluator)
+for regression problems, a [`BinaryClassificationEvaluator`](api/scala/index.html#org.apache.spark.ml.BinaryClassificationEvaluator)
+for binary data or a [`MultiClassClassificationEvaluator`](api/scala/index.html#org.apache.spark.ml.MultiClassClassificationEvaluator)


nit: Oxford comma ("...binary data, or a...")

feynmanliang · 2015-08-19T17:09:27Z

+1 for @mengxr's suggestion on JIRA to break this example out into a separate ml-evaluation.md section. The examples in ml-guide are becoming monolithic, and splitting out another section will let us better document the features available in evaluators (e.g. all the metrics supported, and whether to maximize or minimize the metric, a la #8290).
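
To illustrate the "maximize or minimize" point, here is a minimal pure-Python sketch. It is not Spark code: the names `EVALUATORS`, `pick_best`, `rmse`, and `accuracy` are hypothetical and exist only for this illustration. The assumption is that each evaluator carries a flag saying whether a larger metric value is better, and model selection consults that flag when picking a winner.

```python
import math


def rmse(labels, predictions):
    """Root mean squared error: smaller is better."""
    squared = [(y - p) ** 2 for y, p in zip(labels, predictions)]
    return math.sqrt(sum(squared) / len(labels))


def accuracy(labels, predictions):
    """Fraction of exact matches: larger is better."""
    hits = sum(1 for y, p in zip(labels, predictions) if y == p)
    return hits / len(labels)


# Hypothetical registry: metric name -> (metric function, larger_is_better).
EVALUATORS = {
    "rmse": (rmse, False),
    "accuracy": (accuracy, True),
}


def pick_best(metric_name, labels, candidates):
    """Score every candidate's predictions and return (best_name, scores).

    `candidates` maps a candidate name to its list of predictions.
    """
    metric, larger_is_better = EVALUATORS[metric_name]
    scores = {name: metric(labels, preds) for name, preds in candidates.items()}
    # Use max for metrics to maximize and min for metrics to minimize.
    choose = max if larger_is_better else min
    best = choose(scores, key=scores.get)
    return best, scores
```

If the flag were ignored and the maximum always taken, the RMSE case would select the worst candidate, which is why the direction is worth documenting per metric.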

MechCoder · 2015-08-19T18:58:18Z

That sounds like a great idea. But do you have better suggestions for an example? I would have to use a cross-validator and tune it anyway and I'm afraid there will be code repetition.

feynmanliang · 2015-08-19T19:15:25Z

I don't think we need code examples for every possible combination, otherwise combinatorial complexity is going to bite us hard.

We can:

- List all the evaluators and their features
- List the (currently two?) validators, CV and TrainTestSplit, and their features
- Provide the current code example going through with the binary classification metric
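
As a rough illustration of how the two validators listed above differ, here is a pure-Python sketch. It is not Spark's implementation: the function names `k_fold_splits` and `train_validation_split` are hypothetical, and the splits here are deterministic for simplicity, whereas a real validator would typically split at random. The assumption is only that cross-validation scores each parameter setting k times on rotating held-out folds, while a single split scores it once.

```python
def k_fold_splits(n_rows, k):
    """Yield (training_indices, validation_indices) for each of k folds."""
    indices = list(range(n_rows))
    # Assign rows to folds round-robin so fold sizes differ by at most one.
    folds = [indices[i::k] for i in range(k)]
    for held_out in range(k):
        validation = folds[held_out]
        training = [
            row
            for fold_id, fold in enumerate(folds)
            if fold_id != held_out
            for row in fold
        ]
        yield training, validation


def train_validation_split(n_rows, train_ratio):
    """Return one (training_indices, validation_indices) pair."""
    cut = int(n_rows * train_ratio)
    indices = list(range(n_rows))
    return indices[:cut], indices[cut:]
```

The trade-off this is meant to show: k-fold fits k models per parameter setting and uses every row for validation exactly once, while the single split fits one model per setting and is therefore cheaper but noisier.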

MechCoder · 2015-08-19T19:33:58Z

> otherwise combinatorial complexity is going to bite us hard.

lol. That is not what I meant. Sorry, I should be clearer from here on.

In other words, I was asking whether it would be sufficient to move the Model Selection Example to the new ml-evaluation.md, or whether you had anything else in mind.

feynmanliang · 2015-08-19T20:06:41Z

Oh, sorry!

I think moving the example and providing some guidance about how to choose evaluators/validators is sufficient for now.

MechCoder · 2015-08-24T15:04:36Z

ping @jkbradley ?

jkbradley · 2015-08-25T20:04:38Z

+1 for "moving the example and providing some guidance about how to choose evaluators/validators"

Something simple, but separate, which we can build upon later on.

mengxr · 2015-08-28T04:34:17Z

@MechCoder is busy this week. I will make a PR based on this.

I added a small note about the different types of evaluator and the metrics used.

Author: MechCoder <manojkumarsivaraj334@gmail.com>

Closes #8304 from MechCoder/multiclass_evaluator.

(cherry picked from commit 30734d4)
Signed-off-by: Xiangrui Meng <meng@databricks.com>

mengxr · 2015-08-28T04:44:55Z

Since all comments are minor (or beyond this PR), I'm going to merge this into master and branch-1.5 first and then send another PR to fix issues here and some others.

[SPARK-9911] Update Userguide for Evaluator

4a4d7a6

MechCoder force-pushed the multiclass_evaluator branch from e5547f1 to 4a4d7a6 Compare August 19, 2015 06:18

feynmanliang reviewed Aug 19, 2015

jkbradley mentioned this pull request Aug 26, 2015

[SPARK-9910][ML]User guide for train validation split #8377

Closed

asfgit closed this in 30734d4 Aug 28, 2015

MechCoder deleted the multiclass_evaluator branch August 28, 2015 14:28
