[SPARK-13761] [ML] Deprecate validateParams #11620

hhbyyh · 2016-03-10T00:04:21Z

What changes were proposed in this pull request?

Deprecate validateParams() method here:

spark/mllib/src/main/scala/org/apache/spark/ml/param/params.scala

Line 553 in 035d3ac

def validateParams(): Unit = {

Move all functionality in overridden methods to transformSchema().
Check docs to make sure they indicate complex Param interaction checks should be done in transformSchema.

How was this patch tested?

unit tests

SparkQA · 2016-03-10T00:40:33Z

Test build #52781 has finished for PR 11620 at commit 5682bfc.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-03-10T08:34:01Z

Test build #52822 has finished for PR 11620 at commit 3ccf90a.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-03-10T18:34:18Z

Test build #52833 has finished for PR 11620 at commit 29c5a2f.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

jkbradley · 2016-03-15T01:52:01Z

mllib/src/main/scala/org/apache/spark/ml/param/params.scala

   */
+  @deprecated("All the checks should be merged into transformSchema", "2.0.0")


Also say this method will be removed in 2.1.0

jkbradley · 2016-03-15T01:52:16Z

Only a few small comments

jkbradley · 2016-03-16T21:27:20Z

mllib/src/main/scala/org/apache/spark/ml/regression/GeneralizedLinearRegression.scala

@@ -61,7 +63,8 @@ private[regression] trait GeneralizedLinearRegressionBase extends PredictorParam
   * Param for the name of link function which provides the relationship
   * between the linear predictor and the mean of the distribution function.
   * Supported options: "identity", "log", "inverse", "logit", "probit", "cloglog" and "sqrt".
-   * @group param
+    *


indentation

jkbradley · 2016-03-16T21:28:25Z

Thanks for the updates. Still some indentation issues, but that's it.

SparkQA · 2016-03-16T21:34:18Z

Test build #53347 has finished for PR 11620 at commit 91f72a9.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

hhbyyh · 2016-03-16T23:18:15Z

@jkbradley Sorry for those unintentional changes and thanks for the patience.

SparkQA · 2016-03-17T00:10:17Z

Test build #53375 has finished for PR 11620 at commit f348044.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

jkbradley · 2016-03-17T00:31:16Z

No problem. Thanks for the PR!
LGTM
Merging with master

srowen · 2016-03-17T14:34:33Z

mllib/src/main/scala/org/apache/spark/ml/param/params.scala

   */
+  @deprecated("Will be removed in 2.1.0. Checks should be merged into transformSchema.", "2.0.0")


It looks like this now causes a number of deprecation warnings in the Spark code, which we're trying to get rid of. Can most of the remaining usages be transformed to not use this method?

Apologies! I should have checked the Jenkins logs. I'll send a clean-up PR.

Hi @jkbradley have you already started on this? Sorry for the troubling. I didn't remove the ones in CrossValidator and TrainValidationSplit because I think it can be handy if we can run some validation before submitting the paramMap. Let me know if I can help in any way.

No problem; I just sent a PR for it.

## What changes were proposed in this pull request? Cleanups from [#11620]: remove remaining uses of validateParams, and put functionality into transformSchema ## How was this patch tested? Existing unit tests, modified to check using transformSchema instead of validateParams Author: Joseph K. Bradley <joseph@databricks.com> Closes #11790 from jkbradley/SPARK-13761-cleanup.

## What changes were proposed in this pull request? Deprecate validateParams() method here: https://github.com/apache/spark/blob/035d3acdf3c1be5b309a861d5c5beb803b946b5e/mllib/src/main/scala/org/apache/spark/ml/param/params.scala#L553 Move all functionality in overridden methods to transformSchema(). Check docs to make sure they indicate complex Param interaction checks should be done in transformSchema. ## How was this patch tested? unit tests Author: Yuhao Yang <hhbyyh@gmail.com> Closes apache#11620 from hhbyyh/depreValid.

## What changes were proposed in this pull request? Cleanups from [apache#11620]: remove remaining uses of validateParams, and put functionality into transformSchema ## How was this patch tested? Existing unit tests, modified to check using transformSchema instead of validateParams Author: Joseph K. Bradley <joseph@databricks.com> Closes apache#11790 from jkbradley/SPARK-13761-cleanup.

…hon API. ## What changes were proposed in this pull request? apache#14597 modified ```ChiSqSelector``` to support ```fpr``` type selector, however, it left some issue need to be addressed: * We should allow users to set selector type explicitly rather than switching them by using different setting function, since the setting order will involves some unexpected issue. For example, if users both set ```numTopFeatures``` and ```percentile```, it will train ```kbest``` or ```percentile``` model based on the order of setting (the latter setting one will be trained). This make users confused, and we should allow users to set selector type explicitly. We handle similar issues at other place of ML code base such as ```GeneralizedLinearRegression``` and ```LogisticRegression```. * Meanwhile, if there are more than one parameter except ```alpha``` can be set for ```fpr``` model, we can not handle it elegantly in the existing framework. And similar issues for ```kbest``` and ```percentile``` model. Setting selector type explicitly can solve this issue also. * If setting selector type explicitly by users is allowed, we should handle param interaction such as if users set ```selectorType = percentile``` and ```alpha = 0.1```, we should notify users the parameter ```alpha``` will take no effect. We should handle complex parameter interaction checks at ```transformSchema```. (FYI apache#11620) * We should use lower case of the selector type names to follow MLlib convention. * Add ML Python API. ## How was this patch tested? Unit test. Author: Yanbo Liang <ybliang8@gmail.com> Closes apache#15214 from yanboliang/spark-17017.

deprecate validateParameters

5682bfc

hhbyyh added 4 commits March 9, 2016 16:58

Merge remote-tracking branch 'upstream/master' into depreValid

d509310

Merge remote-tracking branch 'upstream/master' into depreValid

f292cb7

Merge remote-tracking branch 'upstream/master' into depreValid

3e727ac

fix glm

3ccf90a

style fix

29c5a2f

jkbradley reviewed Mar 15, 2016
View reviewed changes

hhbyyh added 2 commits March 16, 2016 11:40

Merge remote-tracking branch 'upstream/master' into depreValid

f3fa1b9

comments and format

91f72a9

jkbradley reviewed Mar 16, 2016
View reviewed changes

revert some unintentional comment change

f348044

asfgit closed this in 92b7057 Mar 17, 2016

srowen reviewed Mar 17, 2016
View reviewed changes

jkbradley mentioned this pull request Mar 17, 2016

[SPARK-13761] [ML] Remove remaining uses of validateParams #11790

Closed

yanboliang mentioned this pull request Sep 23, 2016

[SPARK-17017][Follow-up][ML] Refactor of ChiSqSelector and add ML Python API. #15214

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-13761] [ML] Deprecate validateParams #11620

[SPARK-13761] [ML] Deprecate validateParams #11620

hhbyyh commented Mar 10, 2016

SparkQA commented Mar 10, 2016

SparkQA commented Mar 10, 2016

SparkQA commented Mar 10, 2016

jkbradley Mar 15, 2016

jkbradley commented Mar 15, 2016

jkbradley Mar 16, 2016

jkbradley commented Mar 16, 2016

SparkQA commented Mar 16, 2016

hhbyyh commented Mar 16, 2016

SparkQA commented Mar 17, 2016

jkbradley commented Mar 17, 2016

srowen Mar 17, 2016

jkbradley Mar 17, 2016

hhbyyh Mar 17, 2016

jkbradley Mar 17, 2016

		*/
		@deprecated("All the checks should be merged into transformSchema", "2.0.0")

		*/
		@deprecated("Will be removed in 2.1.0. Checks should be merged into transformSchema.", "2.0.0")

[SPARK-13761] [ML] Deprecate validateParams #11620

[SPARK-13761] [ML] Deprecate validateParams #11620

Conversation

hhbyyh commented Mar 10, 2016

What changes were proposed in this pull request?

How was this patch tested?

SparkQA commented Mar 10, 2016

SparkQA commented Mar 10, 2016

SparkQA commented Mar 10, 2016

jkbradley Mar 15, 2016

Choose a reason for hiding this comment

jkbradley commented Mar 15, 2016

jkbradley Mar 16, 2016

Choose a reason for hiding this comment

jkbradley commented Mar 16, 2016

SparkQA commented Mar 16, 2016

hhbyyh commented Mar 16, 2016

SparkQA commented Mar 17, 2016

jkbradley commented Mar 17, 2016

srowen Mar 17, 2016

Choose a reason for hiding this comment

jkbradley Mar 17, 2016

Choose a reason for hiding this comment

hhbyyh Mar 17, 2016

Choose a reason for hiding this comment

jkbradley Mar 17, 2016

Choose a reason for hiding this comment