-
Notifications
You must be signed in to change notification settings - Fork 28.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-10194] [MLlib] [PySpark] SGD algorithms need convergenceTol parameter in Python #8457
Conversation
Test build #41619 has finished for PR 8457 at commit
|
:param: miniBatchFraction Fraction of data on which SGD is run for each | ||
:param stepSize: Step size for each iteration of gradient descent. | ||
:param numIterations: Total number of iterations run. | ||
:param miniBatchFraction: Fraction of data on which SGD is run for each |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not your fault, but if you make any additional changes can you add a "." at the end of this sentence?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also, the default values should be documented.
LGTM overall The formal parameter changes to |
@@ -212,7 +218,7 @@ private[python] class PythonMLLibAPI extends Serializable { | |||
initialWeights: Vector, | |||
regType: String, | |||
intercept: Boolean, | |||
validateData: Boolean): JList[Object] = { | |||
validateData: Boolean, convergenceTol: Double): JList[Object] = { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
chop down args
I open SPARK-10560 to track the |
Test build #42317 has finished for PR 8457 at commit
|
LGTM. Merged into master. Thanks! |
SPARK-3382 added a
convergenceTol
parameter for GradientDescent-based methods in Scala. We need that parameter in Python; otherwise, Python users will not be able to adjust that behavior (or even reproduce behavior from previous releases since the default changed).