
[SPARK-12468] [Pyspark] extractParamMap returns empty dictionary #10419

Closed · wants to merge 2 commits
Conversation

ZacharySBrown

This addresses an issue where the extractParamMap() method of a fitted model returns an empty dictionary, e.g. (from the PySpark ML API documentation):

from pyspark.mllib.linalg import Vectors
from pyspark.ml.classification import LogisticRegression
from pyspark.ml.param import Param, Params

# Prepare training data from a list of (label, features) tuples.
# (sqlContext is assumed to be available, e.g. from the PySpark shell.)
training = sqlContext.createDataFrame([
    (1.0, Vectors.dense([0.0, 1.1, 0.1])),
    (0.0, Vectors.dense([2.0, 1.0, -1.0])),
    (0.0, Vectors.dense([2.0, 1.3, 1.0])),
    (1.0, Vectors.dense([0.0, 1.2, -0.5]))], ["label", "features"])

# Create a LogisticRegression instance. This instance is an Estimator.
lr = LogisticRegression(maxIter=10, regParam=0.01)
# Print out the parameters, documentation, and any default values.
print("LogisticRegression parameters:\n" + lr.explainParams() + "\n")

# Learn a LogisticRegression model. This uses the parameters stored in lr.
model1 = lr.fit(training)

# Since model1 is a Model (i.e., a transformer produced by an Estimator),
# we can view the parameters it used during fit().
# This prints the parameter (name: value) pairs, where names are unique IDs for this
# LogisticRegression instance.
print("Model 1 was fit using parameters: ")
print(model1.extractParamMap())

@AmplabJenkins

Can one of the admins verify this patch?

@ZacharySBrown ZacharySBrown changed the title [SPARK-12468] [Pyspark] [SPARK-12468] [Pyspark] extractParamMap returns empty dictionary Dec 21, 2015
@yanboliang
Contributor

@ZacharySBrown Thanks for catching this bug. But I think setting a._paramMap with self.extractParamMap() is not appropriate, because that sets the child Model's _paramMap from its parent Estimator's _paramMap. What we should do instead is call _transfer_params_from_java, which transfers the embedded params from the companion Scala/Java model to the Python one.

@yanboliang
Contributor

Furthermore, I think we should update the PySpark ML API documentation you mentioned. If you want to view the parameters used during fit(), you should call model1.parent.extractParamMap() rather than model1.extractParamMap().

@chrispe

chrispe commented Mar 22, 2016

Is there any workaround until this gets fixed? I would like, for example, to be able to save the parameters used for a StringIndexerModel. Is that possible?

@jkbradley
Member

@ZacharySBrown Thanks for this PR. I think it's a duplicate of [SPARK-10931], so could you please close this one? As @yanboliang mentioned, a proper fix will require transferring the Params from Java, which will also require that the Models contain the actual Params. It would be great to get your input on the other PR.

@chrispe92 There is not a great solution, but you can access the underlying Java object via the _java_obj attribute: list(pythonIndexer._java_obj.labels())

@asfgit asfgit closed this in 6acc72a Apr 23, 2016