Skip to content

Conversation

@MechCoder
Copy link
Contributor

Add std, mean to StandardScalerModel
getVectors, findSynonyms to Word2Vec Model
setFeatures and getFeatures to hashingTF

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented Jun 29, 2015

Test build #35988 has started for PR 7086 at commit ac9397b.

@MechCoder
Copy link
Contributor Author

This breaks model.transform(doc).collect() for Word2VecModel but I do not understand why.

@SparkQA
Copy link

SparkQA commented Jun 29, 2015

Test build #35988 has finished for PR 7086 at commit ac9397b.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Merged build finished. Test FAILed.

@MechCoder
Copy link
Contributor Author

It seems that the 'JMap[String, JList[Float]]' is not serializable.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@MechCoder
Copy link
Contributor Author

I pushed in a hack that works, but now the Scala version looks ugly.

@SparkQA
Copy link

SparkQA commented Jun 29, 2015

Test build #36007 has started for PR 7086 at commit 5b01465.

@SparkQA
Copy link

SparkQA commented Jun 29, 2015

Test build #36007 has finished for PR 7086 at commit 5b01465.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Merged build finished. Test PASSed.

@MechCoder
Copy link
Contributor Author

Is there are a way to make a scala method private such that it is accessible only to the python API? It seems private[python] works only with classes.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented Jun 29, 2015

Test build #36015 has started for PR 7086 at commit 51b1f24.

@SparkQA
Copy link

SparkQA commented Jun 29, 2015

Test build #36015 has finished for PR 7086 at commit 51b1f24.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Merged build finished. Test PASSed.

@JoshRosen
Copy link
Contributor

Jenkins, retest this please.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented Jun 30, 2015

Test build #36102 has started for PR 7086 at commit 51b1f24.

@SparkQA
Copy link

SparkQA commented Jun 30, 2015

Test build #36102 has finished for PR 7086 at commit 51b1f24.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Merged build finished. Test FAILed.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented Jun 30, 2015

Test build #36135 has started for PR 7086 at commit 6180b60.

@SparkQA
Copy link

SparkQA commented Jun 30, 2015

Test build #36135 has finished for PR 7086 at commit 6180b60.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Merged build finished. Test FAILed.

@SparkQA
Copy link

SparkQA commented Jul 1, 2015

Test build #36274 has started for PR 7086 at commit 2715b25.

@MechCoder
Copy link
Contributor Author

@mengxr

I just checked that HashingTF inherits from Has NumFeatures so there is not really anything to do for the first point.

I've just added the methods in scala and python in this PR and it should be open for review now.

@SparkQA
Copy link

SparkQA commented Jul 1, 2015

Test build #36274 has finished for PR 7086 at commit 2715b25.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Merged build finished. Test FAILed.

@MechCoder MechCoder force-pushed the missing_model_methods branch from 2715b25 to 3334d31 Compare July 1, 2015 16:00
@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented Jul 1, 2015

Test build #36279 has started for PR 7086 at commit 3334d31.

@SparkQA
Copy link

SparkQA commented Jul 1, 2015

Test build #36279 has finished for PR 7086 at commit 3334d31.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Merged build finished. Test PASSed.

@MechCoder MechCoder force-pushed the missing_model_methods branch from 3334d31 to 6e3d6b2 Compare July 7, 2015 17:39
@MechCoder MechCoder changed the title [SPARK-8704] [ML] [PySpark] Add additional methods to wrappers in ml.pyspark.feature [SPARK-8704] [ML] [PySpark] Add missing methods in StandardScaler Jul 7, 2015
@MechCoder
Copy link
Contributor Author

@mengxr updated. I thought it might be slightly overkill to add the scala and python version separately :P

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented Jul 7, 2015

Test build #36700 has started for PR 7086 at commit 6e3d6b2.

@SparkQA
Copy link

SparkQA commented Jul 7, 2015

Test build #36700 has finished for PR 7086 at commit 6e3d6b2.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Merged build finished. Test FAILed.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

need the explicit type info for public methods

@mengxr
Copy link
Contributor

mengxr commented Jul 7, 2015

LGTM except the Scala style issue.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented Jul 7, 2015

Test build #36706 has started for PR 7086 at commit 9fbae90.

@SparkQA
Copy link

SparkQA commented Jul 7, 2015

Test build #36706 has finished for PR 7086 at commit 9fbae90.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class DecisionTreeClassificationModel(DecisionTreeModel):
    • class RandomForestClassificationModel(TreeEnsembleModels):
    • class GBTClassificationModel(TreeEnsembleModels):
    • class DecisionTreeModel(JavaModel):
    • class TreeEnsembleModels(JavaModel):
    • class DecisionTreeRegressionModel(DecisionTreeModel):
    • class RandomForestRegressionModel(TreeEnsembleModels):
    • class GBTRegressionModel(TreeEnsembleModels):
    • case class InSet(child: Expression, hset: Set[Any])

@AmplabJenkins
Copy link

Merged build finished. Test PASSed.

@asfgit asfgit closed this in 35d781e Jul 7, 2015
@mengxr
Copy link
Contributor

mengxr commented Jul 7, 2015

Merged into master. Thanks!

@MechCoder MechCoder deleted the missing_model_methods branch July 7, 2015 19:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants