[SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pyspark #672
Conversation
Merged build triggered.
Merged build started.
Merged build finished. All automated tests passed.
All automated tests passed.
python/pyspark/mllib/util.py
Outdated
I believe you should use @param and @return for Epydoc; check pyspark/conf.py for an example. Or have you tried generating the docs with this and seen it work?
Epydoc doesn't work on my Mac. I will try to follow the syntax in conf.py.
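For reference, the Epydoc field syntax the review is pointing at looks roughly like this (a minimal sketch; the signature and parameter descriptions here are illustrative, not the exact util.py code):

```python
def loadLibSVMFile(sc, path, numFeatures=-1, minPartitions=None):
    """
    Load labeled points in the LIBSVM format into an RDD.

    @param sc: Spark context
    @param path: file or directory path on any Hadoop-supported file system
    @param numFeatures: number of features; inferred from the data if -1
    @param minPartitions: minimum number of partitions for the resulting RDD
    @return: an RDD of LabeledPoint
    """
```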
Merged build triggered.
Merged build started.
Merged build finished.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14749/
Jenkins, retest this please.
Merged build triggered.
Merged build started.
Merged build finished.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14750/
Jenkins, retest this please.
Merged build triggered.
Merged build started.
Merged build finished. All automated tests passed.
All automated tests passed.
Make loading/saving labeled data easier for pyspark users. Also changed type check in `SparseVector` to allow numpy integers.

Author: Xiangrui Meng <meng@databricks.com>

Closes #672 from mengxr/pyspark-mllib-util and squashes the following commits:

2943fa7 [Xiangrui Meng] format docs
d61668d [Xiangrui Meng] add loadLibSVMFile and saveAsLibSVMFile to pyspark

(cherry picked from commit 3188553)

Signed-off-by: Patrick Wendell <pwendell@gmail.com>
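A minimal usage sketch of the two new helpers, assuming a local-mode SparkContext and a hypothetical file data.txt in LIBSVM format:

```python
from pyspark import SparkContext
from pyspark.mllib.util import MLUtils

sc = SparkContext("local", "libsvm-demo")

# Each line of a LIBSVM file is "label index1:value1 index2:value2 ...";
# loadLibSVMFile parses it into an RDD of LabeledPoint with sparse features.
examples = MLUtils.loadLibSVMFile(sc, "data.txt")
print(examples.first())

# saveAsLibSVMFile writes the labeled points back out in the same format.
MLUtils.saveAsLibSVMFile(examples, "libsvm_output")
```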
Make loading/saving labeled data easier for pyspark users. Also changed type check in `SparseVector` to allow numpy integers.
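A sketch of what the relaxed SparseVector type check enables: indices produced by numpy (e.g. numpy.int64, as returned by np.nonzero or array slicing) are accepted where previously only plain Python ints passed; the concrete values below are illustrative:

```python
import numpy as np
from pyspark.mllib.linalg import SparseVector

# Indices with a numpy integer dtype, not plain Python ints.
indices = np.array([1, 3])
values = np.array([3.0, 4.0])

sv = SparseVector(4, indices, values)
print(sv)  # (4,[1,3],[3.0,4.0])
```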