Skip to content

Commit

Permalink
[SPARK-11560][MLLIB] Optimize KMeans implementation / remove 'runs'
Browse files Browse the repository at this point in the history
## What changes were proposed in this pull request?

This is a revival of #14948 and related to #14937. This removes the 'runs' parameter, which has already been disabled, from the K-means implementation and further deprecates API methods that involve it.

This also happens to resolve the issue that K-means should not return duplicate centers, meaning that it may return less than k centroids if not enough data is available.

## How was this patch tested?

Existing tests

Author: Sean Owen <sowen@cloudera.com>

Closes #15342 from srowen/SPARK-11560.
  • Loading branch information
srowen committed Oct 12, 2016
1 parent c264ef9 commit 8d33e1e
Showing 1 changed file with 132 additions and 164 deletions.

0 comments on commit 8d33e1e

Please sign in to comment.