Skip to content

Commit

Permalink
[SPARK-6517][MLLIB] Implement the Algorithm of Hierarchical Clustering
Browse files Browse the repository at this point in the history
I implemented a hierarchical clustering algorithm again.  This PR doesn't include examples, documentation and spark.ml APIs. I am going to send another PRs later.
https://issues.apache.org/jira/browse/SPARK-6517

- This implementation based on a bi-sectiong K-means clustering.
    - It derives from the freeman-lab 's implementation
- The basic idea is not changed from the previous version. (#2906)
    - However, It is 1000x faster than the previous version through parallel processing.

Thank you for your great cooperation, RJ Nowling(rnowling), Jeremy Freeman(freeman-lab), Xiangrui Meng(mengxr) and Sean Owen(srowen).

Author: Yu ISHIKAWA <yuu.ishikawa@gmail.com>
Author: Xiangrui Meng <meng@databricks.com>
Author: Yu ISHIKAWA <yu-iskw@users.noreply.github.com>

Closes #5267 from yu-iskw/new-hierarchical-clustering.

(cherry picked from commit 8a23368)
Signed-off-by: Xiangrui Meng <meng@databricks.com>
  • Loading branch information
yu-iskw authored and mengxr committed Nov 9, 2015
1 parent 9e80db7 commit 1585f55
Show file tree
Hide file tree
Showing 4 changed files with 841 additions and 0 deletions.
Loading

0 comments on commit 1585f55

Please sign in to comment.