Navigation Menu

Skip to content

Commit

Permalink
k-means: Improve performance with array
Browse files Browse the repository at this point in the history
Jira: MADLIB-454

It seems sparse vector was the bottleneck of the scalability.  We see 10
times faster in some of our use cases by using arrays instead of sparse
vectors.  Although we still have concerns around space efficiency, we
saw the arrays are also packed enough by the compression and toast mechanism.
Let's see by having this fix if our test cases verify the improvement.
  • Loading branch information
yi ren authored and haradh1 committed Sep 6, 2012
1 parent 4b2effb commit 57a362e
Show file tree
Hide file tree
Showing 3 changed files with 508 additions and 69 deletions.

0 comments on commit 57a362e

Please sign in to comment.