Skip to content

Commit

Permalink
Documentation
Browse files Browse the repository at this point in the history
  • Loading branch information
rezazadeh committed Sep 14, 2014
1 parent e9c6791 commit 3764983
Showing 1 changed file with 4 additions and 4 deletions.
Expand Up @@ -394,7 +394,7 @@ class RowMatrix(
}

/**
* Compute all similarities between columns of this matrix using the brute-force
* Compute all cosine similarities between columns of this matrix using the brute-force
* approach of computing normalized dot products.
*
* @return An n x n sparse upper-triangular matrix of cosine similarities between columns of this matrix.
Expand All @@ -406,14 +406,14 @@ class RowMatrix(
/**
* Compute all similarities between columns of this matrix using a sampling approach.
*
* The threshold parameter is a trade-off knob between correctness and computational cost.
* The threshold parameter is a trade-off knob between estimate quality and computational cost.
*
* Setting a threshold of 0 guarantees deterministic correct results, but comes at exactly
* the same cost as the brute-force approach. Setting the threshold to positive values
* incurs strictly less computational cost than the brute-force aproach, however the
* incurs strictly less computational cost than the brute-force approach, however the
* similarities computed will be estimates.
*
* The sampling guarantees correctness for those pairs of columns that have
* The sampling guarantees relative-error correctness for those pairs of columns that have
* similarity greater than the given similarity threshold.
*
* To describe the guarantee, we set some notation:
Expand Down

0 comments on commit 3764983

Please sign in to comment.