Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
ddangelov committed Mar 23, 2020
1 parent 54007f3 commit 5091277
Showing 1 changed file with 3 additions and 0 deletions.
3 changes: 3 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -48,6 +48,9 @@ attracted the documents to the dense area are the topic words.
![HDBSCAN Document Clusters](images/hdbscan_docs.png)

**4. For each dense area calculate centroid of document vectors in original dimension. (centroid = topic vector)**
>The red points are outliers and do not get used for calculating the topic vector. The purple points are the documents vectors identified as being part of a dense area are used to calculate the topic vector.
![Topic Vector](images/topic_vector.svg)

**5. Find n-closest word vectors to the resulting topic vector**

Expand Down

0 comments on commit 5091277

Please sign in to comment.