document-clustering

Here are 4 public repositories matching this topic...

lukacupic / PDF-Document-Management-and-Search-System

Bachelor's Thesis at FER, University of Zagreb, 2018.

tf-idf bachelor-thesis document-clustering document-similarity

Updated Jan 24, 2022
Java

nidhisinha11 / predictive-analytics-2021

Document clustering project using the K-Means algorithm. Requires Stanford CoreNLP as a dependency. From my undergraduate course in Predictive Analytics, taken with Anasse Bari at NYU.

analytics nlp-machine-learning document-clustering k-means-clustering

Updated Nov 9, 2021
Java

hailiang-wang / apache-mahout

mvn -Dhadoop2.version=2.5.0 -Dlucene.version=xxx -DskipTests clean install

natural-language-processing document-clustering

Updated Oct 18, 2017
Java

DDansAbelenda / doc-clusterizer

DocClusterizer is a Java desktop application that analyzes and clusters documents by content similarity. It uses the Lucene and Tika libraries to process several file types, including txt, pdf, docx, and pptx.

tika javafx java-8 lucene kmeans-clustering linkage document-clustering kmeans-algorithm lucene-analyzer unsupervised-clustering fuzzycmeans

Updated Apr 6, 2024
Java
