A modular pipeline for extracting, embedding, clustering, and securely archiving Korean document content using BERT-based semantic analysis.
-
Updated
Sep 13, 2025 - Python
A modular pipeline for extracting, embedding, clustering, and securely archiving Korean document content using BERT-based semantic analysis.
📂 Extract, embed, cluster, and securely store Korean text from documents using BERT, enhancing research efficiency and organization.
Add a description, image, and links to the korean-text topic page so that developers can more easily learn about it.
To associate your repository with the korean-text topic, visit your repo's landing page and select "manage topics."