Top2Vec learns jointly embedded topic, document and word vectors.
-
Updated
May 12, 2024 - Python
Top2Vec learns jointly embedded topic, document and word vectors.
Find parts of long text or data, allowing for some changes/typos.
⚡ A telegram bot for searching all the stickers (just like @gif).
Expose a Top2Vec model with a REST API.
Simple full text search demo for Google App Engine
hotpdf is a fast PDF parsing library to extract text and find text within PDF documents built on top of pdfminer.six
A static site generator for Zettelkasten notes
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
Simple document search (boolean retrieval or TF-IDF) in Python
Fast fuzzy text search
Implementing Ctrl+F for scanned images using Computer Vision approaches
Basic and Full-text Search in Django
An intelligent document search application using semantic similarity and vector embeddings to find relevant documents based on content rather than just keywords. Built with Python, SentenceTransformers, and ChromaDB.
An inverted index on various Nintendo console games using the GiantBomb API
Searches in a folder with pdfs for a list of specified keywords. Writes the result into a csv file.
Laboratory work on text search course
Build regex patterns to search through base64 encoded text.
Add a description, image, and links to the text-search topic page so that developers can more easily learn about it.
To associate your repository with the text-search topic, visit your repo's landing page and select "manage topics."