Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Updated Sep 17, 2024 - Python
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Retrieval and Retrieval-augmented LLMs
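text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.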
A curated list of pretrained sentence and word embedding models
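xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, character radical lookup, sentence representation, and text similarity computation.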
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
1 line for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
A Structured Self-attentive Sentence Embedding
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Compute Sentence Embeddings Fast!
Keyphrase or Keyword Extraction: a Chinese keyword extraction method based on pre-trained models (Chinese-language code for the paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model")
unified embedding model
SGPT: GPT Sentence Embeddings for Semantic Search
Extract knowledge from all information sources using GPT and other language models. Index information sources and run Q&A sessions over them.
A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
A Python vector database you just need - no more, no less.
Exploring simple sentence similarity measures using word embeddings
Papers and Book to look at when starting AGI 📚