An implementation of the TaxRetrievalBenchmark task for the 🤗 Massive Text Embedding Benchmark (MTEB) framework.
-
Updated
May 26, 2024 - Jupyter Notebook
An implementation of the TaxRetrievalBenchmark task for the 🤗 Massive Text Embedding Benchmark (MTEB) framework.
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Backend for the AI-copilot
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Retrieval and Retrieval-augmented LLMs
Embedding Representation for Indonesian Sentences!
Convert MUSE from TensorFlow to PyTorch and ONNX
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
PyTorch implementation of Self-training approch for short text clustering
ColBERT humor dataset for the task of humor detection, containing 200,000 jokes/news
Data and scripts for training the open source PDF questionnaire extraction component for Harmony Kaggle competition using natural language processing (NLP)
This study aims to investigate the effectiveness of three Transformers (BERT, RoBERTa, XLNet) in handling data sparsity and cold start problems in the recommender system. We present a Transformer-based hybrid recommender system that predicts missing ratings and ex- tracts semantic embeddings from user reviews to mitigate the issues.
A custom cross encoder used to predict the diseases from an input of symptoms
Quranic Lexical/Semantic Search
Finetune mistral-7b-instruct for sentence embeddings
Code for BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings (NAACL2024)
Papers and Book to look at when starting AGI 📚
文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT
Add a description, image, and links to the sentence-embeddings topic page so that developers can more easily learn about it.
To associate your repository with the sentence-embeddings topic, visit your repo's landing page and select "manage topics."