MTEB: Massive Text Embedding Benchmark
-
Updated
Jul 6, 2025 - Python
MTEB: Massive Text Embedding Benchmark
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Study guides for MIT's 15.003 Data Science Tools
Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context — perfect for chatbots, companions, tutors, customer service bots, and all chat-based agents.
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
A realtime serving engine for Data-Intensive Generative AI Applications
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
SGPT: GPT Sentence Embeddings for Semantic Search
Epsilla is a high performance Vector Database Management System
Parsing-free RAG supported by VLMs
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
Generative Representational Instruction Tuning
Add a description, image, and links to the retrieval topic page so that developers can more easily learn about it.
To associate your repository with the retrieval topic, visit your repo's landing page and select "manage topics."