Skip to content
View AneuYy's full-sized avatar

Block or report AneuYy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Embeddings

24 repositories

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,643 534 Updated Oct 16, 2024

State-of-the-Art Text Embeddings

Python 18,367 2,756 Updated Mar 6, 2026

Retrieval and Retrieval-augmented LLMs

Python 11,373 840 Updated Dec 15, 2025

Open-source search and retrieval database for AI applications.

Rust 26,525 2,092 Updated Mar 10, 2026

AI + Data, online. https://vespa.ai

Java 6,815 700 Updated Mar 10, 2026

Ecommerce Search and Discovery - marqo.ai

Python 5,019 229 Updated Mar 9, 2026

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 29,444 2,081 Updated Mar 10, 2026

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

HTML 9,364 781 Updated Mar 9, 2026

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 43,278 3,880 Updated Mar 9, 2026

LlamaIndex is the leading document agent and OCR platform

Python 47,519 6,943 Updated Mar 10, 2026

A library for efficient similarity search and clustering of dense vectors.

C++ 39,310 4,272 Updated Mar 9, 2026

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 15,770 1,206 Updated Mar 9, 2026

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Python 2,023 156 Updated Jan 15, 2025

A blazing fast inference solution for text embeddings models

Rust 4,573 370 Updated Mar 6, 2026

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

C++ 4,426 416 Updated Mar 9, 2026

Netease Youdao's open-source embedding and reranker models for RAG products.

Python 1,867 129 Updated Sep 9, 2025

Nomic Developer API SDK

Python 1,872 198 Updated Nov 11, 2025

Apache Cassandra®

Java 9,654 3,848 Updated Mar 5, 2026

Vald. A Highly Scalable Distributed Vector Search Engine

Go 1,688 92 Updated Mar 9, 2026

Open-source vector similarity search for Postgres

C 20,176 1,088 Updated Feb 26, 2026

MTEB: Massive Text Embedding Benchmark

Python 3,154 568 Updated Mar 9, 2026

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 17,503 1,402 Updated Feb 8, 2026

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Python 2,706 181 Updated Feb 5, 2026

A lightweight, lightning-fast, in-process vector database

C++ 8,788 493 Updated Mar 9, 2026