Skip to content
View oroszgy's full-sized avatar
:octocat:
:octocat:

Organizations

@ec-doris @huspacy

Block or report oroszgy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

RAG

35 repositories

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 38,953 3,715 Updated Jul 9, 2025

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Python 4,320 363 Updated Feb 23, 2026

Unified framework for building enterprise RAG pipelines with small, specialized models

Python 14,849 2,964 Updated Feb 21, 2026

LlamaIndex is the leading document agent and OCR platform

Python 47,181 6,861 Updated Feb 24, 2026

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Python 2,680 179 Updated Feb 5, 2026

Efficient Retrieval Augmentation and Generation Framework

Python 1,769 165 Updated Jan 12, 2026

The LLM Evaluation Framework

Python 13,800 1,258 Updated Feb 23, 2026

Automated Evaluation of RAG Systems

Python 693 68 Updated Mar 28, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 73,681 8,184 Updated Feb 25, 2026

An open-source RAG-based tool for chatting with your documents.

Python 25,163 2,105 Updated Jul 4, 2025

Production-ready platform for agentic workflow development.

TypeScript 130,289 20,303 Updated Feb 25, 2026

Deploy langgenious/dify, an LLM based app on kubernetes with helm chart.

Go Template 614 173 Updated Feb 13, 2026

Fast State-of-the-Art Static Embeddings

Python 2,003 116 Updated Feb 13, 2026

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 28,675 4,098 Updated Feb 23, 2026

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & C…

Python 946 67 Updated Jan 1, 2026

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

Python 12,215 783 Updated Feb 24, 2026

SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.

Python 7,700 626 Updated Nov 7, 2025

This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and i…

Python 472 91 Updated Dec 13, 2025
Python 71 5 Updated Feb 5, 2025

Everything you need to know to build your own RAG application

Jupyter Notebook 4,034 478 Updated Nov 22, 2025

This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.

Python 160 21 Updated May 30, 2025

[NeurIPS'24] UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis

Jupyter Notebook 42 3 Updated Feb 21, 2025
Python 55 7 Updated Mar 11, 2025

Benchmarking library for RAG

Jupyter Notebook 260 31 Updated Feb 15, 2026

Train an adapter for any embedding model in under a minute

Python 129 7 Updated Apr 9, 2025

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Python 979 94 Updated May 3, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 25,603 3,007 Updated Feb 17, 2026

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 7,333 415 Updated Feb 21, 2025

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 5,695 602 Updated Feb 25, 2026