Self-evaluating RAG benchmark over SEC 10-K filings. 12 configurations evaluated with RAGAS, MLflow, and a hybrid retriever with cross-encoder reranking.
openai bm25 sec-filings rag mlflow llm hybrid-retrieval chromadb semantic-chunking ragas sentence-transformers retrieval-benchmarking
Updated May 18, 2026 - Python