A benchmark for outdated retrieval in LLM memory under temporal drift, with a recency-reranking baseline.
benchmark information-retrieval evaluation reranking rag vector-search llm retrieval-augmented-generation llm-memory stale-data agent-memory temporal-rag
-
Updated
Jun 18, 2026 - Python