Modular RAG implementations: reranker, multi-retrievers #16

RobinQu · 2024-04-03T14:38:35Z

For better evaluation result in HF QA dataset.

Reranking: BCE, BGE-M3 scoring
Query rewrite
- to generate SQL filter for given prompt
- to generate hyperthecial queries

RobinQu · 2024-04-05T05:58:00Z

more complex rag pipeline may invole agent frameworks #18

https://github.com/langchain-ai/langgraph/blob/70c1c996a4c9fe8df518bcd849b3c6453dd0d58b/examples/rag/langgraph_adaptive_rag.ipynb

RobinQu · 2024-05-20T07:26:00Z

Rerank, Colbert, ...

Related work

Methods

Keyword based - BM25
Cross-encoders BCE reranker
Late-interaction - ColBERT, BGE-M3
- 2004.12832v2.pdf
- https://zhuanlan.zhihu.com/p/683483778

Opensourced Projects

Official COLBERT implementation: https://github.com/bclavie/RAGatouille
BCE family: https://github.com/netease-youdao/BCEmbedding
BAAI BGE family: https://github.com/FlagOpen/FlagEmbedding
BCE & BGE in C++: https://github.com/li-plus/chatglm.cpp

Conlcusion

Both BCE and BGE family can be regarded as SOTA.
For multilanguage use case, valillan RAGatouille lags behind.
As Late-interaction models are faster in inference. bge-m3 is prefered as first ranking methods in RAG pipeline.

RobinQu · 2024-05-21T01:55:38Z

RobinQu · 2024-05-27T10:26:35Z

OpenAI officials parameter for RAG: https://platform.openai.com/docs/assistants/tools/file-search/how-it-works

By default, the file_search tool uses the following settings:
Chunk size: 800 tokens
Chunk overlap: 400 tokens
Embedding model: text-embedding-3-large at 256 dimensions
Maximum number of chunks added to context: 20 (could be fewer)

Supported file formats: https://platform.openai.com/docs/assistants/tools/file-search/supported-files

RobinQu added this to the v0.1.2 milestone Apr 3, 2024

RobinQu mentioned this issue Apr 5, 2024

Research on agent archtecture and current implementations #15

Closed

14 tasks

RobinQu modified the milestones: v0.1.2, v0.1.3, v0.1.4 May 21, 2024

RobinQu mentioned this issue Jun 14, 2024

Limitations of mini-assistant #22

Open

RobinQu changed the title ~~Adanvanced RAG implementations~~ Modular RAG implementations: reranker, multi-retrievers Jun 14, 2024

RobinQu closed this as completed Jun 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modular RAG implementations: reranker, multi-retrievers #16

Modular RAG implementations: reranker, multi-retrievers #16

RobinQu commented Apr 3, 2024 •

edited

Loading

RobinQu commented Apr 5, 2024

RobinQu commented May 20, 2024 •

edited

Loading

RobinQu commented May 21, 2024 •

edited

Loading

RobinQu commented May 27, 2024

Modular RAG implementations: reranker, multi-retrievers #16

Modular RAG implementations: reranker, multi-retrievers #16

Comments

RobinQu commented Apr 3, 2024 • edited Loading

RobinQu commented Apr 5, 2024

RobinQu commented May 20, 2024 • edited Loading

Rerank, Colbert, ...

Related work

Methods

Opensourced Projects

Conlcusion

RobinQu commented May 21, 2024 • edited Loading

Timeline

RobinQu commented May 27, 2024

RobinQu commented Apr 3, 2024 •

edited

Loading

RobinQu commented May 20, 2024 •

edited

Loading

RobinQu commented May 21, 2024 •

edited

Loading