Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Modular RAG implementations: reranker, multi-retrievers #16

Closed
RobinQu opened this issue Apr 3, 2024 · 4 comments
Closed

Modular RAG implementations: reranker, multi-retrievers #16

RobinQu opened this issue Apr 3, 2024 · 4 comments
Milestone

Comments

@RobinQu
Copy link
Owner

RobinQu commented Apr 3, 2024

For better evaluation result in HF QA dataset.

  • Reranking: BCE, BGE-M3 scoring
  • Query rewrite
    • to generate SQL filter for given prompt
    • to generate hyperthecial queries
@RobinQu RobinQu added this to the v0.1.2 milestone Apr 3, 2024
@RobinQu
Copy link
Owner Author

RobinQu commented Apr 5, 2024

@RobinQu
Copy link
Owner Author

RobinQu commented May 20, 2024

Rerank, Colbert, ...

Related work

Methods

Opensourced Projects

image

image

image

Conlcusion

  • Both BCE and BGE family can be regarded as SOTA.
  • For multilanguage use case, valillan RAGatouille lags behind.
  • As Late-interaction models are faster in inference. bge-m3 is prefered as first ranking methods in RAG pipeline.

@RobinQu RobinQu modified the milestones: v0.1.2, v0.1.3, v0.1.4 May 21, 2024
@RobinQu
Copy link
Owner Author

RobinQu commented May 21, 2024

Timeline

  • BGE M3 in GGUF format #21
    • experiemntes with BERT + BGE M3 model for better understanding of forward pass, 3 days , 1 day.
    • bge-m3 inference code. A standalone repo may be needed. 7 days, 3 days.
  • First version of file-search tool for assistant-api #20
    • file and filebatch controllers, services, mappers. 3 days.
    • duckdb vectordb operator, retriever factory, 2 days
    • DuckDB: search with filter, delete with filter, 1 day
    • summarization pipeline for files, 2 days
    • file object handler, 2 days
    • file batch background jobs to update its progress, 1 day
    • search tool 3 days.
    • (optional) parallel react agent executor, 2 days
    • integration test 3 days.
    • eval test HF QA datast. 3 days.

@RobinQu
Copy link
Owner Author

RobinQu commented May 27, 2024

OpenAI officials parameter for RAG: https://platform.openai.com/docs/assistants/tools/file-search/how-it-works

By default, the file_search tool uses the following settings:
Chunk size: 800 tokens
Chunk overlap: 400 tokens
Embedding model: text-embedding-3-large at 256 dimensions
Maximum number of chunks added to context: 20 (could be fewer)

Supported file formats: https://platform.openai.com/docs/assistants/tools/file-search/supported-files

Image

@RobinQu RobinQu changed the title Adanvanced RAG implementations Modular RAG implementations: reranker, multi-retrievers Jun 14, 2024
@RobinQu RobinQu closed this as completed Jun 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

1 participant