Confusions about REPLUG #40

Closed
richhh520 opened this issue Feb 21, 2024 · 2 comments

Comments

@richhh520

  1. How is the retriever of REPLUG trained? How are the query and document embeddings updated?
  2. In your replug_parallel_reader.ipynb, why do you directly import the default BM25Retriever rather than a trained retriever?
  3. What does PromptModel do, and what is ReplugHFLocalInvocationLayer for? This part does not seem to be mentioned in the paper.

Thanks for your help!

@danielfleischer
Contributor

Hi, thanks for the questions. Here we focused more on the LLM side of the REPLUG paper, where documents can be fed to the model in parallel. In general, documents can be retrieved with any method; to keep the notebook simple, we used an in-memory document store with BM25 retrieval and a re-ranker based on an off-the-shelf sentence transformer. We don't have training code for the embedders here; we might add it in the future.
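For reference, a minimal sketch of the kind of setup described above (in-memory BM25 retrieval followed by a sentence-transformer re-ranker) using the Haystack v1 API. The model name and documents are placeholders, not the exact ones used in the notebook.

```python
# Minimal sketch (not the notebook's exact code): in-memory BM25 retrieval
# followed by a cross-encoder re-ranker, using the Haystack v1 API.
from haystack import Pipeline
from haystack.document_stores import InMemoryDocumentStore
from haystack.nodes import BM25Retriever, SentenceTransformersRanker
from haystack.schema import Document

# Toy documents; in practice these come from your corpus.
document_store = InMemoryDocumentStore(use_bm25=True)
document_store.write_documents([
    Document(content="REPLUG feeds retrieved documents to a frozen LM in parallel."),
    Document(content="BM25 is a sparse lexical retrieval method."),
])

retriever = BM25Retriever(document_store=document_store, top_k=10)
# Off-the-shelf cross-encoder; the model name here is a placeholder choice.
ranker = SentenceTransformersRanker(
    model_name_or_path="cross-encoder/ms-marco-MiniLM-L-6-v2", top_k=5
)

pipeline = Pipeline()
pipeline.add_node(component=retriever, name="Retriever", inputs=["Query"])
pipeline.add_node(component=ranker, name="Ranker", inputs=["Retriever"])

result = pipeline.run(query="How does REPLUG use retrieved documents?")
print([d.content for d in result["documents"]])
```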

The PromptModel is an abstraction around LLMs: you provide a model name and it generates text from a prompt, abstracting away whether it is a local model, the hardware specifics, or even a cloud-based service. The API is part of Haystack v1, which we use for the development of our library.
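A short sketch of that pattern, assuming ReplugHFLocalInvocationLayer is the REPLUG-specific invocation layer plugged into PromptModel; the import path and model name below are assumptions for illustration, so adjust them to the actual module in this repository.

```python
# Sketch of the Haystack v1 PromptModel/PromptNode pattern. The fastRAG import
# path below is an assumption about where ReplugHFLocalInvocationLayer lives;
# adjust it to the actual module in this repository.
from haystack.nodes import PromptModel, PromptNode

from fastrag.prompters.invocation_layers import ReplugHFLocalInvocationLayer  # assumed path

# PromptModel hides whether the model runs locally, on specific hardware,
# or behind a cloud API; a custom invocation layer controls how it is called.
prompt_model = PromptModel(
    model_name_or_path="facebook/opt-350m",  # placeholder model name
    invocation_layer_class=ReplugHFLocalInvocationLayer,
)

prompt_node = PromptNode(model_name_or_path=prompt_model)
print(prompt_node("Answer the question given the documents: ..."))
```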

@kerkathy

Thanks for the work! I'm also looking forward to the implementation of the REPLUG training :)
