Chatbot improvements #188

miguelgrinberg · 2024-02-08T11:29:32Z

This PR includes a set of minor improvements and tunings to the RAG chatbot example:

The condensed question is used only to retrieve relevant documents from the ES index. Then the LLM receives the original chat history and question from the user with this context. This prevents user questions from diluting as more follow up questions are entered.
A longer request timeout was added to the bulk importer, for robustness.
The source parser used in the frontend was expanded to handle a common case in which the LLM does not select any sources from the provided context.

Chatbot improvements

4282906

miguelgrinberg requested a review from joemcelroy February 8, 2024 11:29

joemcelroy approved these changes Feb 9, 2024

View reviewed changes

miguelgrinberg merged commit d4f3ca4 into elastic:main Feb 9, 2024

miguelgrinberg deleted the chatbot-tuning branch February 9, 2024 14:36

Provide feedback