@miguelgrinberg
Collaborator

This PR includes a set of minor improvements and tunings to the RAG chatbot example:

  • The condensed question is used only to retrieve relevant documents from the ES index. The LLM then receives the original chat history and the user's original question, together with the retrieved documents as context. This prevents the user's questions from being diluted as more follow-up questions are entered.
  • A longer request timeout was added to the bulk importer, for robustness.
  • The source parser used in the frontend was expanded to handle a common case in which the LLM does not select any sources from the provided context.
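
The first change above can be illustrated with a minimal sketch. This is not the code from this PR: `ask`, `build_answer_prompt`, and the `condense`, `retrieve`, and `generate` callables are hypothetical names standing in for the LLM condensing call, the Elasticsearch query, and the final LLM call. The only point is that the condensed question feeds retrieval, while the answer prompt is built from the untouched history and question.

```python
from typing import Callable, List, Tuple

Message = Tuple[str, str]  # (role, text), e.g. ("user", "What is the PTO policy?")


def build_answer_prompt(history: List[Message], question: str, docs: List[str]) -> str:
    """Assemble the answer prompt from retrieved context and the ORIGINAL conversation."""
    context = "\n".join(f"- {doc}" for doc in docs)
    turns = "\n".join(f"{role}: {text}" for role, text in history)
    return f"Context:\n{context}\n\nChat history:\n{turns}\n\nQuestion: {question}"


def ask(
    question: str,
    history: List[Message],
    condense: Callable[[List[Message], str], str],  # hypothetical: LLM call that rewrites the question
    retrieve: Callable[[str], List[str]],           # hypothetical: search against the document index
    generate: Callable[[str], str],                 # hypothetical: LLM call that writes the answer
) -> str:
    # The condensed question is used for retrieval only. With no history
    # there is nothing to condense, so the question is searched as-is.
    search_query = condense(history, question) if history else question
    docs = retrieve(search_query)

    # The answering step never sees the condensed question: it gets the
    # original history and the original question plus the retrieved context.
    return generate(build_answer_prompt(history, question, docs))
```

Keeping the three steps as injected callables is a choice made for this sketch so that it runs without an Elasticsearch cluster or an LLM; the example application may wire these steps differently.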

@miguelgrinberg miguelgrinberg merged commit d4f3ca4 into elastic:main Feb 9, 2024
@miguelgrinberg miguelgrinberg deleted the chatbot-tuning branch February 9, 2024 14:36
