Skip to content

RAG with Kernel Memory plugin always uses two LLM calls? #6903

Answered by dluc
chaelli asked this question in Q&A
Discussion options

You must be logged in to vote

@chaelli the first call is about intent detection, do you know which code is making that first request? About KM, the ASK API uses these 2 requests:

  1. generate embedding for the question, and use the embedding to find relevant sources
  2. generate an answer using the relevant sources found

Replies: 3 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Answer selected by sophialagerkranspandey
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@dluc
Comment options

@chaelli
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants