Wrote a tutorial on loading agent tools on demand using local ONNX embeddings #6373
anktsrkr
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Had ~80 tools in an agent and the model kept picking the wrong ones or just ignoring them. turned out I was burning most of the context window just on tool definitions before any actual conversation.
Built a "RAG for tools" pattern to fix it — agent starts with one
SearchToolsfunction, calls it when it needs something, local embedding model finds the relevant tools and loads them into the session on the fly. Runs offline usingall-MiniLM-L6-v2via ONNX Runtime withFusionCacheso there's no inference overhead on repeat queries.Full writeup: https://microsoft-agent-framework.github.io/learn/agent-capabilities/semantic-tool-search/
Beta Was this translation helpful? Give feedback.
All reactions