-
Notifications
You must be signed in to change notification settings - Fork 99
Open
Description
Hi
Very cool framework, and I like this direction a lot!
Just want to point out that in KBLaM, we considered a similiar approach for augmenting external memory into pre-trained LLM:
https://arxiv.org/abs/2410.10450
where we concat the KV of compressed external knowledge in front of the input text's QKV for knowledge augmentation (using attention as the retrieval mechanism).
There are also some follow-up works that try to further imporve the scalability, e.g. https://arxiv.org/abs/2510.17934
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels