✨ New Features
KVzapPress - a fast approximation of KVzip for prefill and decoding compression (https://arxiv.org/abs/2601.07891). Comes with KVzap training and evaluation utilities (#171)
ThresholdPress - adaptive compression using score thresholds instead of fixed compression ratios (#171)
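To illustrate the difference between a fixed compression ratio and a score threshold, here is a minimal, library-independent sketch. It is not the kvpress API: the function names `keep_by_ratio` and `keep_by_threshold`, the example scores, and the threshold value are all hypothetical, and the actual ThresholdPress implementation may score and select entries differently.

```python
from typing import List


def keep_by_ratio(scores: List[float], compression_ratio: float) -> List[int]:
    """Fixed-ratio pruning: always keep the top (1 - ratio) fraction of positions."""
    n_kept = int(len(scores) * (1 - compression_ratio))
    # Rank positions by score, highest first, then keep the first n_kept.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:n_kept])


def keep_by_threshold(scores: List[float], threshold: float) -> List[int]:
    """Threshold pruning: keep every position whose score meets the threshold."""
    return [i for i, s in enumerate(scores) if s >= threshold]


# Hypothetical importance scores for two inputs of equal length.
scores_a = [0.9, 0.8, 0.1, 0.7, 0.05, 0.6]    # many important positions
scores_b = [0.9, 0.02, 0.1, 0.03, 0.05, 0.01]  # one important position

# A fixed ratio keeps the same number of positions for both inputs.
ratio_a = keep_by_ratio(scores_a, compression_ratio=0.5)
ratio_b = keep_by_ratio(scores_b, compression_ratio=0.5)

# A threshold adapts: the number of kept positions depends on the scores.
thresh_a = keep_by_threshold(scores_a, threshold=0.5)
thresh_b = keep_by_threshold(scores_b, threshold=0.5)

print(len(ratio_a), len(ratio_b))    # same count for both inputs
print(len(thresh_a), len(thresh_b))  # count varies with the input
```

With a fixed ratio, both inputs lose half their positions regardless of content. With a threshold, the effective compression ratio becomes a consequence of the score distribution: an input with few high-scoring positions is compressed more aggressively than one with many.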
📈 Improvements
Update KVzipPress with improvements and evaluation registry support (#172)
Rename compress-question to query-aware in evaluation config (#168)
Refactor ObservedAttentionPress for cleaner implementation (#166)