v0.4.1

@SimJeg released this 14 Jan 09:22
50c2ae5

✨ New Features

  • KVzapPress - a fast approximation of KVzip for prefill and decoding compression (https://arxiv.org/abs/2601.07891). Comes with KVzap training and evaluation utilities (#171)
  • ThresholdPress - adaptive compression using score thresholds instead of fixed compression ratios (#171)
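To illustrate the difference between a fixed compression ratio and a score threshold, here is a minimal sketch in plain Python. It is not the kvpress API: the function names `keep_by_ratio` and `keep_by_threshold`, the toy scores, and the threshold value are all hypothetical, and the actual ThresholdPress implementation may score and prune entries differently.

```python
from typing import List


def keep_by_ratio(scores: List[float], compression_ratio: float) -> List[int]:
    """Hypothetical helper: keep a fixed fraction of positions, highest scores first."""
    n_keep = int(len(scores) * (1 - compression_ratio))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:n_keep])


def keep_by_threshold(scores: List[float], threshold: float) -> List[int]:
    """Hypothetical helper: keep every position whose score reaches the threshold."""
    return [i for i, s in enumerate(scores) if s >= threshold]


# Toy importance scores for six cached positions (made-up values).
peaked = [0.9, 0.05, 0.02, 0.01, 0.01, 0.01]  # one position dominates
flat = [0.5, 0.45, 0.55, 0.5, 0.48, 0.52]     # all positions matter similarly

# A fixed ratio always keeps the same number of positions.
ratio_peaked = keep_by_ratio(peaked, compression_ratio=0.5)
ratio_flat = keep_by_ratio(flat, compression_ratio=0.5)

# A threshold adapts: it prunes aggressively when scores are peaked
# and keeps more when many positions score highly.
thr_peaked = keep_by_threshold(peaked, threshold=0.1)
thr_flat = keep_by_threshold(flat, threshold=0.1)

print(len(ratio_peaked), len(ratio_flat))  # same count for both inputs
print(len(thr_peaked), len(thr_flat))      # count depends on the scores
```

With a threshold, the effective compression ratio becomes an outcome of the score distribution instead of a parameter chosen in advance.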

📈 Improvements

  • Update KVzipPress with improvements and evaluation registry support (#172)
  • Rename compress-question to query-aware in evaluation config (#168)
  • Refactor ObservedAttentionPress for cleaner implementation (#166)
  • Add leaderboard generation script (#171)

🐛 Bug Fixes

  • Fix empty context handling in pipeline (#165)