π Release v0.4.0
β¨ New Features
- CURPress - Value-Guided KV Compression for LLMs via Approximated CUR Decomposition (#150)
- CompactorPress - Compactor: Calibrated Query-Agnostic KV Cache Compression with Approximate Leverage Scores (#143)
- Decoding Press Functionality - Support for KV cache compression during the decoding phase (#139)
- AIME25 & Math500 Benchmarks - New evaluation datasets for mathematical reasoning tasks (#142)
post_init_from_model Hook - Add model-specific initialization support in BasePress (#163)
π Improvements
- Moved tests to GPU for faster CI execution (#132)
- Improved needle-in-haystack test coverage (#133)
- Updated README and documentation for clarity (#162)
- Enhanced docstrings throughout the codebase (#159)
- Updated decoding notebook with latest examples (#156)
- Code cleanup: moved utilities, cleaned imports (#160)
π Bug Fixes
- Fixed LongBench-v2 benchmark evaluation (#161)
- Fixed kvzip press access to
past_key_values
- Fixed ComposedPress behavior (#148)
- Fixed import issues (#144)
π¦ Installation
pip install kvpress==0.4.0
π Full Changelog
v0.3.0...v0.4.0