Skip to content

Tags: NVIDIA/kvpress

Tags

v0.4.0

Toggle v0.4.0's commit message
Release v0.4.0

- Add CURPress - CUR decomposition-based KV cache compression (#150)
- Add Compactor press for enhanced compression capabilities (#143)
- Add decoding press functionality for compression during decoding (#139)
- Add AIME25 and Math500 benchmark datasets for evaluation (#142)
- Add post_init_from_model hook to BasePress for model-specific initialization (#163)

- Move tests to GPU for faster CI (#132)
- Improve needle-in-haystack test (#133)
- Update README and documentation (#162)
- Update docstrings (#159)
- Update decoding notebook (#156)
- Move utils, clean and fix imports (#160)

- Fix LongBench-v2 benchmark (#161)
- Fix kvzip press access to past_key_values
- Fix ComposedPress (#148)
- Fix imports (#144)

v0.3.0

Toggle v0.3.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Transformers compatibility (#115)

v0.2.10

Toggle v0.2.10's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Migration to uv (#108)

v0.2.9

Toggle v0.2.9's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Fix trandformers<4.54.0

v0.2.8

Toggle v0.2.8's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Fix failing tests (#94)

v0.2.7

Toggle v0.2.7's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
add Alessio to authors (#92)

v0.2.6

Toggle v0.2.6's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Support Qwen3 and Gemma3 (#81)

v0.2.5

Toggle v0.2.5's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add FinchPress (#69)

v0.2.4

Toggle v0.2.4's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add QFilterPress (#54)

v0.2.3

Toggle v0.2.3's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add DuoAttentionPress (#50)

* Add DuoAttentionPress

* Fix tests and compression_ratio

* Address feedback

* Update plot

* Update version