@l-bat l-bat commented Oct 22, 2025

Changes

  • Integrated the Adaptive R-KV Token Eviction algorithm, which computes token importance within a small sliding window of the most recent queries during both the prefill and decode stages.
  • Added refined selection methods (KVCrush and DiverseKV) to improve token diversity and reduce redundancy.
  • Added a reasoning benchmark covering the MATH500 and GSM8K datasets.
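
For reviewers unfamiliar with window-based eviction, the scoring idea in the first bullet can be sketched roughly as follows. This is an illustrative sketch only, not the code in this PR: the function names `window_importance` and `select_tokens`, the `window` and `budget` parameters, the single-head setup, the absence of a causal mask, and the choice to always retain the recent window are all assumptions made for the example. The KVCrush and DiverseKV refinements are not modeled here.

```python
import math

def softmax(row):
    # Numerically stable softmax over a list of floats.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def window_importance(queries, keys, window):
    # Average attention each cached token receives from the
    # last `window` queries (single head, no causal mask: assumptions).
    recent = queries[-window:]
    scale = 1.0 / math.sqrt(len(keys[0]))
    scores = [0.0] * len(keys)
    for q in recent:
        logits = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        for i, p in enumerate(softmax(logits)):
            scores[i] += p / len(recent)
    return scores

def select_tokens(queries, keys, window, budget):
    # Keep the recent window unconditionally (assumption), then fill the
    # remaining budget with the highest-scoring older tokens.
    n = len(keys)
    recent_ids = list(range(max(0, n - window), n))
    keep_old = max(0, budget - len(recent_ids))
    scores = window_importance(queries, keys, window)
    older = sorted(range(n - len(recent_ids)), key=lambda i: scores[i], reverse=True)
    return sorted(older[:keep_old] + recent_ids)

# Toy data: 5 cached tokens, head dimension 2 (hypothetical values).
keys = [[0.0, 1.0], [5.0, 0.0], [0.0, 1.0], [0.0, 1.0], [1.0, 0.0]]
queries = [[0.0, 0.0]] * 4 + [[1.0, 0.0]]
kept = select_tokens(queries, keys, window=1, budget=2)
print(kept)
```

In this toy case the single recent query aligns with the key at index 1, so that token receives the highest score among the older tokens and is retained alongside the most recent one. A real implementation would score per attention head and apply this at both the prefill and decode stages, as the description states.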

Related tickets

169957

@l-bat l-bat requested a review from a team as a code owner October 22, 2025 14:43
@github-actions github-actions bot added the dependencies Pull requests that update a dependency file label Oct 22, 2025
@github-actions github-actions bot added the category: custom operations OpenVINO Runtime Extension with custom operations label Nov 7, 2025
l-bat commented Nov 7, 2025

@openvinotoolkit/openvino_contrib-maintainers please take a look at the custom_operations module tests; the CI appears to be broken.


Labels

category: custom operations (OpenVINO Runtime Extension with custom operations)
dependencies (Pull requests that update a dependency file)