LU-KV

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Predicting Future Utility: Global Combinatorial Optimization for Task-Agnostic KV Cache Eviction

arXiv:2602.08585v2

Abstract: Given the quadratic complexity of attention, KV cache eviction is vital to accelerate model inference. Current KV cache eviction methods typically rely on instantaneous heuristic metrics, implicitly assuming that score magnitudes are consistent proxies for importance across all heads.
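To make the assumption named in the abstract concrete, here is a toy sketch of a score-based eviction heuristic. It is not the paper's method, nor any specific published baseline. The function name `evict_global_topk`, the score values, and the budget are all hypothetical. The sketch pools raw per-head scores and keeps the globally highest ones, which treats score magnitude as directly comparable across heads.

```python
from typing import Dict, List


def evict_global_topk(scores: List[List[float]], budget: int) -> Dict[int, List[int]]:
    """Keep the `budget` cached (head, token) entries with the highest raw scores.

    scores[h][t] is a hypothetical importance score (for example, an attention
    weight) for cached token t in head h. Pooling all heads together assumes
    that magnitudes mean the same thing in every head.
    """
    # Flatten to (score, head, token) so entries from all heads compete directly.
    pooled = [(s, h, t) for h, head in enumerate(scores) for t, s in enumerate(head)]
    pooled.sort(reverse=True)

    kept: Dict[int, List[int]] = {h: [] for h in range(len(scores))}
    for _, h, t in pooled[:budget]:
        kept[h].append(t)
    for h in kept:
        kept[h].sort()
    return kept


# Made-up scores for 2 heads over 5 cached tokens.
# Head 0 has a peaked distribution; head 1 has uniformly small scores.
scores = [
    [0.50, 0.05, 0.30, 0.10, 0.05],
    [0.02, 0.01, 0.03, 0.02, 0.01],
]
kept = evict_global_topk(scores, budget=3)
print(kept)
```

In this made-up example every retained entry comes from head 0 and head 1 keeps nothing, even though its small scores may still matter for that head. This is the kind of cross-head magnitude assumption the abstract calls out.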

arXiv CS 8d ago