WildCat: Near-Linear Attention in Theory and Practice

arXiv CS Tuesday 02 June 2026, 04:00 UTC By Tobias Schr\"oder, Lester Mackey 1 min read

Key Points

arXiv:2602.10056v2 Announce Type: replace Abstract: We introduce WildCat, a high-accuracy, low-cost approach to compressing the attention mechanism in neural networks. While attention is a staple of modern network architectures, it is also notoriously expensive to deploy due to resource requirements that scale quadratically with the input sequence length $n$. WildCat avoids these quadratic costs by only attending over a small weighted coreset. Crucially, we select the coreset using a fast but spectrally-accurate subsampling algorithm -- randomly pivoted Cholesky -- and weight the elements optimally to minimise reconstruction error. Remarkably, given bounded inputs, WildCat approximates exact attention with super-polynomial $O(n^{-\sqrt{\log(\log(n))}})$ error decay while running in near-linear $O(n^{1+o(1)})$ time. In contrast, prior practical approximations either lack error guarantees or require quadratic runtime to guarantee such high fidelity. We couple this advance with a GPU-optimized PyTorch implementation and a suite of benchmark experiments demonstrating the benefits of WildCat for image generation, image classification, and language model KV cache compression.

WildCat (ORG) Cholesky (PERSON) GPU (ORG) PyTorch (ORG) KV (ORG)

Originally published by arXiv CS Read original →

Waymo has a lot of experience building virtual systems to help its autonomous vehicles better understand the real world. It built realistic 3D worlds to better anticipate natural disasters and unpredictable edge cases. It created a virtual representation of a hyperattentive driver to test against its own autonomous vehicles in a series of simulated scenarios to see which is better at crash avoidance.

The Verge 18m ago

Rare tiger cub from litter of four dies

These Sumatran tigers are the first of this breed to be welcomed at the animal park in Kent.

BBC Science 49m ago

The SpaceX IPO could lead to 8% of America’s current-account deficit being refinanced in a single day

A remarkable back-of-the-envelope calculation from a currency strategist shows just how big SpaceX’s initial public offering could reverberate in global markets.

MarketWatch 1h ago

'Don’t give parents more to do to keep kids safe online - they need help, not homework'

'Don’t give parents more to do to keep kids safe online - they need help, not homework' "Parents have said they need more support with online safety, but a ban for under 16s plus plans to issue guidance might not be the help we need" Parents who said they want more help keeping their kids safe online might regret asking what they wished for. Because it sounds like we are about to get a whole lot more homework without any of the real support families and young people need. In an interview...

Daily Mirror 1h ago

WildCat: Near-Linear Attention in Theory and Practice

Related Stories

Waymo built a virtual driver to study how humans react to surprises on the road

Rare tiger cub from litter of four dies

The SpaceX IPO could lead to 8% of America’s current-account deficit being refinanced in a single day

'Don’t give parents more to do to keep kids safe online - they need help, not homework'