Closure-Validated Circuit Discovery in Attention Heads: Co-activation Proposes, Ablation Disposes

arXiv CS Tuesday 09 June 2026, 04:00 UTC By Yongzhong Xu 1 min read

Key Points

arXiv:2606.09607v1 Announce Type: new Abstract: Interpretability increasingly treats groups of components, not individual units, as the basic object, and proposes to find them by clustering co-activation statistics. We ask whether such a cheap signal actually identifies an attention-head circuit. Adapting a sparse-autoencoder clustering recipe to attention heads -- but validating by causal ablation rather than reconstruction -- we cluster heads and then run a closure test: ablate the discovered community and compare per-example damage to matched-random controls. Across two dense 1B-scale models (Pythia 1B, OLMo 1B) and two input distributions, the communities pass closure. In a Mixture-of-Experts model (OLMoE-1B-7B), route-conditional clustering recovers a statistically real signal that nonetheless does not survive closure -- ablation improves loss, the wrong direction. Extending closure across training, attention-target selectivity and participation ratio decouple from function in both directions. We conclude that a cheap signal is a circuit proposal, not a confirmed circuit; closure is what separates them.

Closure-Validated Circuit Discovery in Attention (ORG)

Originally published by arXiv CS Read original →

Genetically modified worms can now produce and deliver drugs inside a living body, scientists say In a proof-of-concept lab experiment, scientists demonstrated that intestinal parasites could make and release therapeutic agents inside a living host. Scientists genetically tweaked a tiny, worm-like parasite to produce a life-saving antitoxin from inside a living host. In a first-of-its-kind study, researchers modified the hookworm Ancylostoma ceylanicum so that it produces antibodies that...

Live Science 47m ago

Indonesia Landslides Devastated Endangered Orangutans, Study Finds

More than 5 percent of the species is estimated to have been lost when a climate-fueled storm unleashed torrents of water, mud and debris.

NYT Science 56m ago

Mysterious 'cold blob' in the Atlantic is a sign of the Gulf Stream weakening — and that's bad news for the US East Coast

Mysterious 'cold blob' in the Atlantic is a sign of the Gulf Stream weakening — and that's bad news for the US East Coast The Atlantic's enigmatic "cold blob" has once again been linked to a weakening of key ocean currents and a devastating climate tipping point. A mysterious "cold blob" in the Atlantic Ocean is a sign that key ocean currents are weakening, a new study has found, with potentially devastating long-term impacts on our climate and weather. The cold blob, or North Atlantic...

Live Science 1h ago

Neuroscientist reveals the one 'superfood' he eats every single day to slow down ageing

Neuroscientist reveals the one 'superfood' he eats every single day to slow down ageing Neuroscientist Dr David Cox has spoken about how what we eat influences how we age while revealing the one 'superfood' he consumes daily to be as healthy as possible A neuroscientist and health journalist has revealed the one 'superfood' he eats every single day to slow down the ageing process. Dr David Cox, who is the author of The Age Code, made the comments on Tonight on ITV. The documentary looked at...

Daily Mirror 1h ago

Closure-Validated Circuit Discovery in Attention Heads: Co-activation Proposes, Ablation Disposes

Related Stories

Genetically modified worms can now produce and deliver drugs inside a living body, scientists say

Indonesia Landslides Devastated Endangered Orangutans, Study Finds

Mysterious 'cold blob' in the Atlantic is a sign of the Gulf Stream weakening — and that's bad news for the US East Coast

Neuroscientist reveals the one 'superfood' he eats every single day to slow down ageing