Home Knowledge Base SuperActivator

SuperActivator

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

The SuperActivator Mechanism: Transformers Concentrate Reliable Concept Signals in the Tail

Announce Type: replace Abstract: Concept vectors aim to enhance model interpretability by linking internal representations with human-understandable semantics, but their practical utility is often limited by noisy and inconsistent activations. In this work, we uncover the SuperActivator Mechanism: a transformer dynamic that amplifies concept activation gaps, concentrating the most reliable concept evidence into a small set of high-activation tokens. To develop a theoretical understanding of...

arXiv CS 9d ago