Home Knowledge Base the Linear Representation and Superposition Hypotheses

the Linear Representation and Superposition Hypotheses

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Representational Capacity: Geometric Limits on Feature Representation in Transformer Language Models

Announce Type: new Abstract: Model dimension ($d_{model}$) is a fundamental hyperparameter in transformer language models, yet its role in setting the geometric limits of feature representation remains under-explored. Grounded in the Linear Representation and Superposition Hypotheses - which propose that models encode features as near-orthogonal directions in latent space - we develop a framework for estimating how many such directions a model can support. We first establish the embedding...

arXiv CS 7d ago