Reparameterized Orthogonal Equivalence Training

Related Articles from SNS

POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation

arXiv:2603.05500v2. Abstract: Efficient and stable training of large language models (LLMs) remains a core challenge in modern machine learning systems. To address this challenge, Reparameterized Orthogonal Equivalence Training (POET), a spectrum-preserving framework that optimizes each weight matrix through orthogonal equivalence transformation, has been proposed. Although POET provides strong training stability, its original implementation incurs high memory...

arXiv CS 1d ago
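
The abstract above calls POET "spectrum-preserving" because it optimizes each weight matrix through an orthogonal equivalence transformation. The following is a minimal sketch of that property only, assuming the transformation has the form W = R W0 P with R and P orthogonal. The names `w0`, `r`, `p`, `random_orthogonal`, and `orthogonal_equivalence` are hypothetical and chosen for illustration; the sketch does not reproduce the training procedure or the memory optimizations of POET-X.

```python
import numpy as np


def random_orthogonal(n, rng):
    """Return an n x n orthogonal matrix from the QR factorization of a Gaussian matrix."""
    q, r = np.linalg.qr(rng.standard_normal((n, n)))
    # Flip column signs so the result does not depend on the QR sign convention.
    return q * np.sign(np.diag(r))


def orthogonal_equivalence(w0, r, p):
    """Apply the assumed transformation W = R @ W0 @ P (illustrative form)."""
    return r @ w0 @ p


rng = np.random.default_rng(0)
w0 = rng.standard_normal((6, 4))   # stand-in for a fixed initial weight matrix
r = random_orthogonal(6, rng)      # left orthogonal factor
p = random_orthogonal(4, rng)      # right orthogonal factor
w = orthogonal_equivalence(w0, r, p)

# Orthogonal factors change the singular vectors but not the singular values.
sv_before = np.linalg.svd(w0, compute_uv=False)
sv_after = np.linalg.svd(w, compute_uv=False)
spectrum_preserved = np.allclose(sv_before, sv_after)
print(spectrum_preserved)
```

Because only R and P would be learned in this picture, the singular values of the weight matrix stay fixed at those of its initialization, which is one plausible reading of why the abstract associates the method with training stability.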