Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation Paper • 2605.12492 • Published 26 days ago • 6
Measuring Maximum Activations in Open Large Language Models Paper • 2605.15572 • Published 23 days ago • 18