noahamsel (@noahamsel) 's Twitter Profile
noahamsel

@noahamsel

ID: 1677141799968681984

calendar_today07-07-2023 02:28:51

2 Tweet

19 Followers

67 Following

Robert M. Gower πŸ‡ΊπŸ‡¦ (@gowerrobert) 's Twitter Profile Photo

Are you interested in the new Muon/Scion/Gluon method for training LLMs? To run Muon, you need to approximate the matrix sign (or polar factor) of the momentum matrix. We've developed an optimal method *The PolarExpress* just for this! If you're interested, climb aboard 1/x

Are you interested in the new Muon/Scion/Gluon method for training LLMs? 
To run Muon, you need to approximate the matrix sign (or polar factor) of the momentum matrix. We've developed an optimal method *The PolarExpress* just for this! If you're interested, climb aboard 1/x
noahamsel (@noahamsel) 's Twitter Profile Photo

How can classical numerical analysis help train deep nets faster? Climb aboard the Polar Express to find out... arxiv.org/abs/2505.16932 joint with @davpersson Robert M. Gower πŸ‡ΊπŸ‡¦ + Chris Musco

How can classical numerical analysis help train deep nets faster? Climb aboard the Polar Express to find out...  arxiv.org/abs/2505.16932
joint with @davpersson <a href="/gowerrobert/">Robert M. Gower πŸ‡ΊπŸ‡¦</a> + Chris Musco