Shiqian Ma
@shiqianma
Professor@Rice University. PhD from Columbia IEOR. Work on optimization and machine learning.
ID: 1377686198576029697
http://sqma.rice.edu 01-04-2021 18:16:31
251 Tweets
1.1K Followers
1.1K Following
When training a DNN model (especially LLMs), we often clip the gradients. Why is that? Something to do with exploding gradients? We (Fabian Schaipp, Guillaume Garrigos, Umut U. Simsekli) have a story to tell on how clipping is computing the geometric median of gradients. arxiv.org/pdf/2402.12828…
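For readers unfamiliar with the operation this tweet refers to, here is a minimal sketch of gradient clipping by Euclidean norm. It is only an illustration of the standard clipping step, not the method from the linked paper; the names `clip_by_norm` and `max_norm` are hypothetical and chosen for this example.

```python
import math


def clip_by_norm(grad, max_norm):
    """Return a copy of grad rescaled so its Euclidean norm is at most max_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    # Gradients already inside the ball of radius max_norm are left unchanged.
    if norm <= max_norm or norm == 0.0:
        return list(grad)
    # Otherwise keep the direction and shrink the length to max_norm.
    scale = max_norm / norm
    return [g * scale for g in grad]


# Illustrative values (assumed, not taken from the paper).
clipped = clip_by_norm([3.0, 4.0], 1.0)  # norm 5 is shrunk to norm 1
small = clip_by_norm([0.3, 0.4], 1.0)    # norm 0.5 is left as-is
```

Clipping keeps the direction of a large gradient but caps its length, so a single outlier gradient can move the iterate by at most a bounded amount.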
Happy to have this paper on 1-sample zeroth-order Riemannian opt accepted in SIAM Journal on Optimization: arxiv.org/abs/2309.14506. Includes generic approx. bounds between costly 'parallel transports' and cheaper 'vector transports'. Joint work with Jiaxiang Li and Shiqian Ma.
The website VFS Global @vfsglobalcare for booking France visa appointments is not functioning properly; this time, I can't receive the OTP.... Over the past four months, I've encountered numerous issues with this website. This is the worst service I've ever experienced, and it
I have a postdoc opening on continuous optimization and machine learning at Rice University. Interested candidates please send a CV to Shiqian Ma at [email protected]. Review of the applications will start on 09/23. Please help spread the word. Thanks!