Shiqian Ma (@shiqianma) 's Twitter Profile
Shiqian Ma

@shiqianma

Professor@Rice University. PhD from Columbia IEOR. Work on optimization and machine learning.

ID: 1377686198576029697

linkhttp://sqma.rice.edu calendar_today01-04-2021 18:16:31

251 Tweet

1,1K Takipçi

1,1K Takip Edilen

Robert M. Gower 🇺🇦 (@gowerrobert) 's Twitter Profile Photo

When training a DNN model (specially LLMs) we often clip the gradients. Why is that? Something to do with exploding gradients? We Fabian Schaipp Guillaume Garrigos Umut U. Simsekli have a story to tell on how clipping is computing the geometric median of gradients. arxiv.org/pdf/2402.12828…

When training a DNN model (specially LLMs) we often clip the gradients. Why is that? Something to do with exploding gradients? We <a href="/FSchaipp/">Fabian Schaipp</a> <a href="/GuillaumeG_/">Guillaume Garrigos</a>  <a href="/umutsimsekli/">Umut U. Simsekli</a>  have a story to tell on how clipping is computing the geometric median of gradients. arxiv.org/pdf/2402.12828…
Shiqian Ma (@shiqianma) 's Twitter Profile Photo

Our SIAM Review paper is online today. This paper is an extended version of our 2020 SIOPT paper which proposed the ManPG algorithm and was chosen for the SIAM Review SIGEST Award. Congratulations to coauthors Shixiang Chen, Anthony So and Tong Zhang. epubs.siam.org/doi/abs/10.113…

Krishna Balasubramanian (@krizna_b) 's Twitter Profile Photo

Happy to have this paper on 1-sample zeroth-order Riemannian opt accepted in SIAM Journal on Optimization: arxiv.org/abs/2309.14506 Includes generic approx. bounds between costly 'parallel transports' and cheaper 'vector transports' Joint work with Jiaxiang Li and Shiqian Ma

Quanquan Gu (@quanquangu) 's Twitter Profile Photo

The website VFS Global @vfsglobalcare for booking France visa appointments is not functioning properly; this time, I can't receive the OTP.... Over the past four months, I've encountered numerous issues with this website. This is the worst service I've ever experienced, and it

Quanquan Gu (@quanquangu) 's Twitter Profile Photo

Just spoke with someone from @vfsglobalcare on the phone and was told their system is not compatible with any Apple device. They suggested I use Chrome and clear all cookies. Seriously? This service is terrible.

Shiqian Ma (@shiqianma) 's Twitter Profile Photo

At #ISMP2024 , today at 2pm in 510C, I will talk about adaptive Barzilai-Borwein method. It is a line-search-free, parameter-free gradient method, a very simple modification to the BB method. We prove the O(1/k) convergence rate for general convex functions.

At #ISMP2024 , today at 2pm in 510C, I will talk about adaptive Barzilai-Borwein method. It is a line-search-free, parameter-free gradient method, a very simple modification to the BB method. We prove the O(1/k) convergence rate for general convex functions.
Shiqian Ma (@shiqianma) 's Twitter Profile Photo

departed from Montreal at 8:30am. 3 hours later, we are back to Montreal due to technical errors with the aircraft. Traveling experiences in this summer have been not good.

Shiqian Ma (@shiqianma) 's Twitter Profile Photo

I have a postdoc opening on continuous optimization and machine learning at Rice University. Interested candidates please send a CV to Shiqian Ma at [email protected]. Review of the applications will start on 09/23. Please help spread the word. Thanks!