Mahdi Soltanolkotabi (@mahdisoltanol) 's Twitter Profile
Mahdi Soltanolkotabi

@mahdisoltanol

work on foundations of AI, MLLM reliability/Eval, optimization, probability/stats, AI 4 science/healthcare; Prof & director of center on AIF4S @USC 🚲🏔️🥾🏊‍♂️

ID: 1562908933936664577

linkhttps://viterbi-web.usc.edu/~soltanol/ calendar_today25-08-2022 21:05:09

117 Tweet

604 Followers

573 Following

Mahdi Soltanolkotabi (@mahdisoltanol) 's Twitter Profile Photo

I hear is getting a tad bit chilly 🥶 this weekend for some folks. Friendly reminder that u can make better life choices usccareers.usc.edu/job/los-angele… Yes this is January, yes this is an outdoor pool, no it is not heated, and yes it will be part of the 2028 LA Olympics!

I hear is getting a tad bit chilly 🥶 this weekend for some folks.  Friendly reminder that u can make better life choices 

usccareers.usc.edu/job/los-angele…

Yes this is January, yes this is an outdoor pool, no it is not heated, and yes it will be part of the 2028 LA Olympics!
Tianyi Zhou (@tianyi_zhou12) 's Twitter Profile Photo

Great to see others discovering similar findings as we did in our Neurips2024 paper (arxiv.org/abs/2406.03445). We call these Fourier features instead of helix. How are these features useful for representing numbers? Stay tuned for our new number embedding paper coming soon!

Robin Jia (@robinomial) 's Twitter Profile Photo

Our work (with Tianyi Zhou Deqing Fu and Vatsal Sharan) published at NeurIPS 2024 already found that pretrained LMs do addition using modular arithmetic/“trigonometry” (we called these Fourier features). Indeed it is a clever mechanism.

Tianyi Zhou (@tianyi_zhou12) 's Twitter Profile Photo

Billion-parameter LLMs still struggle with simple arithmetic? 📞 FoNE (Fourier Number Embedding) tackles this problem. By mapping numbers directly into Fourier space, it bypasses tokenization and significantly improves numerical accuracy with better efficiency and accuracy.

Mahdi Soltanolkotabi (@mahdisoltanol) 's Twitter Profile Photo

Opt friends: If I have PL inequality +lipshitzness instead of smoothness do we have convergence guarantees for GD updates. Obviously don’t expect geometric rates. In general good refs for convergence guarantees for PL without smoothness would be appreciated. Cc Damek

Lars Lindemann (@larslindemann2) 's Twitter Profile Photo

Our 2025 RSS workshop on "Statistical Uncertainty Quantification in the Era of AI-Enabled Robots" got accepted. We have an amazing lineup of tentative speakers, see sites.google.com/view/rss2025-w… 🚀 The workshop will be held at USC on June 25th (and we guarantee excellent weather 🏖️🌴)

Our 2025 RSS workshop on "Statistical Uncertainty Quantification in the Era of AI-Enabled Robots" got accepted. We have an amazing lineup of tentative speakers, see sites.google.com/view/rss2025-w… 🚀 The workshop will be held at USC on June 25th (and we guarantee excellent weather 🏖️🌴)
Mahdi Soltanolkotabi (@mahdisoltanol) 's Twitter Profile Photo

Friendly reminder that small random noise has many benefits: improves convergence, reliability, generalization, privacy & reduces deadline conflicts. This Thu I have 3 separate deadlines at 9AM, 12 PM, the AOE one, and end of semester craziness. All docs on overleaf 🤦‍♂️

Mahdi Soltanolkotabi (@mahdisoltanol) 's Twitter Profile Photo

Plan on doing some random summer reading. Pls send your favorite “Agents” papers by you or others (if possible with a short explanation of why u think is a good paper)

Ahmad Beirami @ ICLR 2025 (@abeirami) 's Twitter Profile Photo

If you have written/seen a cool new paper on post training / RL / agents, please share a link and tl;dr. I'm looking for some new reading material.

Mahdi Soltanolkotabi (@mahdisoltanol) 's Twitter Profile Photo

Happy to see the nature in Pacific Palisades slowly healing from the fires. I recently joined the school of eng working group on the LA fires. If anyone has any good ideas re where research efforts could be most useful for rebuilding/future prevention pls reach out

Happy to see the nature in Pacific Palisades slowly healing from the fires. I recently joined the school of eng working group on the LA fires. If anyone has any good ideas re where research efforts could be most useful for rebuilding/future prevention pls reach out
Ahmad Beirami @ ICLR 2025 (@abeirami) 's Twitter Profile Photo

After three incredible years, today is my last day at Google DeepMind! I am truly grateful to the amazing colleagues who made the journey 1000x more fruitful and enjoyable! I am forever indebted to my collaborators who showed me how to be better at everything via demonstrations.

After three incredible years, today is my last day at Google DeepMind!

I am truly grateful to the amazing colleagues who made the journey 1000x more fruitful and enjoyable! I am forever indebted to my collaborators who showed me how to be better at everything via demonstrations.
Andrew Hires (@andrewhires) 's Twitter Profile Photo

Great, detailed explainer. Indirect costs ARE research costs, but are ones that are hard to assign to individual projects. x.com/pottytheron/st…