Mingxue (Mercy) Xu @ICLR2025 (@mercyxu2022) 's Twitter Profile
Mingxue (Mercy) Xu @ICLR2025

@mercyxu2022

PhD student @ImperialCollege, generative (language) model compression with algebraic approaches

ID: 1498029445629812739

linkhttps://mingxue-xu.github.io/ calendar_today27-02-2022 20:17:22

31 Tweet

58 Followers

185 Following

Yuandong Tian (@tydsh) 's Twitter Profile Photo

Are gradient descent solutions completely explainable by group/ring theory in certain reasoning tasks? Yes! 🚀New paper: arxiv.org/abs/2410.01779 We analytically construct global optimizers to certain reasoning tasks (e.g., modular addition) in 2-layer network, from partial

The Nobel Prize (@nobelprize) 's Twitter Profile Photo

"I'm in a cheap hotel in California which doesn't have a good internet or phone connection. I was going to have an MRI scan today but I'll have to cancel that!" - New physics laureate Geoffrey Hinton speaking at today’s press conference where his #NobelPrize was announced.

"I'm in a cheap hotel in California which doesn't have a good internet or phone connection. I was going to have an MRI scan today but I'll have to cancel that!"

- New physics laureate Geoffrey Hinton speaking at today’s press conference where his #NobelPrize was announced.
Lisa Alazraki (@lisaalazraki) 's Twitter Profile Photo

Do LLMs need rationales for learning from mistakes? 🤔 When LLMs learn from previous incorrect answers, they typically observe corrective rationales explaining each mistake. In our new preprint, we find these rationales do not help, in fact they hurt performance! 🧵

Petar Veličković (@petarv_93) 's Twitter Profile Photo

"Tensor Species" by Andrew Dudzik 🧮🐒 A great talk at the Topos Institute Colloquium, overviewing our team's category-theoretic vision for understanding AI systems through CS primitives... ...casually algebrafying einsums and softmax in the process 🐈 youtube.com/live/xBI03pvUn…

λux (@novasarc01) 's Twitter Profile Photo

this is really a brilliant work by team anthropic in interpretability research…understanding the fundamental nature of LLMs and how they actually work internally is a huge task for researchers…the new research actually explains it through circuit tracing and attribution graphs

this is really a brilliant work by team anthropic in interpretability research…understanding the fundamental nature of LLMs and how they actually work internally is a huge task for researchers…the new research actually explains it through circuit tracing and attribution graphs
Imperial NLP (@imperial_nlp) 's Twitter Profile Photo

Can't wait to see everyone at ICLR and NAACL! Check out some of our awesome papers. Come and say hi, we'd love to have a chat :)

Can't wait to see everyone at ICLR and NAACL! Check out some of our awesome papers. Come and say hi, we'd love to have a chat :)
Carles Balsells Rodas (@balsellsrodas) 's Twitter Profile Photo

Excited to share that our paper "Causal discovery from Conditionally Stationary Time Series" has been accepted to ICML 2025!🥳 Pre-print: arxiv.org/abs/2110.06257 Thank you very much to all my collaborators, persistence pays off! #icml #icml2025

Joe Stacey (@_joestacey_) 's Twitter Profile Photo

We have a new paper up on arXiv! 🥳🪇 The paper tries to improve the robustness of closed-source LLMs fine-tuned on NLI, assuming a realistic training budget of 10k training examples. Here's a 60 second rundown of what we found!

We have a new paper up on arXiv! 🥳🪇

The paper tries to improve the robustness of closed-source LLMs fine-tuned on NLI, assuming a realistic training budget of 10k training examples. 

Here's a 60 second rundown of what we found!
Anthropic (@anthropicai) 's Twitter Profile Photo

Find out more about our open-source interpretability tools, and how to use them on open-weights models, here: anthropic.com/research/open-…

Lisa Alazraki (@lisaalazraki) 's Twitter Profile Photo

✨ Accepted as a Spotlight at #NeurIPS2025! Huge thanks to my coauthors and everyone who supported us. Check out the details below 👇

Fei-Fei Li (@drfeifei) 's Twitter Profile Photo

It’s an honor to have received the Queen Elizabeth Prize for Engineering along with my fellow laureates! But it’s also a responsibility. AI’s impact to humanity is in the hands of all of us.

Joe Stacey (@_joestacey_) 's Twitter Profile Photo

Wowww I passed my viva today!! Massive thank you to my assessors Roi Reichart and Francesca Toni for all their insightful and helpful feedback. I feel so lucky to have had the chance to do a PhD with Marek Rei who has been such a brilliant supervisor.