Mubashara Akhtar (@akhtarmubashara) 's Twitter Profile
Mubashara Akhtar

@akhtarmubashara

Postdoc fellow @ETH_AI_Center @ETH_en • NLP, Multimodality • prev @KingsCollegeLon @CambridgeNLP, intern @GoogleDeepmind, board member @igwien, @ClubAlpbachLdn

ID: 1205421834356830208

linkhttp://mubasharaakhtar.com calendar_today13-12-2019 09:39:08

709 Tweet

1,1K Followers

899 Following

Zeyuan Allen-Zhu, Sc.D. (@zeyuanallenzhu) 's Twitter Profile Photo

🚀 NVIDIA continues to lead on open-sourcing pretraining data — Nemotron-CC-v2 has dropped! 👏 Congrats to Rabeeh Karimi Sanjeev Satheesh Pavlo Molchanov Kezhi Kong X. Dong Bryan Catanzaro Yejin Choi + many others! 🙏 A very loud thank you for citing our Physics of LMs, Part 3.1.

🚀 NVIDIA continues to lead on open-sourcing pretraining data — Nemotron-CC-v2 has dropped! 
👏 Congrats to <a href="/KarimiRabeeh/">Rabeeh Karimi</a> <a href="/issanjeev/">Sanjeev Satheesh</a> <a href="/PavloMolchanov/">Pavlo Molchanov</a> <a href="/KezhiKong/">Kezhi Kong</a> <a href="/SimonXinDong/">X. Dong</a> <a href="/ctnzr/">Bryan Catanzaro</a> <a href="/YejinChoinka/">Yejin Choi</a> + many others!

🙏 A very loud thank you for citing our Physics of LMs, Part 3.1.
Marzieh Fadaee (@mziizm) 's Twitter Profile Photo

I'm excited to share that I'll be stepping into the role of Head of Cohere Labs. It's an honor and a responsibility to lead such an extraordinary group of researchers pushing the boundaries of AI research.

I'm excited to share that I'll be stepping into the role of Head of Cohere Labs. It's an honor and a responsibility to lead such an extraordinary group of researchers pushing the boundaries of AI research.
Noam Brown (@polynoamial) 's Twitter Profile Photo

When we at OpenAI released o1-preview a year ago, it would think for seconds. Today, our best reasoning models can think for hours, browse the web, and write code. But there's a lot of room to push reasoning even further. I'm excited for what the next year will bring!

Minqi Jiang (@minqijiang) 's Twitter Profile Photo

Just got the greenlight to share some work we did at Google DeepMind from over a year ago: We fine-tuned Gemini on thousands of the most toxic discussions on 4chan...and it just talked to us like a completely normal and nice language model. How? Our method, Generative Data

Just got the greenlight to share some work we did at Google DeepMind from over a year ago:

We fine-tuned Gemini on thousands of the most toxic discussions on 4chan...and it just talked to us like a completely normal and nice language model.

How? Our method, Generative Data
Percy Liang (@percyliang) 's Twitter Profile Photo

-2016 (classic era): focus on data efficiency 2017-2025 (pretraining era): focus on compute efficiency 2026-: focus on data efficiency (again) The standard Transformer paradigm is optimized for compute efficiency. As we look at data efficiency, we'll see very different design

ICLR 2025 (@iclr_conf) 's Twitter Profile Photo

We’ve received A LOT OF submissions this year 🤯🤯 and are excited to see so much interest! To ensure high-quality review, we are looking for more dedicated reviewers. If you'd like to help, please sign up here docs.google.com/forms/d/e/1FAI…

Daniel Ek (@eldsjal) 's Twitter Profile Photo

Spent an incredible two days in Zurich speaking with students at ETH Zurich. I shared a little bit of my journey as an entrepreneur and spent time inside the ETH AI Center, learning about what they've been working on. The questions from students reminded me just how much curiosity

Spent an incredible two days in Zurich speaking with students at <a href="/ETH_en/">ETH Zurich</a>. I shared a little bit of my journey as an entrepreneur and spent time inside the ETH AI Center, learning about what they've been working on. The questions from students reminded me just how much curiosity
ZurichAI (@zurichnlp) 's Twitter Profile Photo

The first ever Zurich Robotics event is tonight 18:00 ETH AI Center: Barnabas Gavin Cangan (Barnabas Gavin Cangan, ETHZ) on why robot hands are so hard and Caterina Caccavella (ZHAW / ETHZ) on bio-inspired active sensing. zurichai.ch/events/zurichr…

Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

🇨🇳 China just dropped a new GPU: Fantasy III - Claims CUDA compatibility and Ray tracing support - 112GB+ of HBM memory for AI - positioning it as an all-purpose card for gaming, compute, and medical imaging. - For AI, the memory footprint is pitched as enough for 32B and 72B

🇨🇳 China just dropped a new GPU: Fantasy III

- Claims CUDA compatibility and Ray tracing support
- 112GB+ of HBM memory for AI
- positioning it as an all-purpose card for gaming, compute, and medical imaging.
- For AI, the memory footprint is pitched as enough for 32B and 72B
Tejal Patwardhan (@tejalpatwardhan) 's Twitter Profile Photo

Understanding the capabilities of AI models is important to me. To forecast how AI models might affect labor, we need methods to measure their real-world work abilities. That’s why we created GDPval.

Understanding the capabilities of AI models is important to me. To forecast how AI models might affect labor, we need methods to measure their real-world work abilities. That’s why we created GDPval.
Mira Murati (@miramurati) 's Twitter Profile Photo

Sharing our second Connectionism research post on Modular Manifolds, a mathematical approach to refining training at each layer of the neural network

ACLRollingReview (@reviewacl) 's Twitter Profile Photo

📢 Early submission for ACL 2026 via ARR Oct cycle is available to support authors facing potential visa delays. 📝 Early invitation letters possible for submit-ready work (not acceptance guarantees). ⚠️ Preliminary work risks rejection & Jan-cycle ineligibility. #NLProc #ARR

Andreas Vlachos (@vlachos_nlp) 's Twitter Profile Photo

Excited to announce the 9th FEVERworkshop collocated with #EACL2026 in Morocco: fever.ai/index.html with a new shared task on #factchecking image-text claims with evidence from the web using the AVerImaTeC dataset: arxiv.org/abs/2505.17978 Deadline 2nd of December! And...

Mubashara Akhtar (@akhtarmubashara) 's Twitter Profile Photo

🔍 Do vision-language models truly understand diagrams - or just leverage shortcuts? Excited to share our new paper: “Chimera: Diagnosing Shortcut Learning in Visual-Language Understanding” lead by Ziheng Chi & Yifan Hou Yifan Hou! 📄 Paper: arxiv.org/abs/2509.22437 ✨

🔍 Do vision-language models truly understand diagrams - or just leverage shortcuts?

Excited to share our new paper: “Chimera: Diagnosing Shortcut Learning in Visual-Language Understanding” lead by Ziheng Chi &amp; Yifan Hou <a href="/yyyyyyyyifan/">Yifan Hou</a>!

 📄 Paper: arxiv.org/abs/2509.22437

✨