Mubashara Akhtar (@akhtarmubashara) Twitter Tweets • TwiCopy

Mubashara Akhtar

@akhtarmubashara

+ Follow

Postdoc fellow @ETH_AI_Center @ETH_en • NLP, Multimodality • prev @KingsCollegeLon @CambridgeNLP, intern @GoogleDeepmind, board member @igwien, @ClubAlpbachLdn

ID: 1205421834356830208

linkhttp://mubasharaakhtar.com calendar_today13-12-2019 09:39:08

709 Tweet

1,1K Followers

899 Following

Zeyuan Allen-Zhu, Sc.D.

@zeyuanallenzhu

3 months ago

🚀 NVIDIA continues to lead on open-sourcing pretraining data — Nemotron-CC-v2 has dropped! 👏 Congrats to Rabeeh Karimi Sanjeev Satheesh Pavlo Molchanov Kezhi Kong X. Dong Bryan Catanzaro Yejin Choi + many others! 🙏 A very loud thank you for citing our Physics of LMs, Part 3.1.

🚀 NVIDIA continues to lead on open-sourcing pretraining data — Nemotron-CC-v2 has dropped!
👏 Congrats to <a href="/KarimiRabeeh/">Rabeeh Karimi</a> <a href="/issanjeev/">Sanjeev Satheesh</a> <a href="/PavloMolchanov/">Pavlo Molchanov</a> <a href="/KezhiKong/">Kezhi Kong</a> <a href="/SimonXinDong/">X. Dong</a> <a href="/ctnzr/">Bryan Catanzaro</a> <a href="/YejinChoinka/">Yejin Choi</a> + many others!

🙏 A very loud thank you for citing our Physics of LMs, Part 3.1.

thumb_up_off_alt648

chat_bubble_outline14

repeat86

shareShare

Marzieh Fadaee

@mziizm

3 months ago

I'm excited to share that I'll be stepping into the role of Head of Cohere Labs. It's an honor and a responsibility to lead such an extraordinary group of researchers pushing the boundaries of AI research.

thumb_up_off_alt728

chat_bubble_outline74

repeat35

shareShare

AK

@_akhaliq

3 months ago

Meta Superintelligence Labs presents Language Self-Play For Data-Free Training

thumb_up_off_alt621

chat_bubble_outline23

repeat78

shareShare

Noam Brown

@polynoamial

3 months ago

When we at OpenAI released o1-preview a year ago, it would think for seconds. Today, our best reasoning models can think for hours, browse the web, and write code. But there's a lot of room to push reasoning even further. I'm excited for what the next year will bring!

thumb_up_off_alt1,1K

chat_bubble_outline79

repeat177

shareShare

Minqi Jiang

@minqijiang

3 months ago

Just got the greenlight to share some work we did at Google DeepMind from over a year ago: We fine-tuned Gemini on thousands of the most toxic discussions on 4chan...and it just talked to us like a completely normal and nice language model. How? Our method, Generative Data

thumb_up_off_alt1,1K

chat_bubble_outline64

repeat126

shareShare

Percy Liang

@percyliang

3 months ago

-2016 (classic era): focus on data efficiency 2017-2025 (pretraining era): focus on compute efficiency 2026-: focus on data efficiency (again) The standard Transformer paradigm is optimized for compute efficiency. As we look at data efficiency, we'll see very different design

thumb_up_off_alt628

chat_bubble_outline15

repeat68

shareShare

ICLR 2025

@iclr_conf

3 months ago

We’ve received A LOT OF submissions this year 🤯🤯 and are excited to see so much interest! To ensure high-quality review, we are looking for more dedicated reviewers. If you'd like to help, please sign up here docs.google.com/forms/d/e/1FAI…

thumb_up_off_alt362

chat_bubble_outline10

repeat72

shareShare

Daniel Ek

@eldsjal

3 months ago

Spent an incredible two days in Zurich speaking with students at ETH Zurich. I shared a little bit of my journey as an entrepreneur and spent time inside the ETH AI Center, learning about what they've been working on. The questions from students reminded me just how much curiosity

Spent an incredible two days in Zurich speaking with students at <a href="/ETH_en/">ETH Zurich</a>. I shared a little bit of my journey as an entrepreneur and spent time inside the ETH AI Center, learning about what they've been working on. The questions from students reminded me just how much curiosity

thumb_up_off_alt112

chat_bubble_outline21

repeat10

shareShare

ZurichAI

@zurichnlp

3 months ago

The first ever Zurich Robotics event is tonight 18:00 ETH AI Center: Barnabas Gavin Cangan (Barnabas Gavin Cangan, ETHZ) on why robot hands are so hard and Caterina Caccavella (ZHAW / ETHZ) on bio-inspired active sensing. zurichai.ch/events/zurichr…

thumb_up_off_alt6

chat_bubble_outline0

repeat2

shareShare

Rohan Paul

@rohanpaul_ai

3 months ago

🇨🇳 China just dropped a new GPU: Fantasy III - Claims CUDA compatibility and Ray tracing support - 112GB+ of HBM memory for AI - positioning it as an all-purpose card for gaming, compute, and medical imaging. - For AI, the memory footprint is pitched as enough for 32B and 72B

thumb_up_off_alt43

chat_bubble_outline5

repeat10

shareShare

Tejal Patwardhan

@tejalpatwardhan

3 months ago

Understanding the capabilities of AI models is important to me. To forecast how AI models might affect labor, we need methods to measure their real-world work abilities. That’s why we created GDPval.

thumb_up_off_alt1,1K

chat_bubble_outline59

repeat188

shareShare

Mira Murati

@miramurati

2 months ago

Sharing our second Connectionism research post on Modular Manifolds, a mathematical approach to refining training at each layer of the neural network

thumb_up_off_alt2,2K

chat_bubble_outline91

repeat256

shareShare

ACLRollingReview

@reviewacl

2 months ago

📢 Early submission for ACL 2026 via ARR Oct cycle is available to support authors facing potential visa delays. 📝 Early invitation letters possible for submit-ready work (not acceptance guarantees). ⚠️ Preliminary work risks rejection & Jan-cycle ineligibility. #NLProc #ARR

thumb_up_off_alt52

chat_bubble_outline4

repeat13

shareShare

Andreas Vlachos

@vlachos_nlp

2 months ago

Excited to announce the 9th FEVERworkshop collocated with #EACL2026 in Morocco: fever.ai/index.html with a new shared task on #factchecking image-text claims with evidence from the web using the AVerImaTeC dataset: arxiv.org/abs/2505.17978 Deadline 2nd of December! And...

thumb_up_off_alt14

chat_bubble_outline1

repeat6

shareShare

Mubashara Akhtar

@akhtarmubashara

2 months ago

🔍 Do vision-language models truly understand diagrams - or just leverage shortcuts? Excited to share our new paper: “Chimera: Diagnosing Shortcut Learning in Visual-Language Understanding” lead by Ziheng Chi & Yifan Hou Yifan Hou! 📄 Paper: arxiv.org/abs/2509.22437 ✨

thumb_up_off_alt11

chat_bubble_outline0

repeat0

shareShare