Tal Schuster (@talschuster)'s Twitter Profile
Tal Schuster

@talschuster

Working on Gemini @GoogleDeepMind | formerly: PhD @MIT_CSAIL @MITNLP. Opinions my own

ID: 2807461219

Link: https://people.csail.mit.edu/tals

Joined: 13-09-2014 12:47:42

418 Tweets

1.1K Followers

775 Following

Google DeepMind (@googledeepmind)

Think you know Gemini? 🤔 Think again. Meet Gemini 2.5: our most intelligent model 💡 The first release is Pro Experimental, which is state-of-the-art across many benchmarks - meaning it can handle complex problems and give more accurate responses. Try it now →

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer

Sangmin Bae (@raymin0223)

Sharing our poster at ICLR 2025! Please stop by the poster if you're interested 😊 Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA 📍 Hall 3 + Hall 2B, Poster #262 🗓️ Saturday, April 26, 10:00 a.m. to 12:30 p.m.

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

🚨Breaking: Google DeepMind’s latest Gemini-2.5-Pro is now ranked #1 across all LMArena leaderboards 🏆 Highlights: - #1 in all text arenas (Coding, Style Control, Creative Writing, etc) - #1 on the Vision leaderboard with a ~70 pts lead! - #1 on WebDev Arena, surpassing Claude

Jack Rae (@jack_w_rae)

There were a lot of announcements at I/O, so it's easy to overlook the new 2.5 Flash. It's pushing new boundaries in capability vs speed!

Deedy (@deedydas)

Google DeepMind just dropped this new LLM model architecture called Mixture-of-Recursions. It gets 2x inference speed, reduced training FLOPs and ~50% reduced KV cache memory. Really interesting read. Has potential to be a Transformers killer.

Sangmin Bae (@raymin0223)

Thanks for sharing our work, Deedy! MoR is a new architecture that upgrades Recursive Transformers and Early-Exiting algorithms. Simple pretraining with a router, plus faster inference and smaller KV caches! A post with details and the code will be released very soon. Stay tuned! ☺️

Reza Bayat (@reza_byt)

📄 New Paper Alert! ✨ 🚀 Mixture of Recursions (MoR): Smaller models • Higher accuracy • Greater throughput. Across 135M–1.7B params, MoR carves a new Pareto frontier: equal training FLOPs yet lower perplexity, higher few-shot accuracy, and more than 2x throughput.

Google DeepMind (@googledeepmind)

An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵

Thang Luong (@lmthang)

Very excited to share that an advanced version of Gemini Deep Think is the first to have achieved gold-medal level in the International Mathematical Olympiad! 🏆, solving five out of six problems perfectly, as verified by the IMO organizers! It’s been a wild run to lead this

Demis Hassabis (@demishassabis)

Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to Thang Luong and the team! deepmind.google/discover/blog/…

Tal Schuster (@talschuster)

We achieved gold medal-level IMO score with an advanced Gemini model that is fast enough to solve hard reasoning problems within the competition time limit. Honored to have had the chance to collaborate with the GDM DeepThink Math team and contribute to this huge milestone!

TestingCatalog News 🗞 (@testingcatalog)

Gemini Deep Think IMO 👀 It is one of the first models which I am testing extensively b/c it is very fun to play with. "Cyberpunk nuclear reactor control interface"