Adam Sadovsky (@asadovsky)'s Twitter Profile
Adam Sadovsky

@asadovsky

Distinguished Software Engineer / Senior Director, Gemini

ID: 11245832

17-12-2007 05:03:50

128 Tweets

1.1K Followers

406 Following

Demis Hassabis (@demishassabis)

Our latest update to our Gemini 2.0 Flash Thinking model (available here: goo.gle/4jsCqZC) scores 73.3% on AIME (math) & 74.2% on GPQA Diamond (science) benchmarks. Thanks for all your feedback, this represents super fast progress from our first release just this past

Andrej Karpathy (@karpathy)

We have to take the LLMs to school.

When you open any textbook, you'll see three major types of information:

1. Background information / exposition. The meat of the textbook that explains concepts. As you attend over it, your brain is training on that data. This is equivalent
Farzad Mostashari (@farzad_md)

1/ After residency at Mass General Hospital, I reported to Atlanta to meet my fellow CDC Epidemic Intelligence Service Officers. I have never felt so intimidated by my peers. The best and the brightest, they were star clinicians, had served in disaster zones; MD/PhDs and MSF.

Subhash Choudhary (@subhashchy)

We replaced GPT-4o with Gemini-2.0 Flash for Bot9, reducing our costs by about 20× with no visible loss in accuracy. 

This change was implemented on a highly complex support agent that makes 32 tool calls.

I was seriously not expecting this. 

At the application layer, it also
Kyle Corbitt (@corbtt)

If you're fine-tuning LLMs, Gemma 3 is the new 👑 and it's not close. Gemma 3 trounces Qwen/Llama models at every size!
 - Gemma 3 4B beats 7B/8B competition
 - Gemma 3 27B matches 70B competition

Vision benchmarks coming soon!
lmarena.ai (formerly lmsys.org) (@lmarena_ai)

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆

Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer
Martin Baeuml (@mbaeuml)

Just shipped a few updates

1. Gemini 2.5 Pro to try for free on gemini.google.com in the model drop down. Advanced has higher limits.
2. Canvas with 2.5 Pro in Advanced. Our best coding model yet.

We had so much fun building demos internally, can't wait to see what y'all

Nando de Freitas (@nandodf)

This was an amazing week at Microsoft AI!! We released MAI 1 preview and a taste of MAI Voice. I’m super happy with this team - only about 100 people and already shipping in Arena in less than a year. Strong support. More soon. Thanks for feedback!

Mustafa Suleyman (@mustafasuleyman)

Meet our third Microsoft AI model: MAI-Image-1
#9 on LMArena, striking an impressive balance of generation speed and quality 
Excited to keep refining + climbing the leaderboard from here! 
We're just getting started.
microsoft.ai/news/introduci…