
Eric Malmi
@ericmalmi
Staff Research Scientist at Google DeepMind, building Gemini | PhD @AaltoUniversity
ID: 859294200
https://ericmalmi.com/ 03-10-2012 07:02:15
348 Tweets
1.1K Followers
831 Following




🚨Breaking: Google DeepMind’s latest Gemini-2.5-Pro is now ranked #1 across all LMArena leaderboards 🏆
Highlights:
- #1 in all text arenas (Coding, Style Control, Creative Writing, etc)
- #1 on the Vision leaderboard with a ~70 pts lead!
- #1 on WebDev Arena, surpassing Claude


thank you for the recognition Gary Marcus! there's room for improvement, but I find it quite remarkable that an LLM learns to play creative sacrifices like this (best move according to Stockfish)


🚨Breaking: New Gemini-2.5-Pro (06-05) takes the #1 spot across all Arenas again!
🥇 #1 in Text, Vision, WebDev
🥇 #1 in Hard, Coding, Math, Creative, Multi-turn, Instruction Following, and Long Queries categories
Huge congrats Google DeepMind!


Poster Spotlight! 🔦
Mastering Board Games by External and Internal Planning with Language Models ♟️
arxiv.org/abs/2412.12119
On Wednesday (Poster Session 3 East)
Presented by Jakub Adamek and Eric Malmi
