Tekle 🇪🇷 (@alexandertekle)'s Twitter Profile
Tekle 🇪🇷

@alexandertekle

Gemini Post-Training @GoogleDeepMind, @texasexes, Dallas sports

ID: 1424414432

Joined: 13-05-2013 01:53:04

3.3K Tweets

1.1K Followers

1.1K Following

Google DeepMind (@googledeepmind)

Welcome to the world, Gemini 2.0 ✨ our most capable AI model yet. We're first releasing an experimental version of 2.0 Flash ⚡ It has better performance, new multimodal output, Google tool use - and paves the way for new agentic experiences. 🧵 goo.gle/gemini-2

Google DeepMind (@googledeepmind)

Today, we’re announcing Veo 2: our state-of-the-art video generation model which produces realistic, high-quality clips from text or image prompts. 🎥 We’re also releasing an improved version of our text-to-image model, Imagen 3 - available to use in ImageFX through

Google DeepMind (@googledeepmind)

Think you know Gemini? 🤔 Think again. Meet Gemini 2.5: our most intelligent model 💡 The first release is Pro Experimental, which is state-of-the-art across many benchmarks - meaning it can handle complex problems and give more accurate responses. Try it now →

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer

Noam Shazeer (@noamshazeer)

Introducing Gemini 2.5 Pro Experimental. The 2.5 series marks a significant evolution: Gemini models are now fundamentally thinking models. This means the model reasons before responding, to maximize accuracy -- and it’s our best Gemini model yet. Blog -

Silas Alberti (@silasalberti)

Wow we just ran Gemini 2.5 Pro on our evals and it got a new state of the art. Congrats to the Gemini team! Sharing preliminary results here and working on bringing it into Devin:

Mislav Balunović (@mbalunovic)

Big update to our MathArena USAMO evaluation: Gemini 2.5 Pro, which was released *the same day* as our benchmark, is the first model to achieve non-trivial amount of points (24.4%). The speed of progress is really mind-blowing.

Google DeepMind (@googledeepmind)

We’re launching Project Astra capabilities in Gemini Live ✨ Chat with Google Gemini App about anything you see 👀 by sharing your phone’s camera or screen during conversations. ↓

Logan Kilpatrick (@officiallogank)

Deep Research in the Gemini App is now powered by Gemini 2.5 Pro, and our early tests show users prefer this 2:1 vs “other products” ;) gemini.google.com

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

⚡ The latest Gemini 2.5 Flash has arrived on the leaderboard! Ranked jointly at #2 and matching top models such as GPT 4.5 Preview & Grok-3! Highlights: 🏆 tied #1 in Hard Prompts, Coding, and Longer Query 💠 Top 4 across all categories 💵 5-10x cheaper than Gemini-2.5-Pro

Logan Kilpatrick (@officiallogank)

Gemini 2.5 Flash is here, our first unified reasoning model with thinking budgets. 🔥 It’s on the Pareto frontier and punches above its price and size!! developers.googleblog.com/en/start-build…

Demis Hassabis (@demishassabis)

Very excited to share the best coding model we’ve ever built! Today we’re launching Gemini 2.5 Pro Preview 'I/O edition' with massively improved coding capabilities. Ranks no.1 on LMArena in Coding and no.1 on the WebDev Arena Leaderboard. It’s especially good at building

Sundar Pichai (@sundarpichai)

Our latest Gemini 2.5 Pro update is now in preview. It’s better at coding, reasoning, science + math, shows improved performance across key benchmarks (AIDER Polyglot, GPQA, HLE to name a few), and leads lmarena.ai with a 24pt Elo score jump since the previous version. We also

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

🚨Breaking: New Gemini-2.5-Pro (06-05) takes the #1 spot across all Arenas again! 🥇 #1 in Text, Vision, WebDev 🥇 #1 in Hard, Coding, Math, Creative, Multi-turn, Instruction Following, and Long Queries categories Huge congrats Google DeepMind!
