Lisa Lee (@rl_agent)'s Twitter Profile
Lisa Lee

@rl_agent

Gemini pre-training & post-training

ID: 4843757851

Link: https://leelisa.com
Joined: 01-02-2016 08:20:24

150 Tweets

5.5K Followers

0 Following

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

Exciting News from Chatbot Arena! Google DeepMind's new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes. For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive

Sundar Pichai (@sundarpichai)

We’re kicking off the start of our Gemini 2.0 era with Gemini 2.0 Flash, which outperforms 1.5 Pro on key benchmarks at 2X speed (see chart below). I’m especially excited to see the fast progress on coding, with more to come.  Developers can try an experimental version in AI

Google DeepMind (@googledeepmind)

Welcome to the world, Gemini 2.0 ✨ our most capable AI model yet. We're first releasing an experimental version of 2.0 Flash ⚡ It has better performance, new multimodal output, Google tool use - and paves the way for new agentic experiences. 🧵 goo.gle/gemini-2

Jeff Dean (@jeffdean)

The new Gemini 2.0 Flash model debuts as the #3 model overall on the lmarena.ai leaderboard, and #1, 2 or 3 in all the various subcategories. Pretty great for such a low latency model!

Anmol Gulati (@anmol01gulati)

This took some cooking: Project Mariner is out to automate tasks in your browser! Check out the benchmark results: 90.5% on WebVoyager. Excited for this to be released to Trusted Testers. The world as we know it is about to change, much sooner than we think :)

Ankesh Anand (@ankesh_anand)

Excited to share an early preview of our Gemini 2.0 Flash Thinking model with all its raw thoughts visible. Here's the model trying to solve a Putnam 2024 problem with multiple approaches and then self-verifying that its answer was correct.

Lisa Lee (@rl_agent)

A very fun chat with Hiroaki Kitano-san, Sony CTO and creator of the AIBO robot dog, about the future of AI and robotics. Happy New Year from Tokyo! This New Year, I had the chance to meet Hiroaki Kitano, CTO of Sony Group. We had a fun conversation about the future of artificial intelligence and robotics.

Mostafa Dehghani (@m__dehghani)

Anyone who has been in this room knows that it’s never just another day in here! This space has seen the extremes of chaos and genius! ...and we ship! developers.googleblog.com/en/experiment-… Happy Wednesday everyone!

Cristian Peñas ░░░░░░░░ (@ilumine_ai)

This quick experiment I just did made my jaw drop... You can literally create and play any game by iterating over images with the new Gemini model! 🤯

Cristian Peñas ░░░░░░░░ (@ilumine_ai)

Gemini can generate pretty consistent gif animations too: 'Create an animation by generating multiple frames, showing a seed growing into a plant and then blooming into a flower, in a pixel art style'

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

🚨Breaking: Google DeepMind’s latest Gemini-2.5-Pro is now ranked #1 across all LMArena leaderboards 🏆

Highlights:
- #1 in all text arenas (Coding, Style Control, Creative Writing, etc.)
- #1 on the Vision leaderboard with a ~70 pts lead!
- #1 on WebDev Arena, surpassing Claude
