Kaustubh Sridhar @ ICLR 2025 (@_k_sridhar)'s Twitter Profile

Building generalist agents. Final-year PhD student @PennEngineers, Prev: @AmazonScience @iitbombay.

ID: 1372459248

Link: https://kaustubhsridhar.github.io/
Joined: 22-04-2013 16:05:31

1.1K Tweets

614 Followers

1.1K Following

adi (@adonis_singh)

I had #EarlyAccess to Gemini 3.0 for about 2 days (thanks to Logan Kilpatrick & the aistudio folks). Here we see gpt-5.1-thinking (left) vs gemini-3.0 (right) building the xbox controller in Minecraft.

Sundar Pichai (@sundarpichai)

It's been an exciting 7 days of shipping:
- New, much improved Gemini Live on Android and iOS
- Gemini 3.0 Pro in Gemini App and AI Studio
- Search AI Mode with Gemini 3.0 Pro and much improved shopping experience
- Google Antigravity, our next generation agentic IDE
- Nanobanana

Bilawal Sidhu (@bilawalsidhu)

AI can now create AND explore 3D worlds. World models and agentic AI are on a collision course. World Labs is making world-building effortless. Google DeepMind’s SIMA-2 is making agency inside those worlds possible. Together, they hint at a new paradigm—AI that both creates

Haider. (@slow_developer)

Demis Hassabis says Gemini 3 is on track and shows the fastest progress in the industry. But general intelligence requires more than just the current trajectory. It needs better reasoning, stronger memory, and "world model ideas" to solve physical intelligence. AGI is still 5–10

Dan Hendrycks (@danhendrycks)

Just how significant is the jump with Gemini 3? We just released a new leaderboard to track AI developments. Gemini 3 is the largest leap in a long time.

Dhruv Batra (@dhruvbatradb)

Introducing Yutori Navigator. 31 years ago, the modern web era began with Netscape Navigator. Today, we’re introducing Yutori Navigator — a web agent that autonomously navigates websites on its own cloud browser to complete tasks for you. Navigator achieves pareto-domination

Rajan Patel (@rajanpatel)

We asked AI Mode in Search how a basketball player's 3 point shot relates to the quadratic equation. With Gemini 3's new generative UI capabilities, it created an interactive visualization to help bring this concept to life. Try it out with the Gemini 3 drop down in AI Mode and

Sundar Pichai (@sundarpichai)

You went 🍌🍌 for Nano Banana. Now, meet Nano Banana Pro.  It’s SOTA for image generation + editing with more advanced world knowledge, text rendering, precision + controls. Built on Gemini 3, it’s really good at complex infographics - much like how engineers see the world:)

Google AI (@googleai)

Rolling out today we are launching Nano Banana Pro, the world’s best image model built to move beyond casual creation and into a new era of studio-quality, functional design. Nano Banana Pro enables a new level of precision and creative control, transforming the way you bring

Mostafa Dehghani (@m__dehghani)

Thinking (test-time compute) in pixel space... 🍌 Pro tip: always peek at the thoughts if you use AI Studio. Watching the model think in pictures is really fun!

Simon Willison (@simonw)

Nano Banana Pro, released this morning, is clearly the best image generation model. Superb instruction following, plus it can generate full infographics (with correct spelling and properly rendered text!) from a short prompt based on running extra searches simonwillison.net/2025/Nov/20/na…

Xeophon (@thexeophon)

you gotta give Gemini a serious try. same prompts, Gemini found the one thing I wanted in 1/3 the time, while ChatGPT took >3 mins, gave me 7 results. every time I did the comparison the last two days, both were equal or G was better

Riley Goodside (@goodside)

“Amateur photograph from 1998 of a middle-aged artist copying an image by hand from a computer screen to an oil painting on stretched canvas, but the image is itself the photo of the artist painting the recursive image.” Nano Banana Pro.

Dhruv Shah (@shahdhruv_)

My group Princeton University is hiring! We are looking for strong postdoc and PhD candidates to join our quest for intelligent robots in open-world environments. Read more below and get in touch 🤖🐅🧡 prism.robo.princeton.edu

Saining Xie (@sainingxie)

this may seem contradictory to scientific principles, but more often than you might imagine, you believe not because of what you see; you see because of what you believe.

Kaustubh Sridhar @ ICLR 2025 (@_k_sridhar)

I’ll be at NeurIPS in San Diego Wed-Fri. Get in touch if you want to talk about world models, general agents, robotics, or GDM.