Garrett Bingham (@gjb_ai) 's Twitter Profile
Garrett Bingham

@gjb_ai

Research Scientist at Google DeepMind | UT Austin PhD | Yale BS

ID: 1465804425927151620

linkhttp://gjb.ai calendar_today30-11-2021 22:06:31

7 Tweet

15 Followers

42 Following

Archit Sharma (@archit_sharma97) 's Twitter Profile Photo

2.5 Pro Deep Think is an incredibly smart model. Some of the benchmark results, simply put were surprising to me. But, the benchmarks don’t tell the whole story. It can go into far more intricate details, especially open-ended prompts, unlike any of our previous thinking models.

2.5 Pro Deep Think is an incredibly smart model. Some of the benchmark results, simply put were surprising to me. But, the benchmarks don’t tell the whole story. It can go into far more intricate details, especially open-ended prompts, unlike any of our previous thinking models.
Garrett Bingham (@gjb_ai) 's Twitter Profile Photo

Gemini 2.5 Pro Deep Think is an SVG artist! Prompt: "Draw a SVG of a Pelican riding a bicycle" Left: Gemini 2.5 Pro Right: Gemini 2.5 Pro Deep Think Credit: simonwillison.net/2024/Oct/25/pe…

Gemini 2.5 Pro Deep Think is an SVG artist!

Prompt: "Draw a SVG of a Pelican riding a bicycle"
Left: Gemini 2.5 Pro
Right: Gemini 2.5 Pro Deep Think

Credit: simonwillison.net/2024/Oct/25/pe…
Thang Luong (@lmthang) 's Twitter Profile Photo

Attached is a full proof by Gemini 2.5 Pro #DeepThink with our experts' comments drive.google.com/file/d/1PaKXo4…. Here I quote a few important moments in the proof: (a) Expert: The main part of the solution begins with a "proof by contradiction", which is a reasonable choice considering

Garrett Bingham (@gjb_ai) 's Twitter Profile Photo

An advanced version of Gemini with Deep Think achieved a gold medal at this year's IMO! 🥇 So many amazing contributions from the team. What's next? deepmind.google/discover/blog/…

An advanced version of Gemini with Deep Think achieved a gold medal at this year's IMO! 🥇

So many amazing contributions from the team. What's next?

deepmind.google/discover/blog/…
Garrett Bingham (@gjb_ai) 's Twitter Profile Photo

"It was a team effort" sounds cliché, but it really was! We had so many late-breaking super critical contributions. What a magical experience. 🥇

Jasper Dekoninck (@j_dekoninck) 's Twitter Profile Photo

We launched a new competition on MathArena: Evaluation on the International Mathematics Competition for University students! Our goal: Verify the gold medals on the IMO by testing agentic Gemini-2.5-Pro and Gemini IMO Deep Think The results: The models aced the competition. 🧵

We launched a new competition on MathArena: Evaluation on the International Mathematics Competition for University students!

Our goal: Verify the gold medals on the IMO by testing agentic Gemini-2.5-Pro and Gemini IMO Deep Think

The results: The models aced the competition. 🧵
Archit Sharma (@archit_sharma97) 's Twitter Profile Photo

I think this is one of the interesting situations where considering multiple hypotheses (à la Deep Think) is good. (does someone have gpt-5 thinking or pro results?)

I think this is one of the interesting situations where considering multiple hypotheses (à la Deep Think) is good.

(does someone have gpt-5 thinking or pro results?)