Siamak Shakeri (@maxsonate) 's Twitter Profile
Siamak Shakeri

@maxsonate

RS at Google DeepMind, Working on Gemini β™ŠοΈ RL. Snowboarding and traveling when not working

ID: 1285101980

calendar_today21-03-2013 05:04:14

163 Tweet

422 Followers

288 Following

Garrett Bingham (@gjb_ai) 's Twitter Profile Photo

An advanced version of Gemini with Deep Think achieved a gold medal at this year's IMO! πŸ₯‡ So many amazing contributions from the team. What's next? deepmind.google/discover/blog/…

An advanced version of Gemini with Deep Think achieved a gold medal at this year's IMO! πŸ₯‡

So many amazing contributions from the team. What's next?

deepmind.google/discover/blog/…
Quoc Le (@quocleix) 's Twitter Profile Photo

Excited to share that a scaled up version of Gemini DeepThink achieves gold-medal standard at the International Mathematical Olympiad. This result is official, and certified by the IMO organizers. Watch out this space, more to come soon! deepmind.google/discover/blog/…

Melvin Johnson (@melvinjohnsonp) 's Twitter Profile Photo

So happy to see this incredible achievement. Huge congrats to Thang Luong, Quoc Le, Yi Tay and the IMO team on the result. This was a great collaboration across teams to build a general Gemini DeepThink model that can also get gold at IMO.

Vahab Mirrokni (@mirrokni) 's Twitter Profile Photo

Proud to announce an official Gold Medal at #IMO2025πŸ₯‡ The IMO committee has certified the result from our general-purpose Gemini systemβ€”a landmark moment for our team and for the future of AI reasoning. deepmind.google/discover/blog/… (1/n) Highlights in thread:

Siamak Shakeri (@maxsonate) 's Twitter Profile Photo

I really enjoyed our slow yet meticulous scientific approach to the IMO work, while we knew in-house that we are πŸ₯‡, we waited for the official grading, adhered to all the requirements, had a well-thought out and inclusive announcement. This is what matters in the long run, imo.

Siamak Shakeri (@maxsonate) 's Twitter Profile Photo

I am thrilled to introduce TYDI QA-WANA, a new benchmark for information-seeking question answering. Our dataset includes 28K question-answer pairs across 10 languages of West Asia and North Africa. Key features of TYDI QA-WANA: 1) Questions are genuinely information-seeking, as

Yi Tay (@yitayml) 's Twitter Profile Photo

The Gemini Deep Think model that achieved IMO gold medal πŸ₯‡ is launched! This is a general purpose model that is not only SOTA at math/proofs but also reasoning, code and many others! πŸ”₯ The exact config that achieved IMO gold with scaled up Deep Think is being made available

Siamak Shakeri (@maxsonate) 's Twitter Profile Photo

Further independent verification of 1) strong performance of Gemini models on math, 2) Gemini Deep Think showing very strong results and also clean and well written proofs, 3) move inference time compute -> more wins. We need harder and hard baselines. At some point, it will

Yi Tay (@yitayml) 's Twitter Profile Photo

I really like the manager style of my good buddy Mostafa Dehghani. Sounds like a lot of fun to be his report. Someday I will aspire to be like him. He is so funny. I remember once he told me he rejected his reports travel because they did not take business and only economy. πŸ˜‚πŸ«‘