Olivier Bachem (@olivierbachem) Twitter Tweets • TwiCopy

Sara Hooker

7 months ago

Armand Joulin cohere Thanks Armand Joulin -- we will make a correction. We group all private testing by provider. So while overall number of variants is correct, in this case there is very different testing patterns per model family under a provider. We will clarify gemma only had one private test.

thumb_up_off_alt11

chat_bubble_outline1

repeat1

shareShare

Rishabh Agarwal

@agarwl_

7 months ago

Qwen3 validating on policy distillation at scale for thinking!

thumb_up_off_alt89

chat_bubble_outline1

repeat8

shareShare

Google DeepMind

@googledeepmind

6 months ago

We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO

thumb_up_off_alt4,4K

chat_bubble_outline85

repeat663

shareShare

Jean Tarbouriech

@jean_tarbou

6 months ago

1000+ words per second! ⚡ We just unleashed Gemini Diffusion at #GoogleIO! 🚀 Awesome being part of the team that took this from a small research project all the way to I/O Google DeepMind 🪐

thumb_up_off_alt137

chat_bubble_outline5

repeat31

shareShare

Shantanu Thakoor

@shantanuthakoor

6 months ago

It's been an incredible experience being part of the team that took this from a small research project all the way to I/O 🪐 Super proud of the team! Google DeepMind

thumb_up_off_alt15

chat_bubble_outline1

repeat7

shareShare

Ivana Balazevic

@ibalazevic

6 months ago

🚀Meet Gemini Diffusion, our first diffusion-based and super fast language model, just announced at Google I/O!🚀 Very excited to be able to share what I've been working on for the past little while with our amazing small team Google DeepMind.

thumb_up_off_alt438

chat_bubble_outline23

repeat45

shareShare

Alexandre Ramé

@ramealexandre

6 months ago

Releasing Gemma 3n, our new open-weight model processing audio, images and text (with improved multilingual capabilities), optimized for on-device usage with MatFormer architecture (enabling adaptive compute) and reaching 1283 on Chatbot Arena. Read more: developers.googleblog.com/en/introducing….

thumb_up_off_alt82

chat_bubble_outline2

repeat22

shareShare

Anian Ruoss

@anianruoss

6 months ago

🔥 Gemini Diffusion is blazing fast 🔥 Honored to have been part of this amazing team!

thumb_up_off_alt42

chat_bubble_outline1

repeat8

shareShare

George Powell

@thegeorgepowell

6 months ago

Gemini Diffusion has been announced at #GoogleIO! 🚀 Diffusion for text allows for self correction and incredibly fast inference by generating tokens in parallel over a long horizon. Super proud to have played a small part in making this happen over the last two years.

thumb_up_off_alt16

chat_bubble_outline1

repeat4

shareShare

Edouard Leurent

@eleurent

6 months ago

Excited to share what I've been up to: Gemini Diffusion is FAST! I'm convinced this will revolutionise iterative workflows: refine, get instant feedback, repeat! So proud of what our small team achieved here🪐

thumb_up_off_alt122

chat_bubble_outline5

repeat18

shareShare

Pier Giuseppe Sessa

@piergsessa

6 months ago

Gemini Diffusion is out! Very excited to have worked on the post-training of such a state-of-the-art text diffusion model. Incredible performance at lightspeed⚡️ Congrats to everyone involved!!

thumb_up_off_alt36

chat_bubble_outline0

repeat9

shareShare

Aditya Kusupati

@adityakusupati

6 months ago

Pocket powerhouse admist I/O awesomeness! Gemma 3n E4B & E2B are insane models, optimized for on-device while rivaling frontier models. It's a 🪆Matryoshka Transformer (MatFormer)🪆: Natively elastic b/w 4B & 2B pareto-optimally! ⭐️: free models with ZERO training cost! 🧵👇

thumb_up_off_alt293

chat_bubble_outline9

repeat38

shareShare

Blanca Huergo

@blancahuergo

6 months ago

Very excited to share what I have been working on. Having been part of the Gemini Diffusion team since day one, it is amazing to see our model demoed at Google I/O :) sign up below to try it out!

thumb_up_off_alt82

chat_bubble_outline4

repeat13

shareShare

Brendan O'Donoghue

@bodonoghue85

6 months ago

Excited to share what my team has been working on lately - Gemini diffusion! We bring diffusion to language modeling, yielding more power and blazing speeds! 🚀🚀🚀 Gemini diffusion is especially strong at coding. In this example the model generates at 2000 tokens/sec,

thumb_up_off_alt2,2K

chat_bubble_outline89

repeat258

shareShare

Himanshu Sahni

@sahnihim

6 months ago

Such a privilege to work on Gemini Diffusion with an amazing team! From a small research project to launching at I/O - we've got unstoppable aura 🚀 Welcome to the era of live vibe coding ⚡️

thumb_up_off_alt39

chat_bubble_outline2

repeat8

shareShare

Olivier Bachem

@olivierbachem

6 months ago

Really proud that two new models have been presented at I/O which we have post-trained: - Gemini Diffusion: with >1k tokens per second a completely new LLM experience deepmind.google/models/gemini-… - Gemma 3n: pushing the boundary of what is possible on mobile developers.googleblog.com/en/introducing…

thumb_up_off_alt23

chat_bubble_outline0

repeat4

shareShare

Robert Dadashi

@robdadashi

6 months ago

Gemma 3n is out! 🚀🚀🚀 The frontier models from a year ago can now run locally on a phone! Lots of innovations (e.g. matformers, mix’n’match, per layer embeddings) to make this model mobile first. And we finally have audio/video as an input for Gemma models! 1/2