Olivier Bachem
@olivierbachem
Director, Research Scientist at @GoogleDeepMind where I lead the research team that post-trains Gemma
ID: 1259188770
http://olivierbachem.ch 11-03-2013 11:14:08
546 Tweets
3.3K Followers
329 Following
Armand Joulin cohere Thanks Armand Joulin, we will make a correction. We group all private testing by provider. So while the overall number of variants is correct, in this case there are very different testing patterns per model family under a provider. We will clarify that Gemma only had one private test.
1000+ words per second! ⚡ We just unleashed Gemini Diffusion at #GoogleIO! 🚀 Awesome being part of the team that took this from a small research project all the way to I/O Google DeepMind 🪐
It's been an incredible experience being part of the team that took this from a small research project all the way to I/O 🪐 Super proud of the team! Google DeepMind
🚀Meet Gemini Diffusion, our first diffusion-based and super fast language model, just announced at Google I/O!🚀 Very excited to be able to share what I've been working on for the past little while with our amazing small team Google DeepMind.
Really proud that two new models have been presented at I/O which we have post-trained:
- Gemini Diffusion: with >1k tokens per second a completely new LLM experience deepmind.google/models/gemini-…
- Gemma 3n: pushing the boundary of what is possible on mobile developers.googleblog.com/en/introducing…