
Robert Dadashi
@robdadashi
reinforcement learning research @GoogleDeepMind, Gemma post-training lead
ID: 2799887322
https://ddsh.github.io 09-09-2014 13:30:01
186 Tweet
1,1K Followers
441 Following



Congratulations to the whole Gemma team for the launch and especially Aishwarya Kamath who did an amazing job pushing the MM capability of the model 🚀. Give a try to the model 🔥

Omar Sanseviero okay this is dope af


Hiring two student researchers for Gemma post-training team at Google DeepMind Paris! First topic is about diversity in RL for LLMs (merging, generalization, exploration & creativity), second is about distillation (with Nino Vieillard). Ideal if you're finishing PhD. DMs open!







Really proud that two new models have been presented at I/O which we have post-trained: - Gemini Diffusion: with >1k tokens per second a completely new LLM experience deepmind.google/models/gemini-… - Gemma 3n: pushing the boundary of what is possible on mobile developers.googleblog.com/en/introducing…



