Prateek Jain (@jainprateek_) 's Twitter Profile
Prateek Jain

@jainprateek_

Learning machine learning at Google DeepMind.

ID: 969143088127008769

Link: http://prateekjain.org | Joined: 01-03-2018 09:31:32

563 Tweets

5.5K Followers

668 Following

Alexandre Ramé (@ramealexandre) 's Twitter Profile Photo

Releasing Gemma 3n, our new open-weight model processing audio, images and text (with improved multilingual capabilities), optimized for on-device usage with MatFormer architecture (enabling adaptive compute) and reaching 1283 on Chatbot Arena. Read more: developers.googleblog.com/en/introducing….

Omar Sanseviero (@osanseviero) 's Twitter Profile Photo

Announcing Gemma 3n preview 

🎙️ Multimodal: audio, video, text
🧠 New architecture with 4B and 2B effective params
🤯 LMArena score of 1283
👀 Available in AI Studio

More info soon!

developers.googleblog.com/en/introducing…
Medhini Narasimhan (@medhini_n) 's Twitter Profile Photo

Beyond thrilled to share what we've been cooking the past few months! 🥳 Veo can now generate sounds and dialog! We are the state-of-the-art model on audio-video generation: deepmind.google/models/veo/eva… Try out Veo here: labs.google/flow/about

utku (@utkuevci) 's Twitter Profile Photo

This is something I've been working on with some amazing collaborators for a while. Model-software-hardware co-design. Making things run fast on real devices. A lot of learning.

And happy to share this with the open-source community and beyond.

developers.googleblog.com/en/introducing…
Andriy Burkov (@burkov) 's Twitter Profile Photo

As you probably already heard, Gemma 3n can use different numbers of parameters during inference (from 2B to 5B) depending on the device it runs on. This is thanks to the Matryoshka architecture proposed in the MatFormer paper.

MatFormer is based on "nested" feedforward network
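The nesting described above can be sketched in a few lines: one full-width feedforward block is trained, and smaller sub-models reuse only a prefix slice of its hidden units, Matryoshka-style. Everything below (dimensions, function names) is an illustrative assumption, not Gemma 3n's actual configuration or the MatFormer paper's exact recipe.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, d_ff_full = 8, 32                 # full FFN hidden width (toy sizes)
W_in = rng.standard_normal((d_model, d_ff_full))
W_out = rng.standard_normal((d_ff_full, d_model))

def nested_ffn(x, d_ff):
    """Run the feedforward block using only the first d_ff hidden units.

    The small model's weights are a prefix slice of the full model's,
    so no separate set of parameters is stored per size.
    """
    h = np.maximum(x @ W_in[:, :d_ff], 0.0)   # ReLU over the sliced block
    return h @ W_out[:d_ff, :]

x = rng.standard_normal(d_model)
y_small = nested_ffn(x, d_ff=8)    # smaller sub-model: prefix of hidden units
y_full = nested_ffn(x, d_ff=32)    # full model: all hidden units
```

Because the sub-model is literally a slice of the full weights, picking a deployment size at inference time (or mixing sizes per layer, as in mix'n'match) requires no retraining, only choosing `d_ff` per block.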
Tris Warkentin (@triswarkentin) 's Twitter Profile Photo

This is my favorite demo of Gemma 3n: multimodal live video understanding and intelligence, locally on your phone 🤯! This was only possible with the peak of foundation models at I/O last year (the Astra demo). The progress of small models is incredible.

Robert Dadashi (@robdadashi) 's Twitter Profile Photo

Gemma 3n is out! 🚀🚀🚀 The frontier models from a year ago can now run locally on a phone! Lots of innovations (e.g. matformers, mix’n’match, per layer embeddings) to make this model mobile first. And we finally have audio/video as an input for Gemma models! 1/2

Tim Dettmers (@tim_dettmers) 's Twitter Profile Photo

MatFormers are very powerful alternatives to transformers. Similar to a regular transformer, but after training, you can split up the model to any size you like and get very strong performance that scales just like a regular transformer. So train once, get models of all sizes!

Jack Rae (@jack_w_rae) 's Twitter Profile Photo

There were a lot of announcements at I/O, so it's easy to overlook the new 2.5 Flash.

It's pushing new boundaries in capability vs speed!
Google DeepMind (@googledeepmind) 's Twitter Profile Photo

Introducing Gemma 3n, our multimodal model built for mobile on-device AI. 🤳 It runs with a smaller memory footprint, cutting down RAM usage by nearly 3x – enabling more complex applications right on your phone, or for livestreaming from the cloud. Now available in early

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

What can you do with Gemma 3n?

🛠️ Generate smart text from audio, images, video, and text
🛠️ Create live, interactive apps that react to what users see and hear
🛠️ Build advanced audio apps for real-time speech, translation, and voice commands

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

Gemma 3n was built to be fast and efficient. 🏃 Engineered to run quickly and locally on-device – ensuring reliability, even without the internet. Think up to 1.5x faster response times on mobile! Preview Gemma 3n now on Google AI Studio. → goo.gle/4jrSOZq

Omar Sanseviero (@osanseviero) 's Twitter Profile Photo

The Gemma team keeps shipping. In 6 months:

- PaliGemma 2
- PaliGemma 2 Mix
- Gemma 3
- ShieldGemma 2
- TxGemma
- Gemma 3 QAT
- Gemma 3n Preview
- MedGemma Early
- DolphinGemma
- SignGemma

And so much more to come! 🚀

Prateek Jain (@jainprateek_) 's Twitter Profile Photo

Introducing Matryoshka Teaching Assistant aka matTA framework! This allows learning a "Matformed TA-student" pair with distillation from a much more relatable TA, and provides elasticity/adaptivity/flexibility of Matryoshka models. Key benefits: a. more accurate, servable

Prateek Jain (@jainprateek_) 's Twitter Profile Photo

We are hiring Research Scientists for our Machine Learning and Optimization team at Google DeepMind Bangalore. If you're passionate about cutting-edge AI research and building efficient, elastic, customized, and safe LLMs, we'd love to hear from you. We are looking for

Manish Gupta (@manishguptamg1) 's Twitter Profile Photo

For strong researchers, it doesn't get any better. You get to work with one of the best ML research teams in the world and contribute to Google's frontier models impacting billions of people!

Shagun Sodhani (@shagunsodhani) 's Twitter Profile Photo

I recently left FAIR and joined Google DeepMind. I'm deeply grateful to the FAIR leadership, mentors, collaborators & friends who believed in me, encouraged me to aim higher & celebrated my wins. Grateful for the journey so far & excited to help advance AI research at DeepMind.