Armand Joulin (@armandjoulin)'s Twitter Profile
Armand Joulin

@armandjoulin

Principal researcher, @googledeepmind. Ex-director of EMEA at @metaai. Led a few open projects: Llama, fastText, DINO, and now Gemma.

ID: 22243503

Joined: 28-02-2009 09:38:34

222 Tweets

6.6K Followers

385 Following

Armand Joulin (@armandjoulin)'s Twitter Profile Photo

PaliGemma and Gemini fueled the new vision modality of Gemma 3. It is so great to work at the crossroads of so many amazing teams!

Armand Joulin (@armandjoulin)'s Twitter Profile Photo

Putting open models out there is hard, and releases often come with implementation bugs. For Gemma 3, a dedicated release team worked for weeks to avoid these issues. So let's wait before judging Llama 4's performance....

Qwen (@alibaba_qwen)'s Twitter Profile Photo

Introducing Qwen3! We release and open-weight Qwen3, our latest large language models, including 2 MoE models and 6 dense models, ranging from 0.6B to 235B. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general

Armand Joulin (@armandjoulin)'s Twitter Profile Photo

We love LMSYS and its impact on our field, and that's why we need critical work, like Cohere's paper, to keep improving it. Equally, it is important to clarify that some numbers in Cohere's work are inaccurate; e.g., we sent only one Gemma 3 model to LMSYS before release ("zizou-10").

Sara Hooker (@sarahookr)'s Twitter Profile Photo

It is critical for scientific integrity that we trust our measure of progress. The lmarena.ai has become the go-to evaluation for AI progress. Our release today demonstrates the difficulty in maintaining fair evaluations on lmarena.ai, despite best intentions.
