Ruibo Liu's (@ruiboliu) Twitter Profile
Ruibo Liu

@ruiboliu

RS @GoogleDeepMind. AI Research with Humans in Mind.

ID: 3749425934

Link: https://ruibo-liu.com/ · Joined: 01-10-2015 15:57:27

558 Tweets

2.2K Followers

1.1K Following

vitrupo's (@vitrupo) Twitter Profile Photo

Geoffrey Hinton says the more we understand how AI and the brain actually work, the less human thinking looks like logic. We're not reasoning machines, he says. We're analogy machines. We think by resonance, not deduction. “We're much less rational than we thought.”

Ruibo Liu's (@ruiboliu) Twitter Profile Photo

Very nice work! Congrats, Qwen team! 🥳 And glad to see the comparison includes Gemini 2.5 Pro, the "best model in the world" in my opinion (yes, even after seeing some new models released these days). More to come... 😊

Ruibo Liu's (@ruiboliu) Twitter Profile Photo

This is the coolest demo I have seen recently! Now you can do: S_t -> A_t -> S_{t+1}, R_t ... infinitely! The real world model! 😇
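The loop the tweet describes is the standard world-model rollout: a learned model maps (state, action) to (next state, reward), so you can iterate S_t -> A_t -> S_{t+1}, R_t indefinitely. A minimal sketch, with `WorldModel` and `Policy` as hypothetical toy stand-ins (not any real API):

```python
import random

class WorldModel:
    """Toy stand-in: deterministic transition, simple reward."""
    def step(self, state: int, action: int) -> tuple[int, float]:
        next_state = state + action                    # S_t, A_t -> S_{t+1}
        reward = 1.0 if next_state % 2 == 0 else 0.0   # R_t
        return next_state, reward

class Policy:
    """Toy stand-in: picks a random action A_t."""
    def act(self, state: int) -> int:
        return random.choice([-1, 1])

def rollout(model: WorldModel, policy: Policy, s0: int, horizon: int):
    """Iterate S_t -> A_t -> S_{t+1}, R_t for `horizon` steps."""
    state, trajectory = s0, []
    for _ in range(horizon):
        action = policy.act(state)                 # A_t
        state, reward = model.step(state, action)  # S_{t+1}, R_t
        trajectory.append((state, reward))
    return trajectory

traj = rollout(WorldModel(), Policy(), s0=0, horizon=5)
print(len(traj))  # 5 (state, reward) pairs
```

The point of the demo is that nothing outside the model is needed once `step` is learned: the rollout can continue for as many steps as you like.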

Thang Luong's (@lmthang) Twitter Profile Photo

Very excited to share that an advanced version of Gemini Deep Think is the first to have achieved gold-medal level in the International Mathematical Olympiad! 🏆, solving five out of six problems perfectly, as verified by the IMO organizers! It’s been a wild run to lead this
Fred Zhang's (@fredzhang0) Twitter Profile Photo

This is the most scaling-pilled project I've ever been part of, and the team really cooked. TL;DR: With RL and inference scaling, Gemini perfectly solved 5 out of 6 problems, reaching a gold medal in IMO '25, all within the 4.5-hour time constraint.

Ruibo Liu's (@ruiboliu) Twitter Profile Photo

If the grading is correct, that means we don't actually need a specially trained model or vertical data: the IMO gold-level model is already in your hands. Impressive work from the community!

lmarena.ai's (formerly lmsys.org) (@lmarena_ai) Twitter Profile Photo

🚨 Open Model Leaderboard Update

New open models entered the Text Arena, and the rankings by provider have reshuffled for August.

- Qwen-3-235b-a22b-instruct from Qwen takes the crown 🏆
- GLM-4.5 from Z.ai and gpt-oss-120b by @openAI debut in the top 10!

All the
Ruibo Liu's (@ruiboliu) Twitter Profile Photo

There seems to be a strange desperation in the air with AI companies trying to buy web browsers. It’s not just a land grab for distribution; it’s a tacit admission that even with a trillion-parameter model, the user experience is still mediated by a 30-year-old paradigm: the

Jeff Dean's (@jeffdean) Twitter Profile Photo

VaultGemma is a release of an open model trained from scratch with differential privacy. The blog post below and the full tech report linked from it have some nice analyses presenting a scaling law for differentially private language models: Blog:
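For context on what "trained with differential privacy" means in practice: the usual mechanism is DP-SGD, which clips each per-example gradient to a norm bound C and adds Gaussian noise scaled by sigma * C before averaging. The sketch below is a generic illustration of that step, not VaultGemma's actual training code; the names, clip norm, and noise multiplier are illustrative.

```python
import math
import random

def clip(grad: list[float], C: float) -> list[float]:
    """Clip a single per-example gradient to L2 norm at most C."""
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, C / norm) if norm > 0 else 1.0
    return [g * scale for g in grad]

def dp_sgd_step(per_example_grads: list[list[float]],
                C: float = 1.0, sigma: float = 1.0) -> list[float]:
    """One DP-SGD gradient estimate: clip, sum, add noise, average."""
    clipped = [clip(g, C) for g in per_example_grads]
    summed = [sum(col) for col in zip(*clipped)]
    noisy = [s + random.gauss(0.0, sigma * C) for s in summed]
    n = len(per_example_grads)
    return [v / n for v in noisy]

# With sigma=0 the step reduces to plain clipping + averaging,
# which makes the clipping effect easy to see.
grads = [[3.0, 4.0], [0.6, 0.8]]  # L2 norms 5.0 and 1.0
step = dp_sgd_step(grads, C=1.0, sigma=0.0)
print(step)  # both gradients clipped to unit norm, then averaged
```

The privacy cost of training then depends on sigma, the sampling rate, and the number of steps, which is what makes scaling-law analysis for DP language models interesting: noise and compute trade off against each other.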