Dan Zhang (@dzhang50) Twitter Tweets • TwiCopy

Albert Gu

5 months ago

I converted one of my favorite talks I've given over the past year into a blog post. "On the Tradeoffs of SSMs and Transformers" (or: tokens are bullshit) In a few days, we'll release what I believe is the next major advance for architectures.

thumb_up_off_alt516

chat_bubble_outline19

repeat72

shareShare

Sukjun (June) Hwang

@sukjun_hwang

4 months ago

Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data

thumb_up_off_alt2,2K

chat_bubble_outline58

repeat355

shareShare

Albert Gu

@_albertgu

4 months ago

Tokenization is just a special case of "chunking" - building low-level data into high-level abstractions - which is in turn fundamental to intelligence. Our new architecture, which enables hierarchical *dynamic chunking*, is not only tokenizer-free, but simply scales better.

thumb_up_off_alt1,1K

chat_bubble_outline58

repeat177

shareShare

Dan Zhang

@dzhang50

4 months ago

not a coincidence that Grok's waifu mode looks like Misa from Death Note 😆

thumb_up_off_alt8

chat_bubble_outline2

repeat0

shareShare

Thang Luong

@lmthang

4 months ago

Very excited to share that an advanced version of Gemini Deep Think is the first to have achieved gold-medal level in the International Mathematical Olympiad! 🏆, solving five out of six problems perfectly, as verified by the IMO organizers! It’s been a wild run to lead this

thumb_up_off_alt1,1K

chat_bubble_outline75

repeat224

shareShare

Quoc Le

@quocleix

4 months ago

Excited to share that a scaled up version of Gemini DeepThink achieves gold-medal standard at the International Mathematical Olympiad. This result is official, and certified by the IMO organizers. Watch out this space, more to come soon! deepmind.google/discover/blog/…

thumb_up_off_alt709

chat_bubble_outline9

repeat51

shareShare

Yi Tay

@yitayml

4 months ago

Our IMO gold model is not just an "experimental reasoning" model. It is way more general purpose than anyone would have expected. This general deep think model is going to be shipped so stay tuned! 🔥

thumb_up_off_alt1,1K

chat_bubble_outline52

repeat93

shareShare

Dan Zhang

@dzhang50

4 months ago

lol

thumb_up_off_alt3,3K

chat_bubble_outline67

repeat142

shareShare

Nithya Attaluri

@attaluri_nithya

4 months ago

Very excited to announce that I’ll be co-organizing a NeurIPS Conference workshop on LLM evals! Identifying shortcomings in model capabilities in a robust, scientific way is a critical part of model development. Looking forward to discussing ideas and hearing from some eval experts!

thumb_up_off_alt66

chat_bubble_outline2

repeat11

shareShare

Mimee // privacy ml thesising

@mimeexu

4 months ago

Exciting news! 📣 The call for papers for ML for Systems NeurIPS Conference is now open. Submission deadline: Aug 22 AoE Help spread the word! P.S. Agents and LLM systems are systems, too! mlforsystems.org/call_for_paper… #MLforSystems #MachineLearning #LLMs #Agents #CodeModels #neurips2025

Exciting news! 📣 The call for papers for ML for Systems <a href="/NeurIPSConf/">NeurIPS Conference</a> is now open.

Submission deadline: Aug 22 AoE

Help spread the word!
P.S. Agents and LLM systems are systems, too!

mlforsystems.org/call_for_paper…
#MLforSystems #MachineLearning #LLMs #Agents #CodeModels #neurips2025

thumb_up_off_alt15

chat_bubble_outline2

repeat6

shareShare

Anne Ouyang

@anneouyang

4 months ago

KernelBench v0.1 is out, featuring: - A guideline on analyzing the validity of results and ruling out physically impossible performance claims. - Support for randomized testing beyond normal distributions. - Fixed problem sizes and improved numerics

thumb_up_off_alt186

chat_bubble_outline8

repeat31

shareShare

Quoc Le

@quocleix

4 months ago

Following its IMO gold-level win, Google DeepMind is sharing Gemini Deep Think with mathematicians for feedback. Excited to see what they discover! 🧠 Plus, an updated Gemini 2.5 Deep Think is now rolling out for Google AI Ultra subscribers. Learn more: bit.ly/3IWcWq0

thumb_up_off_alt281

chat_bubble_outline13

repeat18

shareShare