Dan Zhang (@dzhang50) 's Twitter Profile
Dan Zhang

@dzhang50

Gemini Model+HW Codesign @ Google DeepMind
| Computer Architecture PhD @ UT Austin🤘
| Opinions stated here are my own.

ID: 2909643908

calendar_today25-11-2014 04:47:49

820 Tweet

2,2K Takipçi

877 Takip Edilen

Albert Gu (@_albertgu) 's Twitter Profile Photo

I converted one of my favorite talks I've given over the past year into a blog post. "On the Tradeoffs of SSMs and Transformers" (or: tokens are bullshit) In a few days, we'll release what I believe is the next major advance for architectures.

I converted one of my favorite talks I've given over the past year into a blog post.

"On the Tradeoffs of SSMs and Transformers"
(or: tokens are bullshit)

In a few days, we'll release what I believe is the next major advance for architectures.
Sukjun (June) Hwang (@sukjun_hwang) 's Twitter Profile Photo

Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data

Albert Gu (@_albertgu) 's Twitter Profile Photo

Tokenization is just a special case of "chunking" - building low-level data into high-level abstractions - which is in turn fundamental to intelligence. Our new architecture, which enables hierarchical *dynamic chunking*, is not only tokenizer-free, but simply scales better.

Tokenization is just a special case of "chunking" - building low-level data into high-level abstractions - which is in turn fundamental to intelligence.

Our new architecture, which enables hierarchical *dynamic chunking*, is not only tokenizer-free, but simply scales better.
Thang Luong (@lmthang) 's Twitter Profile Photo

Very excited to share that an advanced version of Gemini Deep Think is the first to have achieved gold-medal level in the International Mathematical Olympiad! 🏆, solving five out of six problems perfectly, as verified by the IMO organizers! It’s been a wild run to lead this

Very excited to share that an advanced version of Gemini Deep Think is the first to have achieved gold-medal level in the International Mathematical Olympiad! 🏆, solving five out of six problems perfectly, as verified by the IMO organizers! It’s been a wild run to lead this
Quoc Le (@quocleix) 's Twitter Profile Photo

Excited to share that a scaled up version of Gemini DeepThink achieves gold-medal standard at the International Mathematical Olympiad. This result is official, and certified by the IMO organizers. Watch out this space, more to come soon! deepmind.google/discover/blog/…

Yi Tay (@yitayml) 's Twitter Profile Photo

Our IMO gold model is not just an "experimental reasoning" model. It is way more general purpose than anyone would have expected. This general deep think model is going to be shipped so stay tuned! 🔥

Nithya Attaluri (@attaluri_nithya) 's Twitter Profile Photo

Very excited to announce that I’ll be co-organizing a NeurIPS Conference workshop on LLM evals! Identifying shortcomings in model capabilities in a robust, scientific way is a critical part of model development. Looking forward to discussing ideas and hearing from some eval experts!

Mimee // privacy ml thesising (@mimeexu) 's Twitter Profile Photo

Exciting news! 📣 The call for papers for ML for Systems NeurIPS Conference is now open. Submission deadline: Aug 22 AoE Help spread the word! P.S. Agents and LLM systems are systems, too! mlforsystems.org/call_for_paper… #MLforSystems #MachineLearning #LLMs #Agents #CodeModels #neurips2025

Exciting news! 📣 The call for papers for ML for Systems <a href="/NeurIPSConf/">NeurIPS Conference</a> is now open.

Submission deadline: Aug 22 AoE

Help spread the word!
P.S. Agents and LLM systems are systems, too!

mlforsystems.org/call_for_paper…
#MLforSystems #MachineLearning #LLMs #Agents #CodeModels #neurips2025
Anne Ouyang (@anneouyang) 's Twitter Profile Photo

KernelBench v0.1 is out, featuring: - A guideline on analyzing the validity of results and ruling out physically impossible performance claims. - Support for randomized testing beyond normal distributions. - Fixed problem sizes and improved numerics

KernelBench v0.1 is out, featuring:
- A guideline on analyzing the validity of results and ruling out physically impossible performance claims.
- Support for randomized testing beyond normal distributions.
- Fixed problem sizes and improved numerics
Quoc Le (@quocleix) 's Twitter Profile Photo

Following its IMO gold-level win, Google DeepMind is sharing Gemini Deep Think with mathematicians for feedback. Excited to see what they discover! 🧠 Plus, an updated Gemini 2.5 Deep Think is now rolling out for Google AI Ultra subscribers. Learn more: bit.ly/3IWcWq0