Zeyuan Allen-Zhu, Sc.D. (@zeyuanallenzhu) Twitter Tweets • TwiCopy

Like Surya Ganguli, I hope NSF pure math funding goes up (as does Surya Ganguli), but it should be noted that a lot of math gets funded in many other NSF divisions too.

22 likes, 0 replies, 1 repost

Elad Hazan

@hazanprinceton

4 months ago

Google link to apply: boards.greenhouse.io/deepmind/jobs/…

38 likes, 0 replies, 5 reposts

Zeyuan Allen-Zhu, Sc.D.

@zeyuanallenzhu

4 months ago

On Mar 9, they rejected my access to Llama 2 models on huggingface, and there's no button to re-apply. Who should I talk to to fix this? @huggingface @AIatMeta

94 likes, 13 replies, 0 reposts

Hossein Mobahi

@thegradient

4 months ago

It is time! Applications for the global Google PhD Fellowship Program are NOW open.

56 likes, 1 reply, 4 reposts

Kamalika Chaudhuri

@kamalikac

4 months ago

Papers I talked about: (1) One-model deja-vu memorization: arxiv.org/abs/2504.05651 (2) AgentDAM "data minimization" benchmark: arxiv.org/abs/2503.09780

31 likes, 0 replies, 5 reposts

(9/8) People suggested I study Primer (arxiv.org/abs/2109.08668). Their multi-dconv-head attention is what I call Canon-B (no-res)—and we found issues with it. Yet, Primer is underrated with just 180 citations. They found meaningful signals from noisy real-life exp that I couldn't

155 likes, 3 replies, 20 reposts

Zeyuan Allen-Zhu, Sc.D.

@zeyuanallenzhu

3 months ago

This person seems stressed and is spreading false rumors on our project. To clarify: this PDF is from our peer-reviewed spotlight paper accepted at ICLR 2025. We have 4 papers accepted at ICLR'25 (Parts 2.1, 2.2, 3.2, 3.3). I suggest you find healthier outlets to cope with stress

371 likes, 11 replies, 7 reposts

Zeyuan Allen-Zhu, Sc.D.

@zeyuanallenzhu

3 months ago

Please stop spreading false rumors. This full arxiv paper underwent peer review. After 30 minutes of discussion, you’ve made no effort to verify the truth or retract the false claim despite my repeated requests. If you retract, I treat this as a misunderstanding, but you haven’t.

160 likes, 4 replies, 3 reposts

Zeyuan Allen-Zhu, Sc.D.

@zeyuanallenzhu

3 months ago

I've wasted too much energy on X, naively thinking any of it mattered. Now I'm truly disillusioned—but finally awake. I'm shedding distractions, returning fully to research and meaningful work. No more replies, only occasional updates. Thanks to the few who truly supported me.

586 likes, 21 replies, 5 reposts

Zeyuan Allen-Zhu, Sc.D.

@zeyuanallenzhu

a month ago

No matter how AI evolves overnight—tech, career, how it may impact me—I remain committed to using "physics of language models" approach to predict next-gen AI. Due to my limited GPU access at Meta, Part 4.1 (+new 4.2) are still in progress, but results on Canon layers are shining

815 likes, 22 replies, 61 reposts

Zeyuan Allen-Zhu, Sc.D.

@zeyuanallenzhu

a month ago

Facebook AI Research (FAIR) is a small, prestigious lab in Meta. We don't train large models like GenAI or MSL, so it's natural that we have limited GPUs. GenAI or MSL's success or failure, past or future, doesn't reflect the work of FAIR. It is important to make this distinction

831 likes, 16 replies, 59 reposts