profile-img
Zeyuan Allen-Zhu

@ZeyuanAllenZhu

physics of language models @ Meta / FAIR

IOI - USACO - MCM - ACM/ICPC - Codejam
Tsinghua - MIT - Princeton/IAS - MSR - FAIR

calendar_today23-04-2010 16:59:01

207 Tweets

8,1K Followers

273 Following

Zeyuan Allen-Zhu(@ZeyuanAllenZhu) 's Twitter Profile Photo

Our 12 scaling laws (for LLM knowledge capacity) are out: arxiv.org/abs/2404.05405. Took me 4mos to submit 50,000 jobs; took Meta 1mo for legal review; FAIR sponsored 4,200,000 GPU hrs. Hope this is a new direction to study scaling laws + help practitioners make informed decisions

Our 12 scaling laws (for LLM knowledge capacity) are out: arxiv.org/abs/2404.05405. Took me 4mos to submit 50,000 jobs; took Meta 1mo for legal review; FAIR sponsored 4,200,000 GPU hrs. Hope this is a new direction to study scaling laws + help practitioners make informed decisions
account_circle