
Aston Zhang
@astonzhangaz
Long context lead of Llama. Lead author of d2l.ai.
ID: 1074411845199421440
http://astonzhang.com
Joined: 16-12-2018 21:12:00
205 Tweets
9.9K Followers
92 Following

Thanks AI at Meta for having me on the Llama for Developers podcast! Tokenizers play a crucial role in LLMs, impacting data handling, pre-training, post-training, and inference: 🔹 With a larger vocabulary, domain-specific words are more likely to be single tokens, preserving




🚀 New paper from our Llama team AI at Meta! We discuss "cross capabilities" and "Law of the Weakest Link" of large language models (LLMs): 🔹 Cross capabilities: the intersection of multiple distinct capabilities across different types of expertise necessary to address complex,


🚀 Exciting internship opportunity! Join the Llama team AI at Meta and help redefine what's possible with large language models—from pre-training to post-training. Be part of our 2025 research internship and help shape the future of LLMs. Feel free to email or DM me 📩 Learn

