Mila Data
@mila_data
Tech Enthusiast | Game Dev🚀🖥️🎮🖖
youtube.com/@mila_data?si=…
ID: 1433522814800187393
02-09-2021 20:14:00
1,1K Tweet
88 Followers
2,2K Following
The Hugging Face agents course is finally out! This first unit of the course sets you up with all the fundamentals to become a pro in agents. - What's an AI Agent? - What are LLMs? - Understanding AI Agents through the Thought-Action-Observation Cycle - Thought, Internal
lecture 13: scaling transformer training (an overview of Hugging Face ultrascale playbook) 1. intro: huge transformers need distributed training (model/activations > gpu mem). 2. bottleneck & fix: activation memory during backprop often limits scale. activation recomputation