
Yaroslav Bulatov
@yaroslavvb
Together.AI (ex-Google Brain, OpenAI, Meta)
New Blog: medium.com/@yaroslavvb
Old Blog: yaroslavvb.blogspot.com
ID: 258031029
http://medium.com/@yaroslavvb 26-02-2011 20:22:57
1,1K Tweet
7,7K Followers
873 Following

Watching Zhuang Liu's - "Transformers without Normalization", this slide is a reminder how our optimizer and architecture choices are coupled
