Andrej Karpathy

@karpathy

πŸ§‘β€πŸ³. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets πŸ§ πŸ€–πŸ’₯

21-04-2009 06:49:15

8.7K Tweets

979.5K Followers

905 Following


Nice read on the rarely-discussed-in-the-open difficulties of training LLMs. Mature companies have dedicated teams maintaining the clusters. At scale, clusters leave the realm of engineering and become a lot more biological, hence e.g. teams dedicated to 'hardware health'.

It
