
elie
@eliebakouch
Training LLMs at @huggingface | hf.co/science
ID: 1745892418539417600
https://huggingface.co/eliebak 12-01-2024 19:36:21
1.1K Tweets
3.3K Followers
2.2K Following

xAI got Great Greg, so I believe in their MuP, and more generally in their optimization and spectral norm control recipes. Definitely worth reading in more detail! Next, I hope to see thinky's OSS and understand what's in Jeremy Bernstein's head now! However, I am generally not a big fan of




14 Days of Distributed, Day 9! Meet Wanchao Liang, ex-PyTorch and currently at Thinking Machines. Wanchao developed the TorchTitan framework, a PyTorch library that aims to make multi-dimensional parallelism easy through the DTensor interface. He will be introducing us










