Mark Saroufim (@marksaroufim)'s Twitter Profile
Mark Saroufim

@marksaroufim

@pytorch dev - nowadays in @GPU_MODE

ID: 35473191

Joined: 26-04-2009 14:20:43

1.1K Tweets

11.11K Followers

812 Following

Mark Saroufim (@marksaroufim)

The more I look at this the more I feel like we should have a GPU tier list that ranks both vendors and neoclouds on how good they are at running PyTorch and CUDA

Mark Saroufim (@marksaroufim)

Registration ends tomorrow for the grand prize, but you can still compete for glory until May 27. This is probably the hardest problem we've designed so far, so glhf!

Mark Saroufim (@marksaroufim)

At this point submitting your company reference kernels to the GPU MODE kernel leaderboard is the easiest way to accelerate them. All submissions will be made public at the end of every competition so please use the fast code and hire the cracked engineers who produced it

Mark Saroufim (@marksaroufim)

I've seen surprisingly few people complaining about 5090 PyTorch and Triton performance. Did y'all get past the scalpers? Should we make things go super fast?

Mark Saroufim (@marksaroufim)

At the Mojo hackathon today I went over how PyTorch is making it easier to spin up new backends for all the new languages and hardware we're seeing this year, and a WIP backend we've been collaborating on with the Mojo team: docs.google.com/presentation/d…

Chris Lattner (@clattner_llvm)

Thank you for the collaboration and the talk Mark. I think 🔥+PyTorch are a beautiful combination - a new flame for the torch! It was great to get to spend the day with you yesterday

Daniel Vega-Myhre (@vega_myhre)

Just wrote an illustrated deep-dive into overlapping the compute and comms in TP+SP using Async TP. My eyeballs hurt now so hopefully somebody finds it useful :) danielvegamyhre.github.io/ml/performance…

Alex Zhang (@a1zhang)

Work w/ Griffiths Computational Cognitive Science Lab, Karthik Narasimhan, and Ofir Press. Paper: arxiv.org/abs/2505.18134 Code: github.com/alexzhang13/vi… Website: vgbench.com Discord: discord.gg/W89VqYhQcy Our platform is completely open source and super easy to modify / plug into!

Charles 🎉 Frye (@charles_irl)

We've run thousands of LLM inference serving benchmarks at Modal. We're releasing the results so you don't have to. We're releasing the code so that you can. Introducing: The LLM Engineer's Almanac. Just in time for the AI Engineer World's Fair.