GPU MODE (@gpu_mode) 's Twitter Profile
GPU MODE

@gpu_mode

Your favorite GPU community

ID: 1830065876550500354

linkhttp://discord.gg/gpumode calendar_today01-09-2024 02:11:29

78 Tweet

2,2K Followers

6 Following

Anush Elangovan (@anushelangovan) 's Twitter Profile Photo

Woah - We are now down to 183.429μs for FP8 GEMM on MI300X (We started at 890.743μs) on the leaderboard !!! Lets Go!!! gpumode.com/leaderboard/399

Alex Zhang (@a1zhang) 's Twitter Profile Photo

📣 Problem 2, the fused Mixture-of-Experts kernel 🍿 for MI300s, is now OPEN for the AMD x GPU MODE $100k competition! Go compete now for huge cash prizes -- registration ends SOON! Good luck everyone!

📣 Problem 2, the fused Mixture-of-Experts kernel 🍿 for MI300s, is now OPEN for the <a href="/AMD/">AMD</a> x <a href="/GPU_MODE/">GPU MODE</a>  $100k competition!

Go compete now for huge cash prizes -- registration ends SOON! Good luck everyone!
NVIDIA AI Developer (@nvidiaaidev) 's Twitter Profile Photo

ICYMI GPU MODE at GTC brought together leading voices in machine learning systems for an evening of sharp talks and fresh perspectives. 🎥 youtu.be/mdDVkBeFy9A From KernelBench to Thunderkittens, see what’s next in ML systems with speakers from Stanford, NVIDIA, PyTorch,

GPU MODE (@gpu_mode) 's Twitter Profile Photo

This is has been an amazing collaboration between teams at Stanford University AI at Meta GPU MODE PyTorch If you're interested in making GPU programming dramatically more accessible then join us! There's a lot more stuff we're cooking! gpu-mode.github.io/popcorn/

Junda Chen (@junda_chen_) 's Twitter Profile Photo

I will be giving a talk in GPU MODE tomorrow (May 24 12pm PST) about Disaggregated Inference. Come if you're interested! youtube.com/live/uc6TnOszz…

Perry Zhang (@py_z001) 's Twitter Profile Photo

I will be giving a talk in GPU MODE tomorrow (May 31 12pm PST) about FastVideo/STA/VSA. Come if you're interested! youtube.com/watch?v=x44iGp…

I will be giving a talk in <a href="/GPU_MODE/">GPU MODE</a> tomorrow (May 31 12pm PST) about FastVideo/STA/VSA. 
Come if you're interested!

youtube.com/watch?v=x44iGp…
Tim Dettmers (@tim_dettmers) 's Twitter Profile Photo

This is a write-up of the 2nd place entry in the FP8 matmul kernel competition for AMD GPUs. Very insightful: github.com/seb-v/amd_chal…

j4orz (@j4orz) 's Twitter Profile Photo

the follow up to Andrej Karpathy neural networks: zero to hero course is being built. singularity systems: zero to hero builds pytorch1/2 clones from scratch, training gpt2. looking for hardcore hackers to join the core team. come join the work group in the GPU MODE discord.

the follow up to <a href="/karpathy/">Andrej Karpathy</a> neural networks: zero to hero course is being built. singularity systems: zero to hero builds pytorch1/2 clones from scratch, training gpt2.

looking for hardcore hackers to join the core team. come join the work group in  the <a href="/GPU_MODE/">GPU MODE</a> discord.
Alex Zhang (@a1zhang) 's Twitter Profile Photo

kind of a surreal moment being on stage with Lisa Su as she announces & thanks us for the competition we built the past year building w/ Matej Sirovatka Mark Saroufim, Ben, & Erik (all in our free time :p) on GPU MODE has been genuinely incredible, can’t thank you guys enough ❤️

kind of a surreal moment being on stage with Lisa Su as she announces &amp; thanks us for the competition we built

the past year building w/ <a href="/m_sirovatka/">Matej Sirovatka</a> <a href="/marksaroufim/">Mark Saroufim</a>, Ben, &amp; Erik (all in our free time :p) on <a href="/GPU_MODE/">GPU MODE</a> has been genuinely incredible, can’t thank you guys enough ❤️
Alex Zhang (@a1zhang) 's Twitter Profile Photo

Announcing a new GPU MODE kernel writing competition: our first featuring both NVIDIA and AMD hardware! The first problem will be the Triangle Multiplication operator essential to the AlphaFold 🧬 models! It's a particularly tricky problem with no good public implementation!

Announcing a new <a href="/GPU_MODE/">GPU MODE</a> kernel writing competition: our first featuring both NVIDIA and AMD hardware!

The first problem will be the Triangle Multiplication operator essential to the AlphaFold 🧬 models! It's a particularly tricky problem with no good public implementation!
GPU MODE (@gpu_mode) 's Twitter Profile Photo

If you want to hack on your own GPU schedules instead of being stuck with whatever the compiler gives you then join us in 30 min!

Matej Sirovatka (@m_sirovatka) 's Twitter Profile Photo

The biggest dataset of human written GPU Code all open-source? 👀 YES Please! We at GPU MODE have released around 40k 🚀 human written code samples spanning Triton, Hip and PyTorch and it's all open on the Hugging Face Hub. Train the new GPT to make GPTs faster ⚡️ Link below ⬇️