Sylvain Gugger (@guggersylvain) Twitter Tweets • TwiCopy

Sylvain Gugger

@guggersylvain

+ Follow

Machine Learning at Jane Street. Previously at @huggingface and @fastdotai Co-author of github.com/fastai/fastbook He/him

ID: 976897777589456897

linkhttp://sgugger.github.io calendar_today22-03-2018 19:05:54

1,1K Tweet

25,25K Takipçi

350 Takip Edilen

Gate.io

@gate_io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

For too long, users have lived under the software lottery tyranny of fused attention implementations. No longer. Introducing FlexAttention, a new PyTorch API allowing for many attention variants to enjoy fused kernels in a few lines of PyTorch. pytorch.org/blog/flexatten… 1/10

thumb_up_off_alt1,1K

chat_bubble_outline24

repeat267

shareShare

Charlie Marsh

@charliermarsh

10 months ago

Today, we're shipping a series of features that move uv beyond a pip alternative, and into an end-to-end solution for managing Python projects, command-line tools, single-file scripts, and even Python itself. A single, unified tool. Like Cargo, for Python. It's very fast.

thumb_up_off_alt3,3K

chat_bubble_outline107

repeat464

shareShare

Marc Sun

@_marcsun

9 months ago

Happy to share that we are (pre)releasing Accelerate V1.0.0! 🔥 It's been an incredible journey since I joined the accelerate team 1.5 years ago, and there's plenty of exciting updates on the way. Learn more about this milestone here: huggingface.co/blog/accelerat…

thumb_up_off_alt58

chat_bubble_outline1

repeat9

shareShare

Sylvain Gugger

@guggersylvain

9 months ago

I’m at the #PyTorchConf today and tomorrow, come say hi at the Jane Street booth!

thumb_up_off_alt503

chat_bubble_outline7

repeat7

shareShare

Yaron (Ron) Minsky

@yminsky

8 months ago

A new Signals and Threads! This one is an interview with the great Sylvain Gugger, all about making GPUs go brrr... signalsandthreads.com/the-uncertain-…

thumb_up_off_alt24

chat_bubble_outline0

repeat3

shareShare

Sylvain Gugger

@guggersylvain

8 months ago

I had a lot of fun talking with Yaron (Ron) Minsky about GPU performance (go brrr!) and the common pitfalls to avoid. signalsandthreads.com/the-uncertain-…

thumb_up_off_alt50

chat_bubble_outline0

repeat6

shareShare

PyTorch

@pytorch

8 months ago

PyTorch 2.5 is here 🔥 We are excited to announce the release of #PyTorch 2.5, featuring a new CuDNN backend for SDPA, regional compilation of torch.compile, & TorchInductor CPP backend performance speedup Read more in our blog: hubs.la/Q02TRs9p0

thumb_up_off_alt680

chat_bubble_outline9

repeat154

shareShare

Horace He

@chhillee

7 months ago

Jane Street tech talks have always been super awesome. So I'm quite excited to be visiting Jane Street on Monday to give a talk on building ML systems for a trillion trillion FLOPs :) I'll talk about a bunch of fun things, including cool GPU optimizations, how I think about

thumb_up_off_alt830

chat_bubble_outline9

repeat31

shareShare

Sylvain Gugger

@guggersylvain

5 months ago

We had an awesome talk at Jane Street from the amazing Horace He on scaling ML systems to and I just realized the recording is now online: youtu.be/139UPjoq7Kw?si…

thumb_up_off_alt452

chat_bubble_outline4

repeat42

shareShare

Morgan McGuire

@morgymcg

5 months ago

TIL Jane Street have an eng podcast Most recent episode is with Sylvain Gugger on training & ML infra

thumb_up_off_alt18

chat_bubble_outline1

repeat1

shareShare

Stas Bekman

@stasbekman

5 months ago

This is huge, huge, huge - DeepSpeed is now a community-owned project as it's now a part of the Linux Foundation. Committer access should be possible now. Thank you, Microsoft Research for breathing life into this very important to the ML community scalability framework and now

thumb_up_off_alt124

chat_bubble_outline1

repeat15

shareShare

Nouamane Tazi

@nouamanetazi

4 months ago

🚀 Excited to release *THE* Ultra-Scale Playbook - a comprehensive guide on training LLMs from 1 to 1000s of GPUs!

thumb_up_off_alt1,1K

chat_bubble_outline29

repeat233

shareShare

GPU MODE

@gpu_mode

4 months ago

Write a fast kernel and run it on Discord. See how you compare against the best! If you're familiar with Leetcode, Kaggle or Codeforces then this should feel right at home

thumb_up_off_alt419

chat_bubble_outline8

repeat39

shareShare

Benjamin F Spector

@bfspector

4 months ago

(1/7) Inspired by DeepSeek's FlashMLA, we're releasing ThunderMLA—a fused megakernel optimized for variable-prompt decoding! ⚡️🐱ThunderMLA is up to 35% faster than FlashMLA and just 400 LoC. Blog: bit.ly/4kubAAK With Aaryan Singhal, Dan Fu, and @hazyresearch!

thumb_up_off_alt370

chat_bubble_outline7

repeat70

shareShare

João Gante

@joao_gante

3 months ago

Speculative Decoding before: limited choices, the draft model must have the same tokenizer 😬 Speculative Decoding now: unlimited choices, ANY draft model can be used and better speedup opportunities 😎 The folks at Intel have been cooking, and Speculative Decoding (with

thumb_up_off_alt55

chat_bubble_outline2

repeat8

shareShare

Mark Saroufim

@marksaroufim

3 months ago

x.com/i/article/1904…

thumb_up_off_alt394

chat_bubble_outline9

repeat67

shareShare

Vijay

@__tensorcore__

a month ago

🚨🔥 CUTLASS 4.0 is released 🔥🚨 pip install nvidia-cutlass-dsl 4.0 marks a major shift for CUTLASS: towards native GPU programming in Python slidehelloworld.png docs.nvidia.com/cutlass/media/…

thumb_up_off_alt407

chat_bubble_outline15

repeat81

shareShare