Joe Fioti (@joefioti)'s Twitter Profile
Joe Fioti

@joefioti

it's not possible. it's necessary. luminalai.com joefioti.com

ID: 3330038775

Joined: 16-06-2015 18:51:48

709 Tweets

357 Followers

313 Following

Matthew Gunton (@matthewjgunton)

I just published a blog post on PyTorch Tensors at a low level.
3 Key Learnings:
💿 Strided Tensors store data contiguously in memory, using metadata (shape + stride) to describe access
🤖 Autograd builds dynamic computation graphs to automatically compute gradients—perfect for rapid
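A minimal sketch of the two ideas above in PyTorch; the variable names and the specific operations are illustrative, not taken from the post:

```python
import torch

# Strided tensors: one contiguous buffer, with shape + stride metadata
# describing how indices map into it.
x = torch.arange(12)            # one contiguous buffer: 0..11
m = x.view(3, 4)                # shape (3, 4), strides (4, 1)
print(m.stride())               # (4, 1): step 4 elements per row, 1 per column

# Transposing only swaps the metadata; the underlying storage is untouched.
t = m.t()                       # shape (4, 3), strides (1, 4)
print(t.is_contiguous())        # False: same buffer, different access pattern

# Element (i, j) of a strided tensor lives at flat offset
# i * stride[0] + j * stride[1].
i, j = 2, 1
assert t[i, j] == x[i * t.stride(0) + j * t.stride(1)]

# Autograd: the computation graph is recorded dynamically as ops run,
# then backward() walks it to accumulate gradients.
a = torch.tensor(3.0, requires_grad=True)
y = a * a + 2 * a               # graph built on the fly
y.backward()
print(a.grad)                   # dy/da = 2a + 2 = 8
```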
Joe Fioti (@joefioti)

Luminal can discover flash attention entirely automatically. We've been working towards this north star in our search compiler. Check out the prototype demo below ↓

Joe Fioti (@joefioti)

Since we’ve got a lot of new people following Luminal's progress, I figure we should go over where we are and where we’re going ↓

Joe Fioti (@joefioti)

I can't square this with the other Hazy post about how tensor cores make up 95% of the FLOPs on a GPU. Wouldn't it be massively beneficial?

Or is it because matvecs are so bandwidth-constrained compared to matmuls?

If someone from Hazy follows me or knows someone from Hazy, lmk!
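A back-of-the-envelope roofline sketch of the matvec-vs-matmul question; the function names are illustrative, the GPU figures are rough public specs, and the byte accounting assumes each operand is touched exactly once:

```python
# Arithmetic intensity (FLOPs per byte moved) for fp16 operands.
def matvec_intensity(n, bytes_per_elem=2):
    flops = 2 * n * n                               # n dot products of length n
    bytes_moved = bytes_per_elem * (n * n + 2 * n)  # matrix + input + output
    return flops / bytes_moved

def matmul_intensity(n, bytes_per_elem=2):
    flops = 2 * n ** 3
    bytes_moved = bytes_per_elem * (3 * n * n)      # A, B, C each touched once
    return flops / bytes_moved

n = 4096
print(f"matvec: {matvec_intensity(n):.2f} FLOP/byte")  # ~1: memory bound
print(f"matmul: {matmul_intensity(n):.1f} FLOP/byte")  # ~n/3: compute bound

# An H100-class GPU does roughly ~1000 TFLOP/s of dense fp16 tensor-core math
# on ~3.35 TB/s of HBM, so it needs ~300 FLOP/byte to keep the tensor cores
# fed. A matvec at ~1 FLOP/byte runs at memory bandwidth no matter how fast
# the ALUs are, which would explain why tensor cores don't help matvecs much.
```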