Stuart Sul (@stuart_sul)'s Twitter Profile
Stuart Sul

@stuart_sul

cs @ stanford

ID: 1811402960288751616

Joined: 11-07-2024 14:12:07

0 Tweets

8 Followers

52 Following

Stuart Sul (@stuart_sul):

GPU kernel launches are expensive, so we fused the entire Llama-1B into a single kernel. Very excited to kick off our megakernel framework series with ThunderKittens (Hazy Research). More coming soon!

Andrej Karpathy (@karpathy):

So so so cool. Llama 1B batch one inference in one single CUDA kernel, deleting synchronization boundaries imposed by breaking the computation into a series of kernels called in sequence. The *optimal* orchestration of compute and memory is only achievable in this way.
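
The point both tweets make is that per-kernel launch and synchronization overhead adds up when a model runs as a long sequence of small kernels. The sketch below is only a minimal PyTorch timing illustration of that overhead, not the ThunderKittens megakernel itself; the tensor size, iteration counts, and timing setup are arbitrary assumptions, and it assumes a CUDA-capable GPU.

```python
# Minimal sketch: time 1000 tiny back-to-back kernel launches vs. the same
# arithmetic folded into one launch. Sizes and counts are illustrative only.
import torch

assert torch.cuda.is_available(), "this sketch assumes a CUDA GPU"
x = torch.zeros(1 << 20, device="cuda")

def time_ms(fn, iters=10):
    fn()  # warm-up so lazy initialization is not measured
    torch.cuda.synchronize()
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        fn()
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters

def many_launches():
    # Each "+ 1.0" is a separate kernel the CPU must dispatch and the GPU must sync.
    y = x
    for _ in range(1000):
        y = y + 1.0
    return y

def one_launch():
    # Same arithmetic result, a single kernel launch.
    return x + 1000.0

print(f"1000 small launches: {time_ms(many_launches):8.3f} ms")
print(f"   one fused launch: {time_ms(one_launch):8.3f} ms")
```

A megakernel takes this to the extreme: the whole forward pass lives inside one kernel, so there are no launch gaps or inter-kernel synchronization boundaries left to pay for.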

Jordan Juravsky (@jordanjuravsky):

Happy Throughput Thursday! We’re excited to release Tokasaurus: an LLM inference engine designed from the ground up for high-throughput workloads with large and small models.

(Joint work with Ayush Chakravarthy, Ryan Ehrlich, Sabri Eyuboglu, Bradley Brown, Joseph Shetaye,

Sabri Eyuboglu (@eyuboglusabri):

When we put lots of text (e.g. a code repo) into LLM context, cost soars because of the KV cache’s size.

What if we trained a smaller KV cache for our documents offline? Using a test-time training recipe we call self-study, we find that this can reduce cache memory by 39x on average.
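
As a rough illustration of the idea (not the authors' self-study recipe, which trains against a real LLM on data about the corpus), the toy sketch below learns a small set of key/value slots whose attention output approximates that of a much larger cache. All dimensions, the single-head setup, and the training loop are assumptions made for illustration.

```python
# Toy sketch: distill a large KV cache (n entries) into a small learnable one
# (m entries) by matching attention outputs on random queries. All shapes,
# sizes, and the single-head setup are illustrative assumptions.
import torch

torch.manual_seed(0)
d, n, m = 64, 2048, 64                     # head dim, full cache size, small cache size

# Stand-in for the KV cache a long document would produce in one attention head.
K_full, V_full = torch.randn(n, d), torch.randn(n, d)

# Learnable compressed cache: 32x fewer entries than the full cache.
K_small = torch.nn.Parameter(0.02 * torch.randn(m, d))
V_small = torch.nn.Parameter(0.02 * torch.randn(m, d))

def attend(q, K, V):
    # Standard scaled dot-product attention for a batch of queries q: (b, d).
    w = torch.softmax(q @ K.t() / d ** 0.5, dim=-1)
    return w @ V

opt = torch.optim.Adam([K_small, V_small], lr=1e-2)
for step in range(2001):
    q = torch.randn(256, d)                # synthetic "study" queries
    loss = torch.nn.functional.mse_loss(
        attend(q, K_small, V_small), attend(q, K_full, V_full)
    )
    opt.zero_grad()
    loss.backward()
    opt.step()
    if step % 500 == 0:
        print(f"step {step:4d}  loss {loss.item():.5f}")
```
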
Stuart Sul (@stuart_sul):

We worked closely with the OpenAI team to make sure GPT-5 is the best coding agent ever on Cursor. For me, it’s the first AI model that actually provides meaningful help with GPU kernels (especially at finding race conditions). Everyone should give it a try.