Reece Shuttleworth (@reeceshuttle) Twitter Tweets • TwiCopy

Reece Shuttleworth

@reeceshuttle

+ Follow

MIT '25

ID: 1545068827137941504

linkhttp://reeceshuttle.me calendar_today07-07-2022 15:37:02

13 Tweet

368 Takipçi

73 Takip Edilen

Vedang Lad

@vedanglad

2 years ago

1/7 Wondered what happens when you permute the layers of a language model? In our recent paper with Max Tegmark, we swap and delete entire layers to understand how models perform inference - in doing so we see signs of four universal stages of inference!

1/7 Wondered what happens when you permute the layers of a language model? In our recent paper with <a href="/tegmark/">Max Tegmark</a>, we swap and delete entire layers to understand how models perform inference - in doing so we see signs of four universal stages of inference!

thumb_up_off_alt556

chat_bubble_outline21

repeat96

shareShare

Elon Musk

@elonmusk

6 months ago

Stefano Ermon Inception Diffusion will obviously work on any bitstream. With text, since humans read from first word to last, there is just the question of whether the delay to first sentence for diffusion is worth it. That said, the vast majority of AI workload will be video understanding and

thumb_up_off_alt2,2K

chat_bubble_outline131

repeat206

shareShare

Reece Shuttleworth

@reeceshuttle

3 months ago

Excited to have this work included in the PEFT library! PR: github.com/huggingface/pe…

thumb_up_off_alt167

chat_bubble_outline3

repeat20

shareShare

Inception Labs

@inceptionailabs

3 months ago

"It's now or never." Our CEO @stefanoermon on why he started Inception – featured this week in The Wall Street Journal. We're building dLLMs that generate tokens in parallel. Faster. More efficient. More controllable. This is the moment. Thanks for the coverage Kate Clark

"It's now or never."

Our CEO @stefanoermon on why he started Inception – featured this week in <a href="/WSJ/">The Wall Street Journal</a>.

We're building dLLMs that generate tokens in parallel. Faster. More efficient. More controllable.

This is the moment. Thanks for the coverage <a href="/KateClarkTweets/">Kate Clark</a>

thumb_up_off_alt29

chat_bubble_outline1

repeat10

shareShare

Julia Turc

@juliarturc

3 months ago

Diffusion clicked for me when I read about score-based models, a line of work pioneered by Stefano Ermon (et al.) at Stanford. So it was a full-circle moment to collab with him and Inception on a video about training & sampling techniques for making diffusion LLMs faster.

thumb_up_off_alt153

chat_bubble_outline5

repeat30

shareShare

Inception Labs

@inceptionailabs

3 months ago

The more structure a language has, the faster diffusion can run. Code fits that profile. Code has plenty of it. Listen to Sid Kharbanda on how diffusion unlocks speed for real-world coding workloads. #Diffusion #AIInfrastructure #DeveloperTools

thumb_up_off_alt17

chat_bubble_outline0

repeat6

shareShare