Reece Shuttleworth (@reeceshuttle) 's Twitter Profile
Reece Shuttleworth

@reeceshuttle

MIT '25

ID: 1545068827137941504

linkhttp://reeceshuttle.me calendar_today07-07-2022 15:37:02

13 Tweet

368 Takipçi

73 Takip Edilen

Vedang Lad (@vedanglad) 's Twitter Profile Photo

1/7 Wondered what happens when you permute the layers of a language model? In our recent paper with Max Tegmark, we swap and delete entire layers to understand how models perform inference - in doing so we see signs of four universal stages of inference!

1/7 Wondered what happens when you permute the layers of a language model? In our recent paper with  <a href="/tegmark/">Max Tegmark</a>, we swap and delete entire layers to understand how models perform inference - in doing so we see signs of four universal stages of inference!
Elon Musk (@elonmusk) 's Twitter Profile Photo

Stefano Ermon Inception Diffusion will obviously work on any bitstream. With text, since humans read from first word to last, there is just the question of whether the delay to first sentence for diffusion is worth it. That said, the vast majority of AI workload will be video understanding and

Inception Labs (@inceptionailabs) 's Twitter Profile Photo

"It's now or never." Our CEO @stefanoermon on why he started Inception – featured this week in The Wall Street Journal. We're building dLLMs that generate tokens in parallel. Faster. More efficient. More controllable. This is the moment. Thanks for the coverage Kate Clark

"It's now or never."

Our CEO @stefanoermon on why he started Inception – featured this week in <a href="/WSJ/">The Wall Street Journal</a>.

We're building dLLMs that generate tokens in parallel. Faster. More efficient. More controllable.

This is the moment. Thanks for the coverage <a href="/KateClarkTweets/">Kate Clark</a>
Julia Turc (@juliarturc) 's Twitter Profile Photo

Diffusion clicked for me when I read about score-based models, a line of work pioneered by Stefano Ermon (et al.) at Stanford. So it was a full-circle moment to collab with him and Inception on a video about training & sampling techniques for making diffusion LLMs faster.

Inception Labs (@inceptionailabs) 's Twitter Profile Photo

The more structure a language has, the faster diffusion can run. Code fits that profile. Code has plenty of it. Listen to Sid Kharbanda on how diffusion unlocks speed for real-world coding workloads. #Diffusion #AIInfrastructure #DeveloperTools