@sedielem : Why do diffusion models generalise at all? It's not obvious that they would. It turns out underfitting plays an important role, as well as the architectural inductive biases of locality and translation equivariance. What other kinds of symmetry and structure could we hardcode? 🤔 • TwiCopy

Sander Dieleman

@sedielem

+ Follow

Research Scientist at Google DeepMind (WaveNet, Imagen 3, Veo). I tweet about deep learning (research + software), music, generative models (personal account).

ID: 2902658140

linkhttps://sander.ai calendar_today02-12-2014 18:02:01

2,2K Tweet

59,59K Followers

1,1K Following

Sander Dieleman

@sedielem

7 months ago

Why do diffusion models generalise at all? It's not obvious that they would. It turns out underfitting plays an important role, as well as the architectural inductive biases of locality and translation equivariance. What other kinds of symmetry and structure could we hardcode? 🤔

thumb_up_off_alt270

chat_bubble_outline5

repeat34

shareShare