Sander Dieleman (@sedielem) 's Twitter Profile
Sander Dieleman

@sedielem

Research Scientist at Google DeepMind (WaveNet, Imagen 3, Veo). I tweet about deep learning (research + software), music, generative models (personal account).

ID: 2902658140

linkhttps://sander.ai calendar_today02-12-2014 18:02:01

2,2K Tweet

59,59K Followers

1,1K Following

Sander Dieleman (@sedielem) 's Twitter Profile Photo

Why do diffusion models generalise at all? It's not obvious that they would. It turns out underfitting plays an important role, as well as the architectural inductive biases of locality and translation equivariance. What other kinds of symmetry and structure could we hardcode? 🤔