Danilo J. Rezende
@danilojrezende
Head of AI Research @ EIT | ex-Director @ DeepMind Building models to accelerate fundamental sciences and medicine. Opinions my own.
ID: 797433864
https://danilorezende.com/ 02-09-2012 03:44:53
3,3K Tweet
35,35K Takipçi
1,1K Takip Edilen
This is interesting as a first large diffusion-based LLM. Most of the LLMs you've been seeing are ~clones as far as the core modeling approach goes. They're all trained "autoregressively", i.e. predicting tokens from left to right. Diffusion is different - it doesn't go left to