
Nancy Z
@nzheng89





This is interesting as a first large diffusion-based LLM. Most of the LLMs you've been seeing are ~clones as far as the core modeling approach goes. They're all trained "autoregressively", i.e. predicting tokens from left to right. Diffusion is different - it doesn't go left to




Garry Kasparov I truly do not understand how America benefits from siding with Russia. It's so mystifying that I can't help wondering again whether Putin has some kind of hold over Trump.










