SnowShadow
@alfredxia
ID: 192231368
http://blog.sina.com.cn/cloud614 18-09-2010 15:11:55
35 Tweets
8 Followers
78 Following
This is interesting as a first large diffusion-based LLM. Most of the LLMs you've been seeing are ~clones as far as the core modeling approach goes. They're all trained "autoregressively", i.e. predicting tokens from left to right. Diffusion is different - it doesn't go left to…
Discover Deep Innovation: Your ultimate Second Brain for navigating business challenges in the AI era. Stay ahead with smarter decisions and cutting-edge solutions! 🚀 Explore more: producthunt.com/products/deep-… via Product Hunt 😸