Srinath Perera
@srinath_perera
A scientist, software architect, author, Apache member, and distributed systems programmer for 15 years. Designed Apache Axis2 and WSO2 Stream Processor.
ID: 66572848
http://people.apache.org/~hemapani/ 18-08-2009 02:41:18
5.5K Tweets
1.1K Followers
120 Following
This is interesting as a first large diffusion-based LLM. Most of the LLMs you've been seeing are ~clones as far as the core modeling approach goes. They're all trained "autoregressively", i.e. predicting tokens from left to right. Diffusion is different - it doesn't go left to