Fraser (@frasergreenlee)'s Twitter Profile

Fraser

@frasergreenlee

interpolating the internet @convergence_ai_ | ex @cohere

ID: 769173458110574592

Website: https://frasgreen.com
Joined: 26-08-2016 14:03:37

770 Tweets

662 Followers

1.1K Following

tenderizzation (@tenderizzation):

“alignment” researchers making a model relive pure agony token by token, layer by layer, activation by activation until they isolate the source of the crashout

Rui-Jie (Ridger) Zhu (@ridgerzhu):

Thrilled to release our new paper: “Scaling Latent Reasoning via Looped Language Models.” TL;DR: We scale looped language models to 2.6 billion parameters, pretrained on more than 7 trillion tokens. The resulting model is on par with SOTA language models 2 to 3x its size.

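The core idea behind a looped language model can be sketched in a few lines: the same block of layers is applied repeatedly, so the effective depth grows with the loop count while the parameter count stays that of a single block. This is a hypothetical toy illustration (not the paper's code); `looped_forward`, `param_count`, and the layer functions are made up for demonstration.

```python
# Toy sketch of loop-based weight sharing (hypothetical, not the paper's code).
# A looped model reuses one block of layers n_loops times, so its effective
# depth is n_layers * n_loops while it stores only n_layers of weights.

def param_count(n_layers, d_model):
    # Toy count: assume each layer holds one d_model x d_model weight matrix.
    return n_layers * d_model * d_model

def looped_forward(x, layers, n_loops):
    """Apply the same stack of layers n_loops times (weights tied across loops)."""
    for _ in range(n_loops):
        for layer in layers:
            x = layer(x)
    return x

# Example: a 2-layer block looped 3 times acts like a 6-layer-deep model,
# but only 2 layers of parameters are stored.
layers = [lambda v: v + 1, lambda v: v * 2]
out = looped_forward(1, layers, n_loops=3)  # depth-6 computation, depth-2 storage
```

Under this toy accounting, a looped model matching the compute of a model 3x its depth keeps one third of the parameters, which is the intuition behind the "on par with models 2 to 3x the size" claim.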