p(doom) (@prob_doom) Twitter Tweets • TwiCopy

p(doom)

@prob_doom

+ Follow

ID: 1797501310851231744

linkhttp://pdoom.org calendar_today03-06-2024 05:31:41

19 Tweet

158 Takipçi

1 Takip Edilen

good girl

@goodgirlxsz

5 hours ago

🔥Telegram İfşa

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

WHY do we need the causal mask in large-scale language modeling? In our latest blog post, we challenge common misconceptions, historically motivate alternatives and state the main challenge in moving beyond the causal mask. pdoom.org/causal_mask.ht…

thumb_up_off_alt6

chat_bubble_outline0

repeat2

shareShare

p(doom)

@prob_doom

8 months ago

A very simple observation explains a large chunk of the research progress of the last decade and permits extrapolating what ideas will truly matter in the future: Neural networks do not generalize out of distribution. pdoom.org/thesis.html

thumb_up_off_alt3

chat_bubble_outline0

repeat1

shareShare

p(doom)

@prob_doom

8 months ago

PPO is not an on-policy algorithm and the strictly on-policy version of PPO (using a single optimizer step per PPO update) reduces to the vanilla policy gradient with baseline. pdoom.org/ppo_off_policy…

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare