Teortaxes▶️ (@teortaxestex) 's Twitter Profile
Teortaxes▶️

@teortaxestex

Ours is the age of unaligned utilitarians. Other problems are relatively unimportant, but sometimes I tweet about them anyway.
@deepseek_ai stan #1
(кто/кого)

ID: 192201556

calendar_today18-09-2010 13:32:22

36,36K Tweet

14,14K Followers

1,1K Following

Teortaxes▶️ (@teortaxestex) 's Twitter Profile Photo

It surprises me - though maybe it shouldn't – that for all the historical RL focus and, as per Sherjil, gargantuan effort spent battling Gemini reward hacking, GDM ended up just bad at personality alignment. Their models are unusually passive aggressive, paranoid and neurotic.

It surprises me - though maybe it shouldn't – that for all the historical RL focus and, as per Sherjil, gargantuan effort spent battling Gemini reward hacking, GDM ended up just bad at personality alignment. Their models are unusually passive aggressive, paranoid and neurotic.