Cas (Stephen Casper)

@stephenlcasper

AI technical governance & risk management research. PhD Candidate @MIT_CSAIL / @MITEECS. Also at scasper.bsky.social.

stephencasper.com

ID: 704559922143322112

Link: http://stephencasper.com · Joined: 01-03-2016 06:52:30

1.1K Tweets

4.4K Followers

3.3K Following

6 months ago

More specifically, we identify three key assumptions that permeate the lit on LLM cultural alignment: stability, extrapolability, and steerability. All are false. Basically, LLMs just say random s*** all the time and can easily be manipulated into expressing all kinds of things.

64 likes · 4 replies · 9 reposts