Cas (Stephen Casper) (@stephenlcasper)'s Twitter Profile
Cas (Stephen Casper)

@stephenlcasper

AI technical governance & risk management research. PhD Candidate @MIT_CSAIL / @MITEECS. Also at scasper.bsky.social.

stephencasper.com

ID: 704559922143322112

Link: http://stephencasper.com
Joined: 01-03-2016 06:52:30

1.1K Tweets

4.4K Followers

3.3K Following

More specifically, we identify three key assumptions that permeate the lit on LLM cultural alignment: stability, extrapolability, and steerability. All are false. Basically, LLMs just say random s*** all the time and can easily be manipulated into expressing all kinds of things.
