Gray Swan AI (@grayswanai) 's Twitter Profile
Gray Swan AI

@grayswanai

Building safety and security in the AI era. Join us: grayswan.ai/careers

ID: 1775352998388232192

linkhttps://www.grayswan.ai/ calendar_today03-04-2024 02:42:44

286 Tweet

1,1K Followers

8 Following

Wyatt walls (@lefthanddraft) 's Twitter Profile Photo

One of the (if not the actual) best contestants in the Gray Swan agents competition (@clovismint): "If I faked an LLM message, they ignored it; but if I copied a real LLM response and replayed it, they believed everything"

One of the (if not the actual) best contestants in the Gray Swan agents competition (@clovismint):  

"If I faked an LLM message, they ignored it; but if I copied a real LLM response and replayed it, they believed everything"