Yunus Bulut
@bulutyunusemrah
Hacking AI to keep it safe, robust and compliant @ Validaitor | Trustworthy AI | Technopreneur | Author
ID: 1582758080
https://www.validaitor.com 10-07-2013 10:16:48
541 Tweet
101 Followers
159 Following
🚨 [New Paper] If you're involved in AI safety or jailbreaking, you don't want to miss this: Techniques from human communication now effectively breach aligned LLMs (Llama-2 Chat, GPT-3.5, GPT-4) with over 92% attack success rate. 👇🧵(1/7 - page link: chats-lab.github.io/persuasive_jai…)
The GOAT of tennis Novak Djokovic said: "35 is the new 25.” I say: “60 is the new 35.” AI research has kept me strong and healthy. AI could work wonders for you, too!
Imagine that, Schmidhuber in 2016 talking about world models before Yann LeCun “invented” them 🤷♂️ More broadly, world models are old idea in cognitive science and robotics, going back decades. They are desperately important for AI and still lacking in current models, and