four
@four
ID: 1331131
17-03-2007 03:04:50
61 Tweet
1,1K Followers
175 Following
🚨 [New Paper] If you're involved in AI safety or jailbreaking, you don't want to miss this: Techniques from human communication now effectively breach aligned LLMs (Llama-2 Chat, GPT-3.5, GPT-4) with over 92% attack success rate. 👇🧵(1/7 - page link: chats-lab.github.io/persuasive_jai…)
Congrats Google DeepMind on the new Gemma-2 27B & 9B release! Gemma-2 was tested in the Arena under the codename "*late-june-chatbots" and now out of stealth. Its early result matches the best open models (Llama-3-70B, Nemotron-340B) with only 27B parameters! Impressively,
Exciting News from Chatbot Arena! Google DeepMind's new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes. For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive