@iambrishing : Check out the latest Attack Atlas taxonomy at the @NeurIPSConf red teaming workshop! It maps out a framework for thinking through single-turn prompt attacks in Gen AI arxiv.org/abs/2409.15398 #AIsafety #RedTeam #AIsecurity @GiandomenicoC17 @pinyuchenTW @prasatti @krvarshney

@iambrishing

Security, Privacy & Adversarial ML Researcher @IBMResearch · Previously @CambridgeMLG, @iitdelhi

ID: 1702286377

Joined: 26-08-2013 15:46:54

112 Tweets

138 Followers

504 Following

9 months ago

Check out the latest Attack Atlas taxonomy at the NeurIPS Conference red teaming workshop! It maps out a framework for thinking through single-turn prompt attacks in Gen AI arxiv.org/abs/2409.15398 #AIsafety #RedTeam #AIsecurity Giandomenico Cornacchia Pin-Yu Chen Prasanna Sattigeri Kush Varshney कुश वार्ष्णेय

6 likes

0 replies

4 retweets
