
Ambrish Rawat
@iambrishing
Security, Privacy & Adversarial ML Researcher @IBMResearch · Previously @CambridgeMLG, @iitdelhi
ID: 1702286377
26-08-2013 15:46:54
112 Tweet
138 Takipçi
504 Takip Edilen

Check out the latest Attack Atlas taxonomy at the NeurIPS Conference red teaming workshop! It maps out a framework for thinking through single-turn prompt attacks in Gen AI arxiv.org/abs/2409.15398 #AIsafety #RedTeam #AIsecurity Giandomenico Cornacchia Pin-Yu Chen Prasanna Sattigeri Kush Varshney कुश वार्ष्णेय