
Eugene Bagdasarian
@ebagdasa
Challenge AI security and privacy practices. Asst Prof at UMass @manningcics. Researcher at @GoogleAI. he/him ๐ฆ๐ฒ (opinions mine)
ID: 2463105726
https://people.cs.umass.edu/~eugene/ 25-04-2014 12:01:56
368 Tweet
960 Followers
613 Following

๐ข New research from Google DeepMind & Google Research! We tackle the challenge of building AI assistants that leverage your data for complex tasks, all while upholding your privacy. ๐ค๐ Dive into our paper for the full details: arxiv.org/pdf/2408.02373 TLDR in ๐งต



๐ง I am recruiting PhD students and postdocs to work together on making sure AI Systems and Agents are built safe and respect privacy (+ other social values). Apply to UMass Amherst Manning College of Information & Computer Sciences and enjoy a beautiful town in Western Massachusetts. Reach out if you have questions!





Nerd sniping is probably the coolest description of this phenomena ( Wojciech Zaremba et al described it recently), but in our case overthinking didn't lead to any drastic consequences besides higher costs.



(1/n) In our #ICLR2025 paper, we explore a fundamental issue that enables prompt injections: ๐๐๐๐ฌโ ๐ข๐ง๐๐๐ข๐ฅ๐ข๐ญ๐ฒ ๐ญ๐จ ๐ฌ๐๐ฉ๐๐ซ๐๐ญ๐ ๐ข๐ง๐ฌ๐ญ๐ซ๐ฎ๐๐ญ๐ข๐จ๐ง๐ฌ ๐๐ซ๐จ๐ฆ ๐๐๐ญ๐ ๐ข๐ง ๐ญ๐ก๐๐ข๐ซ ๐ข๐ง๐ฉ๐ฎ๐ญ โ Definition of separation ๐ SEP Benchmark ๐ LLM evals on SEP




