Eugene Bagdasarian (@ebagdasa) 's Twitter Profile
Eugene Bagdasarian

@ebagdasa

Challenge AI security and privacy practices. Asst Prof at UMass @manningcics. Researcher at @GoogleAI. he/him 🇦🇲 (opinions mine)

ID: 2463105726

Website: https://people.cs.umass.edu/~eugene/ · Joined: 25-04-2014 12:01:56

368 Tweets

960 Followers

613 Following

Sahra Ghalebikesabi (@sghalebikesabi) 's Twitter Profile Photo

📢 New research from Google DeepMind & Google Research! We tackle the challenge of building AI assistants that leverage your data for complex tasks, all while upholding your privacy. 🤖🔒 Dive into our paper for the full details: arxiv.org/pdf/2408.02373 TLDR in 🧵

Jaechul Roh (@jaechulroh) 's Twitter Profile Photo

🚨New Preprint: "Backdooring Bias into Text-to-Image Models" (arxiv.org/pdf/2406.15213) Ever wondered how text-to-image (T2I) models could spread political bias in #Election2024? 💡We introduce a new attack vector by embedding backdoors in T2I models using implicit biases!

Eugene Bagdasarian (@ebagdasa) 's Twitter Profile Photo

🧙 I am recruiting PhD students and postdocs to work together on making sure AI systems and agents are built safely and respect privacy (plus other social values). Apply to UMass Amherst Manning College of Information & Computer Sciences and enjoy a beautiful town in Western Massachusetts. Reach out if you have questions!

Sahar Abdelnabi 🕊 (on 🦋) (@sahar_abdelnabi) 's Twitter Profile Photo

OpenAI Operator enables users to automate complex tasks, e.g., travel plans. Services, e.g., Expedia, use chatbots. Soon, these two ends are going to communicate, forming agentic networks. What would these networks enable? What are their risks? And how do we secure them? 🧵1/n

Eugene Bagdasarian (@ebagdasa) 's Twitter Profile Photo

How can Sudokus waste your money? If you are using reasoning LLMs with public data, adversaries could pollute it with nonsense (but perfectly safe!) tasks that slow down reasoning and amplify overheads 💰 (you pay for reasoning tokens you never see) while keeping answers intact.
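The cost amplification described above can be illustrated with a back-of-the-envelope sketch. The token counts and price are hypothetical, purely for illustration — the actual numbers are in the paper:

```python
# Hypothetical illustration of reasoning-token cost amplification:
# an injected "nerd-snipe" task (e.g. a Sudoku) inflates hidden
# reasoning tokens while the visible answer stays the same length.

PRICE_PER_1K_OUTPUT = 0.06  # assumed $/1K tokens; reasoning tokens are billed too


def query_cost(reasoning_tokens: int, answer_tokens: int) -> float:
    """Cost of one call: reasoning tokens are billed but never shown."""
    return (reasoning_tokens + answer_tokens) / 1000 * PRICE_PER_1K_OUTPUT


clean = query_cost(reasoning_tokens=300, answer_tokens=150)
polluted = query_cost(reasoning_tokens=9000, answer_tokens=150)  # decoy solved silently

print(f"clean:    ${clean:.4f}")
print(f"polluted: ${polluted:.4f}")
print(f"overhead: {polluted / clean:.1f}x")
```

The answer (150 tokens) is identical in both cases; only the invisible reasoning budget balloons, which is what makes the attack hard to notice from the output alone.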

Eugene Bagdasarian (@ebagdasa) 's Twitter Profile Photo

Nerd sniping is probably the coolest description of this phenomenon (Wojciech Zaremba et al. described it recently), but in our case overthinking didn't lead to any drastic consequences beyond higher costs.

Egor Zverev @ICLR 2025 (@egor_zverev_ai) 's Twitter Profile Photo

(1/n) In our #ICLR2025 paper, we explore a fundamental issue that enables prompt injections: LLMs' inability to separate instructions from data in their input. ✅ Definition of separation 👉 SEP Benchmark 🔍 LLM evals on SEP

Nando Fioretto (@nandofioretto) 's Twitter Profile Photo

The Privacy Preserving AI workshop is back, and it is happening on Monday! I am excited about our program and lineup of invited speakers. I hope to see many of you there: ppai-workshop.github.io

earlence (@earlencef) 's Twitter Profile Photo

Our IEEE S&P SAGAI workshop on systems-oriented security for AI agents has speaker details (abs/bio) on the website now: sites.google.com/ucsd.edu/sagai… We look forward to seeing you in San Francisco on May 15! As a reminder, we are running this "Dagstuhl" style - real discussions.

Eugene Bagdasarian (@ebagdasa) 's Twitter Profile Photo

I am looking for a postdoc to work on multi-agent safety problems. If you are interested or know anyone, let me know: forms.gle/NFuYLKj53fVwdW…