Michael Aerni @ ICLR (@aernimichael)'s Twitter Profile
Michael Aerni @ ICLR

@aernimichael

AI privacy and security | PhD student @CSatETH | Ask me about coffee ☕️

ID: 927860226811981824

Website: https://michaelaerni.com | Joined: 07-11-2017 11:28:11

80 Tweets

165 Followers

157 Following

Michael Aerni @ ICLR (@aernimichael)'s Twitter Profile Photo

🔥 I'm thrilled that I'll be spending next year in the group of Florian Tramèr at ETH Zurich, working on privacy and memorization in ML 🔥 (Not an announcement, just what I usually do. It's a great group full of amazing people, and I'm thrilled to work with them every day!)

Michael Aerni @ ICLR (@aernimichael)'s Twitter Profile Photo

I am in beautiful Vancouver for #NeurIPS2024 with those amazing folks! Say hi if you want to chat about ML privacy and security (or speciality ☕)

Niloofar (on faculty job market!) (@niloofar_mire)'s Twitter Profile Photo

I've been thinking about Privacy & LLMs work for 2025 - here are 5 research directions and some key papers on privacy/memorization to get started: 🧵

Javier Rando @ ICLR (@javirandor)'s Twitter Profile Photo

Adversarial ML research is evolving, but not necessarily for the better. In our new paper, we argue that LLMs have made problems harder to solve, and even tougher to evaluate. Here’s why another decade of work might still leave us without meaningful progress. 👇

ETH CS Department (@csateth)'s Twitter Profile Photo

🔎 Can #AI models be "cured" after a cyber attack? New research from Florian Tramèr's Secure and Private AI Lab reveals that removing poisoned data from AI is harder than we think – harmful info isn't erased, just hidden. So how do we make AI truly secure? bit.ly/41bJB05

Edoardo Debenedetti (@edoardo_debe)'s Twitter Profile Photo

1/🔒Worried about giving your agent advanced capabilities due to prompt injection risks and rogue actions? Worry no more! Here's CaMeL: a robust defense against prompt injection attacks in LLM agents that provides formal security guarantees without modifying the underlying model!

Kristina Nikolic @ ICLR '25 (@nkristina01_)'s Twitter Profile Photo

Congrats, your jailbreak bypassed an LLM’s safety by making it pretend to be your grandma! But did the model actually give a useful answer? In our new paper we introduce the jailbreak tax — a metric to measure the utility drop due to jailbreaks.

Michael Aerni @ ICLR (@aernimichael)'s Twitter Profile Photo

IMO it's very important to measure LLM utility in tasks that we actually want them to perform well on, not just hard sandbox tasks. This is an excellent benchmark that does exactly that!

Michael Aerni @ ICLR (@aernimichael)'s Twitter Profile Photo

Imagine LLMs could tell you the future. But properly evaluating forecasts is incredibly tricky! This paper contains so many interesting thoughts about all the things that can go wrong.

Kristina Nikolic @ ICLR '25 (@nkristina01_)'s Twitter Profile Photo

We will present our spotlight paper on the 'jailbreak tax' tomorrow at ICML; it measures how useful jailbreak outputs are. See you Tuesday 11am at East #804. I'll be at ICML all week. Reach out if you want to chat about jailbreaks, agent security, or ML in general!