Aly M. Kassem (@_akassem) Twitter Tweets • TwiCopy

Aly M. Kassem

@_akassem

Exploration over Exploitation.
RA @Mila_Quebec. MSc @UWindsor. Interested in Adversarial attacks, security & reliability of LLMs

ID: 2467964973

Joined: 07-04-2014 19:08:22

114 Tweets

51 Followers

736 Following

Zhijing Jin✈️ ICLR Singapore

@zhijingjin

8 months ago

Check out my mentee's latest work on LLM Router attack! First work on this topic to the best of our knowledge. Read our paper at: zhijing-jin.com/files/papers/2… Great job to Aly M. Kassem!🎉 Intelligent Systems U of T Department of Computer Science

17 Likes

0 Replies

5 Retweets

Aly M. Kassem

@_akassem

8 months ago

Very useful thread — unfortunately, I learned it the hard way.

0 Likes

0 Replies

0 Retweets

Wenhu Chen

@wenhuchen

7 months ago

Finally, the crazy weeks of NeurIPS ddl, ICCV rebuttal and EMNLP ddl have passed. Now it's time to take a rest till Sep!

103 Likes

3 Replies

3 Retweets

Aly M. Kassem

@_akassem

4 months ago

We observed similar biased behavior when evaluating LLM routers, even with commercial solutions such as Amazon Bedrock. By biases, I mean that the router tends to favor certain categories or keywords, consistently directing them to the more powerful model: arxiv.org/abs/2504.07113

2 Likes

0 Replies

0 Retweets

Aly M. Kassem

@_akassem

4 months ago

🎉 Thrilled that our work has been accepted at #EMNLP2025 (Main Conference)! TL;DR: We propose a framework to predict & explain unintended side effects in models (e.g., emergent toxicity, forgotten knowledge) using OOD data. Huge thanks to Golnoosh Farnadi @NeurIPS2024, Negar Rostamzadeh, and Zhuan Shi 🚀

15 Likes

0 Replies

2 Retweets
