Aly M. Kassem (@_akassem) 's Twitter Profile
Aly M. Kassem

@_akassem

Exploration over Exploitation.
RA @Mila_Quebec. MSc @UWindsor. Interested in Adversarial attacks, security & reliability of LLMs

ID: 2467964973

calendar_today07-04-2014 19:08:22

114 Tweet

51 Followers

736 Following

Zhijing Jin✈️ ICLR Singapore (@zhijingjin) 's Twitter Profile Photo

Check out my mentee's latest work on LLM Router attack! First work on this topic to the best of our knowledge. Read our paper at: zhijing-jin.com/files/papers/2… Great job to Aly M. Kassem!🎉 Intelligent Systems U of T Department of Computer Science

Aly M. Kassem (@_akassem) 's Twitter Profile Photo

We observed similar biased behavior when evaluating LLM routers, even with commercial solutions such as Amazon Bedrock By biases, I mean that the router tends to favor certain categories or keywords, consistently directing them to the more powerful model arxiv.org/abs/2504.07113

Aly M. Kassem (@_akassem) 's Twitter Profile Photo

🎉 Thrilled that our work has been accepted at #EMNLP2025 (Main Conference)! TL;DR: We propose a framework to predict & explain unintended side effects in models (e.g., emergent toxicity, forgotten knowledge) using OOD data. Huge thanks to Golnoosh Farnadi @NeurIPS2024, Negar Rostamzadeh, and Zhuan Shi 🚀