
Aly M. Kassem
@_akassem
Exploration over Exploitation.
RA @Mila_Quebec. MSc @UWindsor. Interested in adversarial attacks, security & reliability of LLMs
ID: 2467964973
07-04-2014 19:08:22
114 Tweets
51 Followers
736 Following

Check out my mentee's latest work on LLM Router attack! First work on this topic to the best of our knowledge. Read our paper at: zhijing-jin.com/files/papers/2… Great job to Aly M. Kassem!🎉 Intelligent Systems U of T Department of Computer Science




🎉 Thrilled that our work has been accepted at #EMNLP2025 (Main Conference)! TL;DR: We propose a framework to predict & explain unintended side effects in models (e.g., emergent toxicity, forgotten knowledge) using OOD data. Huge thanks to Golnoosh Farnadi @NeurIPS2024, Negar Rostamzadeh, and Zhuan Shi 🚀