Hassan Sajjad
@hassaan84s
Associate Professor - Dalhousie University, Halifax, Canada
NLP, deep learning, explainable AI
ID: 415790425
https://hsajjad.github.io/
18-11-2011 20:32:42
362 Tweets
420 Followers
105 Following
#NAACL24 Come see the work of Domenic Anthony Rosati and David in the second interpretability session (11:00 am). If you are not around, see the recording here:
#syntacticprobing: youtube.com/watch?v=J6kyFd…
#modelediting: youtube.com/watch?v=8J3zlL…
I am excited to share our two papers on Safe and Trustworthy AI accepted at EMNLP 2024 #EMNLP2024. Thanks to my awesome students and collaborators.
Latent Concept-based Explanation of NLP Models: arxiv.org/pdf/2404.12545
Immunization against harmful fine-tuning attacks