Itay Nakash
@itay__nakash
IBM Research | AI Safety | Agents, LLMs & NLP
ID: 1544329316573581315
https://itay-nakash.github.io/ 05-07-2022 14:36:36
73 Tweet
127 Followers
363 Following
Current AI “alignment” is just a mask Our findings in The Wall Street Journal explore the limitations of today’s alignment techniques and what’s needed to get AI right 🧵
Happy to share that our work was accepted to Conference on Language Modeling 2025 ! 🇨🇦🍁
🚨Meet our panelists at the Actionable Interpretability Workshop Actionable Interpretability Workshop ICML2025 at ICML Conference! Join us July 19 at 4pm for a panel on making interpretability research actionable, its challenges, and how the community can drive greater impact. Naomi Saphra hiring my lab at ICML 🧈🪰 Samuel Marks Kyle Lo Fazl Barez
📣📣Presenting our platform used to build MTRAG!! RAGAPHENE: A RAG Annotation Platform with Human ENhancements and Edits Arxiv: arxiv.org/abs/2508.19272 MTRAG GitHub: github.com/IBM/mt-rag-ben… Join our MTRAGEval Task: ibm.github.io/mt-rag-benchma… Kshitij Fadnis Maeda Hanafi Marina Danilevsky
Excited to be at #EMNLP2025 in Suzhou 🇨🇳! I’ll present three papers, and I'm happy to chat about any of these works! 🏅 "Multi-Domain Explainability of Preferences" - Oral, Interpretability 2, Nov 5, 17:30 (A104-105) w/ Roi Reichart Liat Ein-Dor 🧠 "Dementia Through Different