
Itay Itzhak
@itay_itzhak_
NLProc, deep learning, and machine learning. Ph.D. student @TechnionLive and @HebrewU
ID: 1195653141934542848
http://itay1itzhak.github.io 16-11-2019 10:41:51
133 Tweet
282 Followers
220 Following





At #ACL2025 and not sure what to do next? GEM 💎² is the place to be for awesome talks on the future of LLM evaluation. Come hear Gabriel Stanovsky, Eliya Habba, Leshem (Legend) Choshen 🤖🤗 and others rethink what it means to actually evaluate LLMs beyond accuracy and vibes. Thursday @ Hall C!




Very pleased that "Trust me I'm Wrong" was accepted to EMNLP 2025 findings! Trust me I'm Wrong shows that LLMs can hallucinate with high certainty even when they know the correct answer! Check our latest work with Itay Itzhak, Fazl Barez, Gabriel Stanovsky, and Yonatan Belinkov.


Old news: Single-prompt eval is unreliable🤯 New news: PromptSuite🌈 - an easy way to augment your benchmark with thousands of paraphrases ➡️ robust eval, zero sweat! - Works on any dataset! - Python API + web UI Eliya Habba, Gili Lior, Gabriel Stanovsky eliyahabba.github.io/PromptSuite/



Now accepted to NeurIPS Conference! Want to better understand the performance gap in VLMs? Check out our work 👇🏻

Opportunities to join my group in fall 2026: * PhD applications direct or via ELLIS (ellis.eu/news/ellis-phd…) * Post-doc applications direct or via Azrieli Azrieli Foundation (azrielifoundation.org/fellows/intern…) or Zuckerman Zuckerman STEM Leadership Program (zuckermanstem.org/ourprograms/po…)