Itay Itzhak (@itay_itzhak_) Twitter Tweets • TwiCopy

Itay Itzhak

4 months ago

In Vienna for #ACL2025, and already had my first (vegan) Austrian sausage! Now hungry for discussing: – LLMs behavior – Interpretability – Biases & Hallucinations – Why eval is so hard (but so fun) Come say hi if that’s your vibe too!

thumb_up_off_alt23

chat_bubble_outline0

repeat4

shareShare

Itay Itzhak

@itay_itzhak_

4 months ago

Come to hear about our new dataset for robustness evaluation DOVE, tomorrow @ 18:00 poster session!

thumb_up_off_alt10

chat_bubble_outline1

repeat2

shareShare

BlackboxNLP

@blackboxnlp

4 months ago

📝 Technical report guidelines are out! If you're submitting to the MIB Shared Task at #BlackboxNLP, feel free to take a look to help you prepare your report: blackboxnlp.github.io/2025/task/

thumb_up_off_alt6

chat_bubble_outline0

repeat5

shareShare

Tal Haklay

@tal_haklay

4 months ago

Had my oral presentation at ACL ACL 2025 today! Big thanks to my collaborators, advisor, parents, and partner - and a special thanks to the “Goodbye Stress” gummies I picked up at the supermarket. Couldn’t have done it without any of you 🙈

Had my oral presentation at ACL <a href="/aclmeeting/">ACL 2025</a> today!
Big thanks to my collaborators, advisor, parents, and partner - and a special thanks to the “Goodbye Stress” gummies I picked up at the supermarket. Couldn’t have done it without any of you 🙈

thumb_up_off_alt54

chat_bubble_outline1

repeat2

shareShare

Itay Itzhak

@itay_itzhak_

4 months ago

At #ACL2025 and not sure what to do next? GEM 💎² is the place to be for awesome talks on the future of LLM evaluation. Come hear Gabriel Stanovsky, Eliya Habba, Leshem (Legend) Choshen 🤖🤗 and others rethink what it means to actually evaluate LLMs beyond accuracy and vibes. Thursday @ Hall C!

thumb_up_off_alt23

chat_bubble_outline0

repeat4

shareShare

Sebastian Gehrmann

@sebgehr

4 months ago

This year's GEM workshop is happening *today* starting at 9am in Vienna at #acl2025 in Hall C. I am looking forward to a day of evaluations.

thumb_up_off_alt14

chat_bubble_outline0

repeat2

shareShare

Enrico Santus

@enricosantus

4 months ago

I swear I warned all the romantics in the room — especially after the #Coldplay scandal! 😄🎶 If you were there (or wish you had been), tag yourself and your friends in the comments 👇 Bye bye from the #Gem organizers and speakers! #ACL2025 #ACL2025NLP #GEM2 #LLMs #NLP #Vienna

thumb_up_off_alt16

chat_bubble_outline4

repeat3

shareShare

Tomer Ashuach

@tomerashuach

4 months ago

🚨 New preprint out! CRISP: Persistent Concept Unlearning via SAEs LLMs often encode knowledge we want to remove. CRISP enables persistent, interpretable, precise unlearning while keeping models useful & coherent—tested on bio & cyber safety tasks🧵👇 📄arxiv.org/abs/2508.13650

thumb_up_off_alt79

chat_bubble_outline1

repeat19

shareShare

Adi Simhi

@adisimhi

3 months ago

Very pleased that "Trust me I'm Wrong" was accepted to EMNLP 2025 findings! Trust me I'm Wrong shows that LLMs can hallucinate with high certainty even when they know the correct answer! Check our latest work with Itay Itzhak, Fazl Barez, Gabriel Stanovsky, and Yonatan Belinkov.

Very pleased that "Trust me I'm Wrong" was accepted to <a href="/emnlpmeeting/">EMNLP 2025</a> findings!

Trust me I'm Wrong shows that LLMs can hallucinate with high certainty even when they know the correct answer!

Check our latest work with <a href="/Itay_itzhak_/">Itay Itzhak</a>, <a href="/FazlBarez/">Fazl Barez</a>, <a href="/GabiStanovsky/">Gabriel Stanovsky</a>, and <a href="/boknilev/">Yonatan Belinkov</a>.

thumb_up_off_alt114

chat_bubble_outline5

repeat13

shareShare

Noam Dahan

@dahan_noam

3 months ago

Old news: Single-prompt eval is unreliable🤯 New news: PromptSuite🌈 - an easy way to augment your benchmark with thousands of paraphrases ➡️ robust eval, zero sweat! - Works on any dataset! - Python API + web UI Eliya Habba, Gili Lior, Gabriel Stanovsky eliyahabba.github.io/PromptSuite/

thumb_up_off_alt58

chat_bubble_outline2

repeat14

shareShare

Eliya Habba

@eliyahabba

3 months ago

Proud to share PromptSuite! 🌈 A flexible framework for generating thousands of prompt variations per instance, enabling robust multi-prompt LLM evaluation across diverse tasks. Python API & web UI included. Check it out: eliyahabba.github.io/PromptSuite/

thumb_up_off_alt14

chat_bubble_outline0

repeat2

shareShare

Dana Arad 🎗️

@dana_arad4

3 months ago

Next Tuesday I’ll be giving a talk at MIT CSAIL about two of our recent papers on Sparse Autoencoders for Content Control 🧠✨ If you’re around, come by and say hi! csail.mit.edu/event/saes-con…

thumb_up_off_alt53

chat_bubble_outline0

repeat12

shareShare

Dana Arad 🎗️

@dana_arad4

3 months ago

Now accepted to NeurIPS Conference! Want to better understand the performance gap in VLMs? Check out our work 👇🏻

thumb_up_off_alt22

chat_bubble_outline1

repeat3

shareShare

Yonatan Belinkov

@boknilev

2 months ago

Opportunities to join my group in fall 2026: * PhD applications direct or via ELLIS (ellis.eu/news/ellis-phd…) * Post-doc applications direct or via Azrieli Azrieli Foundation (azrielifoundation.org/fellows/intern…) or Zuckerman Zuckerman STEM Leadership Program (zuckermanstem.org/ourprograms/po…)

thumb_up_off_alt335

chat_bubble_outline2

repeat52

shareShare