Noy Sternlicht (@noysternlicht) Twitter Tweets • TwiCopy

Noy Sternlicht

@noysternlicht

+ Follow

PhD candidate at @nlphuji | Using NLP to help scientists 📚

ID: 1174288985016999937

calendar_today18-09-2019 11:48:15

5 Tweet

67 Takipçi

308 Takip Edilen

Dana Arad 🎗️

@dana_arad4

6 months ago

Tried steering with SAEs and found that not all features behave as expected? Check out our new preprint - "SAEs Are Good for Steering - If You Select the Right Features" 🧵

thumb_up_off_alt166

chat_bubble_outline7

repeat32

shareShare

1/5 🚨 New paper alert! StressTest: Can YOUR Speech LM Handle the Stress? Sentence stress = emphasis on words to signal intent, contrast, or new info. We built StressTest — a benchmark for testing stress reasoning.🗣️💬 Then, meet StresSLM who finally gets it! Insights & Links 👇

thumb_up_off_alt49

chat_bubble_outline3

repeat14

shareShare

Kevin Lu

@kevinlu4588

6 months ago

When we "erase" a concept from a diffusion model, is that knowledge truly gone? 🤔 We investigated, and the answer is often 'no'! Using simple probing techniques, the knowledge traces of the erased concept can be easily resurfaced 🔍 Here is what we learned 🧵👇

thumb_up_off_alt33

chat_bubble_outline1

repeat8

shareShare

Esther Shizgal

@esthershizgal

4 months ago

🇵🇹 Spoke at #DH2025 about Religious Journeys in Holocaust Testimonies (arXiv link in thread) 🐟 Connecting with researchers using novel computational tools on real-world challenges in the humanities was inspiring! 🏰 Excited to build on these interdisciplinary methods!

thumb_up_off_alt18

chat_bubble_outline1

repeat7

shareShare

Eliya Habba

@eliyahabba

4 months ago

Presenting my poster : 🕊️ DOVE - A large-scale multi-dimensional predictions dataset towards meaningful LLM evaluation, Monday 18:00 Vienna, #ACL2025 Come chat about LLM evaluation, prompt sensitivity, and our 250M COLLECTION OF MODEL OUTPUTS!

thumb_up_off_alt46

chat_bubble_outline2

repeat11

shareShare

Asaf Yehudai

@asafyehudai

4 months ago

🚨 Benchmarks tell us which model is better — but not why it fails. For developers, this means tedious, manual error analysis. We're bridging that gap. Meet CLEAR: an open-source tool for actionable error analysis of LLMs. 🧵👇

thumb_up_off_alt41

chat_bubble_outline1

repeat13

shareShare

Noy Sternlicht

Dana Arad 🎗️

Iddo Yosha

Kevin Lu

Esther Shizgal

Eliya Habba

Asaf Yehudai