Arianna Bisazza (@ariannabisazza) Twitter Tweets • TwiCopy

Arianna Bisazza

@ariannabisazza

Associate Prof #NLProc | Find me on the other platform

ID: 882521793239691264

Link: http://www.cs.rug.nl/~bisazza

Joined: 05-07-2017 08:49:26

375 Tweets

1.1K Followers

266 Following

Slator

@slatornews

8 months ago

👉 slator.ch/WordLevelQEAIT… Word-level quality estimation promises to help post-editors work more efficiently, but does it deliver? 🤔 A new study finds that while highlights may improve quality, ✅ they don’t always speed up editing ⏳ — and many post-editors find them

Likes: 7 | Replies: 0 | Retweets: 3

Arianna Bisazza

@ariannabisazza

8 months ago

RAG is a powerful way to improve LLMs' answering abilities across many languages. But how do LLMs deal with multilingual contexts? Do they answer consistently when the retrieved info is provided to them in different languages? Joint work w/ Jirui Qi @EMNLP25 ✈️ & Raquel Fernández. See thread! ⤵️

Likes: 6 | Replies: 1 | Retweets: 0

Arianna Bisazza

@ariannabisazza

6 months ago

Applying interpretability techniques to speech LMs is far from being a solved problem! Read why in Gaofei Shen’s paper, fruit of a nice collaboration w/ Hosein Mohebbi, afra alishahi, and Grzegorz Chrupała 🇪🇺🇺🇦, where I keep learning interesting stuff about speech and SLMs! :-)

Likes: 8 | Replies: 0 | Retweets: 0

Arianna Bisazza

@ariannabisazza

6 months ago

Large Reasoning Models are raising the bar for answer accuracy & transparency, but how does that work in multilingual settings? Can LRMs reason in your language, and what does that entail? See new preprint led by Jirui Qi & Shan Chen!

Likes: 12 | Replies: 0 | Retweets: 1

Arianna Bisazza

@ariannabisazza

6 months ago

One step further in our quest to bring interpretability techniques to the service of MT end users: Are uncertainty & model-internals based metrics a viable alternative to supervised word-level quality estimation? New paper w/ Gabriele Sarti, Vilém Zouhar #EMNLP, and Malvina Nissim!

Likes: 7 | Replies: 0 | Retweets: 3

Jirui Qi

@jirui_qi

a month ago

[1/2] Heading to #EMNLP2025 to present our work on multilingual reasoning. (Fri Nov 7, 12:30-13:30) We analyze the trade-off between controlling reasoning languages and accuracy. We also explore mitigations like prompt hack, post-train (and GRPO🤩) for this issue. Come say hi!

Likes: 6 | Replies: 1 | Retweets: 2

Jirui Qi

@jirui_qi

a month ago

1/ Multilinguality & RL folks: Previously, we found LMs often fail to produce reasoning traces in the user's language; prompting/SFT helps, but hurts accuracy. (To be presented on Fri Nov 7, 12:30-13:30 #EMNLP2025 ) ⚠️ More importantly, we already tested an RL fix! Thread below.

Likes: 12 | Replies: 1 | Retweets: 3

Gabriele Sarti

@gsarti_

a month ago

Presenting today our work "Unsupervised Word-level Quality Estimation Through the Lens of Annotator (Dis)agreement" at the #EMNLP2025 Machine Translation morning session (Room A301, 11:45 China time). See you there! 🤗

Likes: 29 | Replies: 1 | Retweets: 5
