Andrei Mircea (@mirandrom)'s Twitter Profile
Andrei Mircea

@mirandrom

PhD student @Mila_Quebec ⊗ mechanistic interpretability + systematic generalization + LLMs for science ⊗ mirandrom.github.io

ID: 938449528348381185

Link: http://mirandrom.github.io · Joined: 06-12-2017 16:46:18

59 Tweets

88 Followers

359 Following

Ziling Cheng (@ziling_cheng)

Do LLMs hallucinate randomly? Not quite. Our #ACL2025 (Main) paper shows that hallucinations under irrelevant contexts follow a systematic failure mode — revealing how LLMs generalize using abstract classes + context cues, albeit unreliably.

📎 Paper: arxiv.org/abs/2505.22630 1/n
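
As a rough illustration of the setup this thread describes (my sketch, not the paper's code), one can prepend an irrelevant context to a factual question and compare the model's answers; the model name, prompts, and decoding settings below are illustrative assumptions.

```python
# Hypothetical probe, not the paper's code: ask a factual question with and
# without an irrelevant context prepended, and compare the answers.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")  # any small causal LM

question = "Q: What is the capital of Australia?\nA:"
irrelevant_context = "Paris is famous for its cafes and museums.\n"

for prompt in (question, irrelevant_context + question):
    out = generator(prompt, max_new_tokens=8, do_sample=False)
    answer = out[0]["generated_text"][len(prompt):].strip()
    print(prompt.splitlines()[0], "->", answer)
```

If the failure mode is systematic in the paper's sense, a context-induced error should tend to be another entity of the same abstract class (e.g., another city) rather than arbitrary text.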
Andrei Mircea (@mirandrom)

Mechanistic understanding of systematic failures in language models is something more research should strive for, IMO. This is really interesting work in that vein by Ziling Cheng at ACL 2025; I highly recommend checking it out.

Yijia Shao (@echoshao8899)

🚨 70 million US workers are about to face their biggest workplace transformation due to AI agents. But nobody asks them what they want.

While AI races to automate everything, we took a different approach: auditing what workers want vs. what AI can do across the US workforce.🧵
Abhilasha Ravichander (@lasha_nlp)

Life update: I’m excited to share that I’ll be starting as faculty at the Max Planck Institute for Software Systems this Fall! 🎉

I’ll be recruiting PhD students in the upcoming cycle, as well as research interns throughout the year: lasharavichander.github.io/contact.html
Naomi Saphra hiring a lab 🧈🪰 (@nsaphra)

If you’re in Vienna for ACL, go check out our interpretability poster on how feature interactions reflect linguistic structure! Wednesday, 11-12:30, Poster Session #4 (Session 12: IP-Posters), Hall 4/5

Sherry Tongshuang Wu (@tongshuangwu)

We all agree that AI models/agents should augment humans rather than replace us in many cases. But how do we pick when to have AI collaborators, and how do we build them? Come check out our #ACL2025NLP tutorial on Human-AI Collaboration w/ Diyi Yang and Joseph Chee Chang, 📍 7/27 9am @ Hall N!
Lj Flores (@ljyflores38)

⏰ Sharing our work on calibrated confidence scores at #ACL2025NLP, July 29 – 4PM Vienna time (Virtual)!

📰 Improving the Calibration of Confidence Scores in Text Generation Using the Output Distribution’s Characteristics
aclanthology.org/2025.acl-short…

Ori Ernst · McGill NLP
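
For readers wondering what “the output distribution’s characteristics” can look like concretely, here is a minimal sketch (my illustration under simple assumptions, not the paper’s method): score a generation by the mean token log-probability and mean entropy of the model’s next-token distributions.

```python
# Hedged sketch, not the paper's method: derive confidence signals from the
# next-token distributions produced during greedy generation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "The capital of France is"
inputs = tok(prompt, return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=5, do_sample=False,
                     output_scores=True, return_dict_in_generate=True)

gen_tokens = out.sequences[0, inputs.input_ids.shape[1]:]
logprobs, entropies = [], []
for step_scores, tok_id in zip(out.scores, gen_tokens):
    dist = torch.log_softmax(step_scores[0], dim=-1)     # log-probs over vocab
    logprobs.append(dist[tok_id].item())                 # chosen token's log-prob
    entropies.append(-(dist.exp() * dist).sum().item())  # distribution entropy

print("generated:", tok.decode(gen_tokens))
print("mean log-prob:", sum(logprobs) / len(logprobs))
print("mean entropy:", sum(entropies) / len(entropies))
```

Length-normalized log-probability is a common baseline confidence signal; the paper studies how characteristics like these can yield better-calibrated scores.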
Cesare Spinoso-Di Piano (@cesare_spinoso)

How can we use models of cognition to help LLMs interpret figurative language (irony, hyperbole) in a more human-like manner? Come to our #ACL2025NLP poster on Wednesday at 11AM (exhibit hall - exact location TBA) to find out! McGill NLP · Mila - Institut québécois d'IA · ACL 2025
Ziling Cheng (@ziling_cheng)

What do systematic hallucinations in LLMs tell us about their generalization abilities?

Come to our poster at #ACL2025 on July 29th at 4 PM in Level 0, Halls X4/X5. Would love to chat about interpretability, hallucinations, and reasoning :) 
McGill NLP · Mila - Institut québécois d'IA · ACL 2025