Robin Jia (@robinomial)'s Twitter Profile
Robin Jia

@robinomial

Assistant Professor @CSatUSC | Previously Visiting Researcher @facebookai | Stanford CS PhD @StanfordNLP

Link: https://robinjia.github.io/ | Joined: 28-06-2018 17:50:35

273 Tweets

3.3K Followers

865 Following

Tianyi Zhou (@tianyi_zhou12):

Billion-parameter LLMs still struggle with simple arithmetic? 📞 FoNE (Fourier Number Embedding) tackles this problem. By mapping numbers directly into Fourier space, it bypasses tokenization and significantly improves numerical accuracy and efficiency.
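
The tweet links the actual construction; as a rough, hypothetical sketch of the general idea only (the function name and the powers-of-10 period choice are my assumptions, not the paper's code), each digit position can get its own cos/sin pair:

```python
import numpy as np

def fourier_number_embedding(x: float, num_digit_positions: int = 5) -> np.ndarray:
    """Hypothetical sketch: encode a number with cos/sin features whose
    periods are powers of 10, so each digit position maps to one frequency."""
    feats = []
    for k in range(num_digit_positions):
        period = 10.0 ** (k + 1)  # period 10 captures the ones digit, 100 the tens, ...
        angle = 2.0 * np.pi * x / period
        feats.extend([np.cos(angle), np.sin(angle)])
    return np.asarray(feats)

# Example: 123 and 1123 agree on the features for the last three digit positions,
# since those frequencies only see the number modulo 10, 100, and 1000.
print(np.round(fourier_number_embedding(123.0), 3))
print(np.round(fourier_number_embedding(1123.0), 3))
```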

Johnny Tian-Zheng Wei (@johntzwei):

Many works addressing copyright for LLMs focus on model outputs and their similarity to copyrighted training data, but few focus on how the model was trained. We analyze LLM memorization with respect to training decisions and theorize about its use in court. arxiv.org/abs/2502.16290

Tianyi Lorena Yan (@lorenayannnnn):

When answering queries with multiple answers (e.g., listing cities of a country), how do LMs simultaneously recall knowledge and avoid repeating themselves? 🚀 Excited to share our latest work with Robin Jia! We uncover a promote-then-suppress mechanism: LMs first recall all

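The thread's analysis code isn't shown here, but a generic logit-lens probe, sketched below, is one common way to watch a mechanism like this: track how the logit of an already-generated answer token rises and falls across layers at the next position. The model and prompt are placeholders, not the ones studied in the paper.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder model, not necessarily the one studied
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, output_hidden_states=True)
model.eval()

# "Paris" has already been generated once; watch its logit at the next position.
prompt = "Some cities in France are Paris,"
inputs = tok(prompt, return_tensors="pt")
with torch.no_grad():
    out = model(**inputs)

paris_id = tok(" Paris")["input_ids"][0]
for layer, h in enumerate(out.hidden_states):
    # Project each layer's last hidden state through the final norm + unembedding.
    logits = model.lm_head(model.transformer.ln_f(h[0, -1]))
    print(f"layer {layer:2d}: logit(' Paris') = {logits[paris_id].item():.2f}")
```
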
USC Center for AI in Society (@cais_usc):

🎉Congrats to Aryan Gulati & Ryan Wang for receiving Honorable Mentions for the CRA Outstanding Undergraduate Researcher Awards! Aryan, a former CAIS++ co-president, was mentored by CAIS Associate Director Swabha Swayamdipta. Ryan worked with CAIS faculty Robin Jia. viterbischool.usc.edu/news/2025/03/f…

Workshop on Large Language Model Memorization (@l2m2_workshop):

Hi all, a reminder that our direct submission deadline is April 15th! We are co-located at ACL'25, and you can submit archival or non-archival. You can also submit work published elsewhere (non-archival). Hope to see your submission! sites.google.com/view/memorizat…

Wang Bill Zhu (@billjohn1235813):

🚨 New work! LLMs often sound helpful—but fail to challenge dangerous medical misconceptions in real patient questions. We test how well LLMs handle false assumptions in oncology Q&A.
📝 Paper: arxiv.org/abs/2504.11373
🌐 Website: cancermyth.github.io
👇 [1/n]

Robin Jia (@robinomial):

Really proud of this interdisciplinary LLM evaluation effort led by Wang Bill Zhu. We teamed up with oncologists from USC Keck SOM to understand LLM failure modes on realistic patient questions. Key finding: LLMs consistently fail to correct patients’ misconceptions!

Wang Bill Zhu (@billjohn1235813):

At NAACL HLT 2025 this week! I’ll be presenting our work on LLM domain induction with Jesse Thomason on Thu (5/1) at 4pm in Hall 3, Section I. Would love to connect and chat about LLM planning, reasoning, AI4Science, multimodal stuff, or anything else. Feel free to DM!

Robin Jia (@robinomial):

Check out Wang Bill Zhu’s excellent work on combining LLMs with symbolic planners at NAACL on Thursday! I will also be at NAACL Friday-Sunday, looking forward to chatting about LLM memorization, interpretability, evaluation, and more.

Vered Shwartz (@veredshwartz):

I've noticed (& confirmed with multiple people at #naacl2025) that NLP _for_ humans is more popular than ever, while NLP _with_ humans (user studies, human eval, crowdsourcing) gets pushback from reviewers who often don't consider this a valid contribution for *CL conferences. 1/2

Robin Jia (@robinomial):

Becoming an expert requires first learning the basics of the field. Learning the basics requires doing exercises that AI can do. No amount of class redesign can change this. (What will change: the weight of exams in computing the final grade)

Workshop on Large Language Model Memorization (@l2m2_workshop):

📢 ACL 2025 notifications have been sent out, making this the perfect time to finalize your commitment. Don't miss the opportunity to be part of the workshop! 🔗 Commit here: openreview.net/group?id=aclwe… 🗓️ Deadline: May 20, 2025 (AoE) #ACL2025 #NLProc

Deqing Fu (@deqingfu):

Textual steering vectors can improve visual understanding in multimodal LLMs! You can extract steering vectors via any interpretability toolkit you like -- SAEs, MeanShift, Probes -- and apply them to image or text tokens (or both) of Multimodal LLMs. And They Steer!

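As a sketch of the mean-shift variant only (the SAE- and probe-based extractors mentioned in the tweet are not shown, and the tensors, layer index, and scale below are placeholders): compute a difference-of-means direction from activations on concept-positive vs. concept-negative text, then add it to hidden states during the forward pass.

```python
import torch

def mean_shift_vector(pos_acts: torch.Tensor, neg_acts: torch.Tensor) -> torch.Tensor:
    """Difference of means over (n_examples, hidden_dim) activation matrices,
    collected at one layer for concept-positive vs. concept-negative prompts."""
    return pos_acts.mean(dim=0) - neg_acts.mean(dim=0)

def steering_hook(steer_vec: torch.Tensor, alpha: float = 4.0):
    """Forward hook that adds the steering vector to every token's hidden state."""
    def hook(module, inputs, output):
        hidden = output[0] if isinstance(output, tuple) else output
        hidden = hidden + alpha * steer_vec
        return (hidden,) + output[1:] if isinstance(output, tuple) else hidden
    return hook

# Usage sketch (layer index and alpha are illustrative, not from the paper):
# v = mean_shift_vector(pos_acts, neg_acts)
# handle = model.model.layers[20].register_forward_hook(steering_hook(v))
# ... run generation, steered ...
# handle.remove()
```
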
Stanford NLP Group (@stanfordnlp):

For this week’s NLP Seminar, we are thrilled to host Deqing Fu to talk about Closing the Modality Gap: Benchmarking and Improving Visual Understanding in Multimodal LLMs!

When: 5/22 Thurs 11am PT
Non-Stanford affiliates registration form (closed at 9am PT on the talk day):

Yuqing Yang (@yyqcode):

🧐When do LLMs admit their mistakes when they should know better? In our new paper, we define this behavior as retraction: the model indicates that its generated answer was wrong. LLMs can retract—but they rarely do.🤯 arxiv.org/abs/2505.16170 👇🧵

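The paper's operationalization of retraction is surely more careful than string matching, but as a toy illustration of the behavior being measured (the marker list below is invented), a crude check might look like:

```python
# Toy heuristic only: the paper's actual retraction detection is not shown here.
RETRACTION_MARKERS = (
    "wait, that's wrong",
    "actually, that is incorrect",
    "i was mistaken",
    "the answer above is wrong",
)

def is_retraction(continuation: str) -> bool:
    """True if the model's follow-up text signals that its earlier answer was wrong."""
    text = continuation.lower()
    return any(marker in text for marker in RETRACTION_MARKERS)

print(is_retraction("Wait, that's wrong. The capital is Canberra."))  # True
```
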
Robin Jia (@robinomial):

If an LLM’s hallucinated claim contradicts its own knowledge, it should be able to retract the claim. Yet, it often reaffirms the claim instead. Why? Yuqing Yang dives deep to show that faulty model internal beliefs (representations of “truthfulness”) drive retraction failures!
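
One generic way to look for such internal "truthfulness" representations (the paper's own probing setup may differ, and the data files below are placeholders for activations you would collect yourself) is a linear probe on hidden states of answers labeled correct vs. incorrect:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Placeholder inputs: hidden states collected at some layer for model answers,
# with labels marking whether each answer was actually correct.
X = np.load("answer_hidden_states.npy")  # shape (n_answers, hidden_dim), assumed precomputed
y = np.load("answer_is_correct.npy")     # shape (n_answers,), values in {0, 1}

probe = LogisticRegression(max_iter=1000).fit(X, y)
belief_direction = probe.coef_[0]  # candidate "internal belief" direction

# If the probe transfers to held-out answers, the model encodes a usable truthfulness
# signal; retraction failures could then be traced to cases where this internal
# signal disagrees with the answer's actual correctness.
print("probe train accuracy:", probe.score(X, y))
```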

Johnny Tian-Zheng Wei (@johntzwei):

Hi all, I'm going to ACM FAccT in Athens this week to present my paper on copyright and LLM memorization. Please reach out if you are interested in chatting about law, policy, and LLMs!