Addison Wu (@addisonwu_)'s Twitter Profile
Addison Wu

@addisonwu_

@princeton '27 | 🇨🇦🇺🇸 | befriending llms and vlms @cocosci_lab | eng/fr

ID: 1729488441648533504

Joined: 28-11-2023 13:12:42

5 Tweets

23 Followers

147 Following

Addison Wu (@addisonwu_):

New preprint!📢 There's been lots of exciting work recently on identifying gaps in and enhancing LLM reasoning. But it's equally important to take a step back and consider when doing so might HURT performance. Check out our paper for a framework!

Rohan Paul (@rohanpaul_ai):

CoT prompting can actually hurt LLM performance in some tasks.

The paper shows LLMs and humans share similar limitations when forced to explain their thinking.

Identifies specific scenarios where asking LLMs to explain reduces their accuracy.

i.e. When thinking out loud makes …
Ryan Liu @ NeurIPS 2024 (@theryanliu):

Chain of thought can hurt LLM performance 🤖
Verbal (over)thinking can hurt human performance 😵‍💫

Are when/why they happen similar?

Come find out at our poster at West-320 ⏰11am tomorrow!

#ICML2025
Ed H. Chi (@edchi):

One of the better posters I saw today at #icml25.

This gets at the root of the problems we were thinking about when we conceived and wrote the CoT paper.
Princeton Laboratory for Artificial Intelligence (@princetonainews):

Shoutout to all the <a href="/Princeton/">Princeton University</a> researchers participating in <a href="/icmlconf/">ICML Conference</a> #ICML2025 

Browse through some of the cutting edge research from AI Lab students, post-docs and faculty being presented this year: pli.princeton.edu/blog/2025/prin…
Addison Wu (@addisonwu_):

Thanks so much for the excellent coverage and for stopping by our poster at ICML! It was a pleasure to share our work with you!

Addison Wu (@addisonwu_):

How come LLM agents can carry out remarkable tasks like coding full-stack apps but still fall for poorly crafted pop-up scams?

We formalize this using the psychological concept of motivational vigilance. Come to our PragLM spotlight talk (11:15 am 520B) and poster (1:30-2:30 pm …
Ryan Liu @ NeurIPS 2024 (@theryanliu):

Everything online exists because someone had a reason to put it there. 🧐

LLMs process internet data, but do they consider why something was said 🤨 in the first place?

NO 🛑 - in real online recommendations, LLMs get < .2 correlation with rationally weighing others' intent 🥴
Rohan Paul (@rohanpaul_ai):

New Anthropic paper tests whether LLMs notice motives behind messages and adjust trust accordingly.

It asks whether they can tell when someone has a hidden motive, like being paid to promote something.

Shows a weakness in how current LLMs judge trust.

In simple test cases, the …
Jiayi Geng (@jiayiigeng):

We use LLMs for everyday tasks—research, writing, coding, decision-making. They remember our conversations, adapt to our needs and preferences. Naturally, we trust them more with repeated use.

But this growing trust might be masking a hidden risk: what if their beliefs are …