Jared Moore (@jaredlcm)'s Twitter Profile
Jared Moore

@jaredlcm

@jaredlcm.bsky.social
AI Researcher, Writer
Stanford

ID: 874693103306846209

Link: http://jaredmoore.org · Joined: 13-06-2017 18:21:01

78 Tweets

172 Followers

295 Following

Harvey Yiyun Fu (@harveyiyun)'s Twitter Profile Photo

LLMs excel at finding surprising “needles” in very long documents, but can they detect when information is conspicuously missing?

🫥AbsenceBench🫥 shows that even SoTA LLMs struggle on this task, suggesting that LLMs have trouble perceiving “negative space” in documents.

paper:
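The AbsenceBench task described above can be illustrated with a minimal sketch: delete a few lines from a document, then ask whether a model's guesses recover the omitted lines. This is my own illustrative construction, assuming a line-level omission setup; the function names and recall-style scoring are hypothetical, not taken from the paper.

```python
import random


def make_absence_example(lines, n_omit=3, seed=0):
    """Build an absence-detection example: remove n_omit lines from a
    document and keep the removed lines as the gold answer."""
    rng = random.Random(seed)
    omitted_idx = set(rng.sample(range(len(lines)), n_omit))
    modified = [line for i, line in enumerate(lines) if i not in omitted_idx]
    gold = [lines[i] for i in sorted(omitted_idx)]
    return modified, gold


def absence_recall(predicted, gold):
    """Fraction of truly omitted lines that the model identified."""
    return len(set(predicted) & set(gold)) / len(gold)
```

In use, `modified` (plus the original document) would be shown to the model, and its list of suspected-missing lines scored with `absence_recall`; the tweet's claim is that even SoTA models score poorly on exactly this kind of "negative space" query.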
Jared Moore (@jaredlcm)'s Twitter Profile Photo

I'll be presenting this paper as a poster (number 56) next Wednesday from 4:30 to 6:30 at Conference on Language Modeling. Please reach out if you'd like to chat about this or any of my other work in Montreal!

Taylor Sorensen (@ma_tay_)'s Twitter Profile Photo

Did you know that LLMs suffer from serious mode collapse?

For example, if you ask models to tell you a joke, they almost always tell you the same joke. This is true across samples and even across model families!

Why does this happen? Can we improve it?

x.com/artetxem/statu…
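Mode collapse of the kind described above is easy to quantify once you have repeated samples: count how many distinct outputs appear and how much probability mass the single most common output captures. The sketch below is a hypothetical diversity summary of my own, not the measurement used in the linked thread.

```python
from collections import Counter


def mode_collapse_stats(samples):
    """Summarize diversity of repeated model samples.

    Returns the fraction of distinct outputs and the share of samples
    taken by the single most frequent output (1.0 = total collapse).
    """
    counts = Counter(samples)
    n = len(samples)
    return {
        "distinct_frac": len(counts) / n,
        "top_share": counts.most_common(1)[0][1] / n,
    }
```

Sampling "tell me a joke" many times from a collapsed model would yield a `top_share` near 1.0; a diverse sampler would drive `distinct_frac` toward 1.0 instead.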
Taylor Sorensen (@ma_tay_)'s Twitter Profile Photo

🤖➡️📉 Post-training made LLMs better at chat and reasoning—but worse at distributional alignment, diversity, and sometimes even steering(!)

We measure this with our new resource (Spectrum Suite) and introduce Spectrum Tuning (method) to bring them back into our models! 🌈

1/🧵