Howard Yen (@howardyen1)'s Twitter Profile
Howard Yen

@howardyen1

ID: 2316455576

Joined: 29-01-2014 03:41:42

37 Tweets

212 Followers

225 Following

Zexue He (@zexuehe)'s Twitter Profile Photo


🚀 News! Our 2nd Workshop on Long-Context Foundation Models (LCFM), to be held at ICML 2025 in Vancouver 🇨🇦!
If you're working on long-context models, consider submitting your work!
🗓️ Deadline: May 22, 2025 (AoE)
🌐 Web: longcontextfm.github.io
🔗 OpenReview: bit.ly/lcfmworkshop
Jacqueline He (@jcqln_h)'s Twitter Profile Photo


LMs often output answers that sound right but aren't supported by the input context. This is intrinsic hallucination: the generation of plausible but unsupported content.

We propose Precise Information Control (PIC): a task requiring LMs to ground only on given verifiable claims.
Xi Ye (@xiye_nlp)'s Twitter Profile Photo


🤔 Recent mech interp work showed that retrieval heads can explain some long-context behavior. But can we use this insight for retrieval?
📣 Introducing QRHeads (query-focused retrieval heads) that enhance retrieval.

Main contributions:
🔍 Better head detection: we find a
Zexue He (@zexuehe)'s Twitter Profile Photo


💡 Curious about long-context foundation models (LCFM)?
🧠 We're hosting a panel at the LCFM workshop at #ICML2025 on "How to evaluate long-context foundation models?" We'd love to feature your question!

Anything on long-context evaluation or modeling: drop it below / DM me 🎤
Sadhika Malladi (@sadhikamalladi)'s Twitter Profile Photo

Excited to share that I will be starting as an Assistant Professor in CSE at UCSD (UCSD CSE) in Fall 2026! I am currently recruiting PhD students who want to bridge theory and practice in deep learning - see here: cs.princeton.edu/~smalladi/recr…

Howard Yen (@howardyen1)'s Twitter Profile Photo

Congrats!!! As an empiricist, I've always found your work super relevant and full of useful theoretical insights! Also, thanks for the great advice in our chats :)

Danqi Chen (@danqi_chen)'s Twitter Profile Photo


I am going to present two papers at #COLM2025 tomorrow from 4:30-6:30pm, as none of our leading authors can attend due to visa issues. 

Haven't done poster presentations for years 🤣🤣 ... so I will do my best!

#76: LongProc
#80: Goedel-Prover v1
Xi Ye (@xiye_nlp)'s Twitter Profile Photo


We will present QRHead (Wuwei Zhang, @WuweiZhang0723) at #EMNLP2025.

Without any training, it boosts Llama-3.1-8B's performance by >10% 📈 on context reasoning tasks (CLIPPER, LongMemEval), and outperforms specialized re-rankers on BEIR. Check out our (virtual) poster tonight!
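The "training-free re-ranking" idea above can be sketched in toy form: once a retrieval head has been identified, its attention from query tokens to each candidate passage can serve directly as a relevance score, with no fine-tuning. This is a minimal illustration under assumptions (toy random attention, made-up spans), not the paper's implementation.

```python
# Toy sketch of attention-based passage re-ranking: score each candidate
# passage by the attention mass it receives from the query tokens under
# one previously detected "retrieval head". All data here is illustrative.
import numpy as np

rng = np.random.default_rng(1)

# Assume 3 candidate passages of 5 tokens each, followed by a 2-token query.
passage_spans = [(0, 5), (5, 10), (10, 15)]
query_pos = [15, 16]
seq_len = 17

# Toy attention map for the chosen head (rows normalized like softmax).
attn = rng.random((seq_len, seq_len))
attn /= attn.sum(axis=-1, keepdims=True)

# Bias the head toward passage 1 so the ranking is non-trivial.
attn[np.ix_(query_pos, range(5, 10))] += 1.0
attn /= attn.sum(axis=-1, keepdims=True)

# Score each passage by the attention mass it receives from the query tokens.
scores = [attn[np.ix_(query_pos, range(s, e))].sum() for s, e in passage_spans]
ranking = sorted(range(len(scores)), key=lambda i: -scores[i])
print(ranking)  # passage 1 is ranked first by construction
```

Because the score is read off an existing forward pass, this style of re-ranking requires no additional training, which matches the tweet's training-free claim in spirit.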