Shirley Wu (@shirleyyxwu) Twitter Tweets • TwiCopy

Shirley Wu

@shirleyyxwu

+ Follow

CS PhD student at @Stanford working with @jure and @james_y_zou on LLM agents and alignment | Prev USTC, Intern @NUSingapore, @MSFTResearch

ID: 1424421061878116356

linkhttps://cs.stanford.edu/~shirwu/ calendar_today08-08-2021 17:23:54

131 Tweet

2,2K Takipçi

233 Takip Edilen

Shirley Wu

@shirleyyxwu

4 months ago

If ML taught me one thing: objective matters. Everyone is motivated to get more paper because Google scholar says "(total) citation" and "h-index". Just an idea: let's just show “average citations per publication” and see how it goes. Better yet, prioritize this metric in

thumb_up_off_alt93

chat_bubble_outline6

repeat2

shareShare

Serina Chang

@serinachang5

3 months ago

Excited to have two papers accepted to ACL 2025 main! 🎉 1. ChatBench with jake hofman Ashton Anderson - we conduct a large-scale user study converting static benchmark questions into human-AI conversations, showing how benchmarks fail to predict human-AI outcomes.

Excited to have two papers accepted to ACL 2025 main! 🎉

1. ChatBench with <a href="/jakehofman/">jake hofman</a> <a href="/ashton1anderson/">Ashton Anderson</a> - we conduct a large-scale user study converting static benchmark questions into human-AI conversations, showing how benchmarks fail to predict human-AI outcomes.

thumb_up_off_alt91

chat_bubble_outline2

repeat10

shareShare

Shirley Wu

@shirleyyxwu

3 months ago

Can we ever truly trust foundation models—and if so, how? Our ICCV TrustFM workshop (t2fm-ws.github.io/T2FM-ICCV25/in…) is now accepting submissions (deadline: 8/1, attending: 10/19-10/23, Hawai'i) Submit, attend, and learn from everyone around the world who is making FMs more

thumb_up_off_alt39

chat_bubble_outline0

repeat7

shareShare

Jure Leskovec

@jure

3 months ago

Announcing Biomni — the first general-purpose biomedical AI agent. Biomni is a free web platform where biomedical scientists can immediately delegate their tasks to Biomni, starting today! Biomni automates literature reviews, hypothesis generation, protocol design,

thumb_up_off_alt353

chat_bubble_outline12

repeat81

shareShare

Diyi Yang

@diyi_yang

3 months ago

🤝 Humans + AI = Better together? Our #ACL2025 tutorial offers an interdisciplinary overview of human-AI collaboration to explore its goals, evaluation, and societal impacts 🤖

thumb_up_off_alt115

chat_bubble_outline6

repeat14

shareShare

Sahil Verma

@sahil1v

3 months ago

🚨 New Paper! 🚨 Guard models slow, language-specific, and modality-limited? Meet OmniGuard that detects harmful prompts across multiple languages & modalities all using one approach with SOTA performance in all 3 modalities!! while being 120X faster 🚀 arxiv.org/abs/2505.23856

thumb_up_off_alt73

chat_bubble_outline1

repeat33

shareShare

James Zou

@james_y_zou

2 months ago

Excited to introduce #CollabLLM -- a method to train LLMs to collaborate better w/ humans! Selected as #icml2025 oral (top 1%)🏅 New multi-turn training objective + user simulator👇

thumb_up_off_alt51

chat_bubble_outline6

repeat7

shareShare

Andrew Ng

@andrewyng

2 months ago

One of the most effective things the U.S. or any other nation can do to ensure its competitiveness in AI is to welcome high-skilled immigration and international students who have the potential to become high-skilled. For centuries, the U.S. has welcomed immigrants, and this

thumb_up_off_alt1,1K

chat_bubble_outline93

repeat285

shareShare