Shirley Wu (@shirleyyxwu) 's Twitter Profile
Shirley Wu

@shirleyyxwu

CS PhD student at @Stanford working with @jure and @james_y_zou on LLM agents and alignment | Prev USTC, Intern @NUSingapore, @MSFTResearch

ID: 1424421061878116356

linkhttps://cs.stanford.edu/~shirwu/ calendar_today08-08-2021 17:23:54

131 Tweet

2,2K Takipçi

233 Takip Edilen

Shirley Wu (@shirleyyxwu) 's Twitter Profile Photo

If ML taught me one thing: objective matters. Everyone is motivated to get more paper because Google scholar says "(total) citation" and "h-index". Just an idea: let's just show “average citations per publication” and see how it goes. Better yet, prioritize this metric in

Serina Chang (@serinachang5) 's Twitter Profile Photo

Excited to have two papers accepted to ACL 2025 main! 🎉 1. ChatBench with jake hofman Ashton Anderson - we conduct a large-scale user study converting static benchmark questions into human-AI conversations, showing how benchmarks fail to predict human-AI outcomes.

Excited to have two papers accepted to ACL 2025 main! 🎉 

1. ChatBench with <a href="/jakehofman/">jake hofman</a> <a href="/ashton1anderson/">Ashton Anderson</a> - we conduct a large-scale user study converting static benchmark questions into human-AI conversations, showing how benchmarks fail to predict human-AI outcomes.
Shirley Wu (@shirleyyxwu) 's Twitter Profile Photo

Can we ever truly trust foundation models—and if so, how? Our ICCV TrustFM workshop (t2fm-ws.github.io/T2FM-ICCV25/in…) is now accepting submissions (deadline: 8/1, attending: 10/19-10/23, Hawai'i) Submit, attend, and learn from everyone around the world who is making FMs more

Can we ever truly trust foundation models—and if so, how?

Our ICCV TrustFM workshop (t2fm-ws.github.io/T2FM-ICCV25/in…) is now accepting submissions (deadline: 8/1, attending: 10/19-10/23, Hawai'i)

Submit, attend, and learn from everyone around the world who is making FMs more
Jure Leskovec (@jure) 's Twitter Profile Photo

Announcing Biomni — the first general-purpose biomedical AI agent. Biomni is a free web platform where biomedical scientists can immediately delegate their tasks to Biomni, starting today! Biomni automates literature reviews, hypothesis generation, protocol design,

Diyi Yang (@diyi_yang) 's Twitter Profile Photo

🤝 Humans + AI = Better together? Our #ACL2025 tutorial offers an interdisciplinary overview of human-AI collaboration to explore its goals, evaluation, and societal impacts 🤖

Sahil Verma (@sahil1v) 's Twitter Profile Photo

🚨 New Paper! 🚨 Guard models slow, language-specific, and modality-limited? Meet OmniGuard that detects harmful prompts across multiple languages & modalities all using one approach with SOTA performance in all 3 modalities!! while being 120X faster 🚀 arxiv.org/abs/2505.23856

🚨 New Paper! 🚨
Guard models slow, language-specific, and modality-limited?

Meet OmniGuard that detects harmful prompts across multiple languages &amp; modalities all using one approach with SOTA performance in all 3 modalities!! while being 120X faster 🚀

arxiv.org/abs/2505.23856
James Zou (@james_y_zou) 's Twitter Profile Photo

Excited to introduce #CollabLLM -- a method to train LLMs to collaborate better w/ humans! Selected as #icml2025 oral (top 1%)🏅 New multi-turn training objective + user simulator👇

Andrew Ng (@andrewyng) 's Twitter Profile Photo

One of the most effective things the U.S. or any other nation can do to ensure its competitiveness in AI is to welcome high-skilled immigration and international students who have the potential to become high-skilled. For centuries, the U.S. has welcomed immigrants, and this