Caleb Ziems (@cjziems) Twitter Tweets • TwiCopy

Caleb Ziems

@cjziems

bsky.app/profile/calebz…

PhD student at @StanfordNLP 🌲 Working on socially-aware + dialect-robust #NLP, #CSS

ID: 439511556

Link: http://calebziems.com

Joined: 17-12-2011 21:44:20

131 Tweets

927 Followers

930 Following


Jeff Dean

My longtime collaborator Dave Patterson (long-time faculty at UC Berkeley, Association for Computing Machinery Turing Award winner, and fellow Laude Institute board member) wrote a very good op-ed about how continued investing in basic science and technology research is essential for the U.S. Dave

Likes: 755, Replies: 17, Retweets: 117

Yanzhe Zhang

@stevenyzzhang

4 months ago

Soon, AI agents will act for us—collaborating, negotiating, and sharing data. But can they truly protect our privacy? We simulate privacy-critical scenarios, using alternating search to evolve attacks and defenses, uncovering severe vulnerabilities and building protections.

Likes: 77, Replies: 2, Retweets: 26

Yanzhe Zhang

@stevenyzzhang

3 months ago

Introducing Generative Interfaces - a new paradigm beyond chatbots. We generate interfaces on the fly to better facilitate LLM interaction, so no more passive reading of long text blocks. Adaptive and Interactive: creates the form that best adapts to your goals and needs!

Likes: 134, Replies: 4, Retweets: 40

Joachim Baumann

@joabaum

3 months ago

🚨 New paper alert 🚨 Using LLMs as data annotators, you can produce any scientific result you want. We call this **LLM Hacking**. Paper: arxiv.org/pdf/2509.08825

Likes: 519, Replies: 16, Retweets: 110

Dora Zhao

@dorazhao9

3 months ago

LLMs are powerful, but they don't know your world. This knowledge gap can lead to generic, unhelpful, or incorrect responses. In our #UIST2025 paper, we explore how users can fill these gaps through creating a community knowledge ecosystem, giving models access to more specific

Likes: 59, Replies: 3, Retweets: 27

Jenna Russell

@jennajrussell

2 months ago

AI is already at work in American newsrooms. We examine 186k articles published this summer and find that ~9% are either fully or partially AI-generated, usually without readers having any idea. Here's what we learned about how AI is influencing local and national journalism:

Likes: 137, Replies: 4, Retweets: 50

Zora Wang

@zhiruow

a month ago

Agents are joining us at work -- coding, writing, design. But how do they actually work, especially compared to humans? Their workflows tell a different story: They code everything, slow down human flows, and deliver low-quality work fast. Yet when teamed with humans, they shine

Likes: 244, Replies: 7, Retweets: 53

Tiancheng Hu

@tiancheng_hu

a month ago

Can AI simulate human behavior? 🧠 The promise is revolutionary for science & policy. But there’s a huge "IF": Do these simulations actually reflect reality? To find out, we introduce SimBench: The first large-scale benchmark for group-level social simulation. (1/9)

Likes: 53, Replies: 3, Retweets: 22

elie

@eliebakouch

a month ago

Training LLMs end to end is hard. Very excited to share our new blog (book?) that covers the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably huggingface.co/spaces/Hugging…

Likes: 4.4K, Replies: 99, Retweets: 714

Houjun Liu

@houjun_liu

a month ago

Good morning Suzhou! Amelia Hardy and I will be at EMNLP 2025 to present our work *TODAY, Hall C, 12:30PM; paper number 426* Come learn: ✅ why likelihood is important to simultaneously optimize with attack success ✅ online preference learning tricks for LM falsification

Likes: 18, Replies: 1, Retweets: 9

Akaash Kolluri

@kolluriakaash

a month ago

New EMNLP main paper: “Finetuning LLMs for Human Behavior Prediction in Social Science Experiments” We built SocSci210—2.9M human responses from 210 social science experiments. Finetuning Qwen2.5-14B on SocSci210 beats its base model by 26% & GPT-4o by 13% on unseen studies.🧵

Likes: 30, Replies: 2, Retweets: 8
