Chatbots have biases in what they say—but what about biases in what they WON'T say? Our new paper (w/ Victoria Li & Yida Chen) shows that personal info like a user's race, age, or love for the Los Angeles Chargers can sway whether ChatGPT refuses a request. arxiv.org/abs/2407.06866
🚨 New preprint! 🚨
Everyone loves causal interpretability. It’s coherently defined! It makes testable predictions about mechanistic interventions! But what if we had a different objective: predicting model behavior not under mechanistic interventions, but on unseen input data?