Seraphina Goldfarb-Tarrant (@seraphinagt) 's Twitter Profile
Seraphina Goldfarb-Tarrant

@seraphinagt

Head of AI Safety @cohere. Phd @EdinburghNLP @InfAtED.
If you don't recognise me, that's because I am invisible dl.acm.org/doi/10.1145/25…

ID: 85239409

linkhttps://seraphinatarrant.github.io calendar_today26-10-2009 04:41:41

243 Tweet

976 Followers

379 Following

Seraphina Goldfarb-Tarrant (@seraphinagt) 's Twitter Profile Photo

A break from the usual #ACL2024 program: Preethi Seshadri 🔥is working with me on open research for fairness of LLMs in hiring. 👇 is a short form to share a resume so we can check our synth data is predictive of real data. Please help! 🙏💖 (no training/ sharing, we delete it after)

Seraphina Goldfarb-Tarrant (@seraphinagt) 's Twitter Profile Photo

A good reminder for those of us in LLM land (like me) that we don't only need to mitigate gender biases *caused* by LM generation, but we should enable researchers to *use* LMs to discover biases in human content. From Isabelle Augenstein's keynote genderbiasnlp #ACL2024

A good reminder for those of us in LLM land (like me) that we don't only need to mitigate gender biases *caused* by LM generation, but we should enable researchers to *use* LMs to discover biases in human content. From <a href="/IAugenstein/">Isabelle Augenstein</a>'s keynote <a href="/genderbiasnlp/">genderbiasnlp</a> #ACL2024
Seraphina Goldfarb-Tarrant (@seraphinagt) 's Twitter Profile Photo

First oral in genderbiasnlp on stereotype reduction -- it's nice to see human evals on stereotypes instead of just benchmark results! *especially* because the benchmarks are so flawed (srsly don't use just the benchmarks) #ACL2024

First oral in <a href="/genderbiasnlp/">genderbiasnlp</a> on stereotype reduction -- it's nice to see human evals on stereotypes instead of just benchmark results! *especially* because the benchmarks are so flawed (srsly don't use just the benchmarks) #ACL2024
Seraphina Goldfarb-Tarrant (@seraphinagt) 's Twitter Profile Photo

Final oral of genderbiasnlp ! Actually being given by my MSc supervisor Fei 😂🔥 who is the last author. They do a super detailed taxonomy of gender bias types (way beyond usual) and use it to analyse bias in educational materials.

Final oral of <a href="/genderbiasnlp/">genderbiasnlp</a> ! Actually being given by my MSc supervisor Fei 😂🔥 who is the last author. They do a super detailed taxonomy of gender bias types (way beyond usual) and use it to analyse bias in educational materials.
Seraphina Goldfarb-Tarrant (@seraphinagt) 's Twitter Profile Photo

Last event of the day genderbiasnlp , the lightning talks!! Particularly love this one so far: an analysis of over refusal of certain identities in LLMs 🔥. We also don't talk about how safety tuning risks exacerbating erasure of minorities 🙊. We should 📢. #ACL2024

Last event of the day <a href="/genderbiasnlp/">genderbiasnlp</a> , the lightning talks!! Particularly love this one so far: an analysis of over refusal of certain identities in LLMs 🔥. We also don't talk about how safety tuning risks exacerbating erasure of minorities 🙊. We should 📢. #ACL2024
Seraphina Goldfarb-Tarrant (@seraphinagt) 's Twitter Profile Photo

A cool survey from our genderbiasnlp lightning talks: it's a nice visualisation of longitudinal fads in measurement and debiasing in language models #ACL2024

A cool survey from our <a href="/genderbiasnlp/">genderbiasnlp</a> lightning talks: it's a nice visualisation of longitudinal fads in measurement and debiasing in language models #ACL2024