
Maggie Huan
@maggie_h2024
Master’s @PennEngineers, working on language and RL.
ID: 1631964133238231041
https://maggiehuan.github.io/ 04-03-2023 10:25:57
62 Tweets
79 Followers
340 Following

Introducing 🌈 Rainbow Teaming, a new method for generating diverse adversarial prompts for LLMs via LLMs. It's a versatile tool 🛠️ for diagnosing model vulnerabilities across domains and creating data to enhance robustness & safety 🦺. Co-lead w/ Sharath Raparthy & Andrei Lupu




So excited and so very humbled to be stepping in to head AI Safety and Alignment at Google DeepMind. Lots of work ahead, both for present-day issues and for extreme risks in anticipation of capabilities advancing.









For this week’s NLP Seminar, we are thrilled to host Nicholas Tomlin to talk about Reasoning with Language Models! When: 4/17 Thurs 11am PT. Non-Stanford affiliates registration form: forms.gle/cxRmN3oovz8w7a…




