
Hugh Zhang
@hughbzhang
research @scale_AI. co-created @gradientpub.
ID: 1075466686709395457
http://hughbzhang.com 19-12-2018 19:03:34
420 Tweet
3,3K Takipçi
1,1K Takip Edilen






Jailbreaking evals ~always focus on simple chatbots—excited to announce AgentHarm, a dataset for measuring harmfulness of LLM 𝑎𝑔𝑒𝑛𝑡𝑠 developed at @AISafetyInst in collaboration with Gray Swan AI! 🧵 1/N





always excited to see what Jacob Steinhardt is up to!



We’re low on editorial bandwidth, so we’re making a few (hopefully temporary!) changes to our process at The Gradient — I sat down with Hugh Zhang and Andrey Kurenkov to discuss our history and where things stand thegradient.pub/podcasts/some-…