bilal 🇵🇸
@bilalchughtai_
interpretability @ google deepmind | ai safety | cambridge mmath
ID: 3297675443
https://bilalchughtai.co.uk/ 25-05-2015 10:59:27
229 Tweet
777 Followers
660 Following
If you're at NeurIPS, come see Kaivu Hariharan present our LLM situational awareness benchmark, the SAD paper, on Friday, 4:30-7:30pm, West Ballroom A-D #5101
SAD to announce i won't be at neurips this year, but Kaivu Hariharan will be presenting our work on situational awareness on friday from 4:30-7:30pm in west ballroom a-d, poster #5101 - go check it out!
Pretty wild: Pope Leo XIV says that the potential existential risk from AI "demands serious attention"
🧵 Announcing Open Philanthropy's Technical AI Safety RFP! We're seeking proposals across 21 research areas to help make AI systems more trustworthy, rule-following, and aligned, even as they become more capable.
"How, exactly, could AI take over by 2027?" Introducing AI 2027: a deeply-researched scenario forecast I wrote alongside Scott Alexander, Eli Lifland, and Thomas Larsen