Gabriel Recchia
@mesotronium
Cognitive scientist, previously at @Cambridge_Uni's Winton Centre for Risk and Evidence Communication, now working on LLM capability evaluation & alignment
ID: 182045911
http://gabrielrecchia.com 23-08-2010 18:07:31
503 Tweets
269 Followers
322 Following
This has been accepted at ICML 2025! See you all in Vancouver. Credit to Tingchen Fu for leading this work and to my wonderful collaborators: mrinank 🍂, philip, Shay B. Cohen, and David Krueger.
This is an excellent and, I think, very important piece that I hope gets the attention it deserves within the AI safety community. Many congratulations to Josh Engels, David D. Baek, Subhash Kantamneni, and Max Tegmark.
Humans are often very wrong. This is a big problem if you want to use human judgment to oversee super-smart AI systems. In our new post, Geoffrey Irving argues that we might be able to deal with this issue – not by fixing the humans, but by redesigning oversight protocols.