Vikrant Varma
@vikrantvarma_
Research Engineer working on AI alignment at DeepMind.
ID: 1678776235441393666
11-07-2023 14:40:25
20 Tweets
632 Followers
22 Following
Very cool find by Senthooran Rajamanoharan, Arthur Conmy, and the rest of the DeepMind mechinterp team! I’m excited by the rate of progress here.
Excited to see what people try with these shiny new open-source SAEs! Great work by Senthooran Rajamanoharan and the team on pushing SOTA here.
We are excited to release a short course on AGI safety! The course offers a concise and accessible introduction to AI alignment problems and our technical & governance approaches, consisting of short recorded talks and exercises (75 minutes total). deepmindsafetyresearch.medium.com/1072adb7912c