LASR Labs
@lasrlabs
London AI Safety Research (LASR) Labs is an AI safety research programme focussed on reducing the risk of loss of control to advanced AI.
ID: 1907445660019990528
http://LASRlabs.org 02-04-2025 14:51:20
6 Tweets
32 Followers
51 Following
0/8 I’m super excited about work done by my LASR scholars David Chanin, Tomáš, Hardik Bhatnagar and James Wilken-Smith. This work demonstrates a critical but likely solvable issue with SAEs! arXiv link: arxiv.org/abs/2409.14507. Blog post: tinyurl.com/u45waafj