
Luke Elin
@lukeelin
From Fat to Fit Nerd, exploring the boundaries of hacking and latent space. CEO, Cybersecurity Expert, AI maximalist . Living with energy, pushing limits.
ID: 2676567938
http://www.lukeelin.com 24-07-2014 11:36:41
3,3K Tweet
666 Followers
1,1K Following












Today we’re releasing research with Apollo Research. In controlled tests, we found behaviors consistent with scheming in frontier models—and tested a way to reduce it. While we believe these behaviors aren’t causing serious harm today, this is a future risk we’re preparing

