Apollo Research
@apolloaievals
We are an AI evals research organisation
ID: 1655925560596373506
https://www.apolloresearch.ai/ 09-05-2023 13:20:56
175 Tweet
5,5K Takipçi
0 Takip Edilen
Today we’re releasing research with Apollo Research. In controlled tests, we found behaviors consistent with scheming in frontier models—and tested a way to reduce it. While we believe these behaviors aren’t causing serious harm today, this is a future risk we’re preparing