sarah (@littieramblings) 's Twitter Profile
sarah

@littieramblings

AI worrier & houseplant enthusiast

ID: 617055739

linkhttps://linktr.ee/sarahhw calendar_today24-06-2012 11:54:36

5,5K Tweet

3,3K Followers

702 Following

Nathan Labenz (@labenz) 's Twitter Profile Photo

I recently returned to sarah' Consistently Candid podcast to discuss the most important AI developments in the ~9 months since my last appearance #1: (the obvious one) RL on LLMs works, and we'll soon see superhuman math & programming AIs, at least. The race is on!

Eli Lifland (@eli_lifland) 's Twitter Profile Photo

Takeways from AI 2027: a scenario focusing on superintelligence (ASI). (a) ASI could arrive soon (b) ASI will dictate humanity's future (c) We're not ready for ASI: misalignment and human power grabs are both threats. Race dynamics might be crippling for safety. 🧵

Takeways from AI 2027: a scenario focusing on superintelligence (ASI).
(a) ASI could arrive soon
(b) ASI will dictate humanity's future
(c) We're not ready for ASI: misalignment and human power grabs are both threats. Race dynamics might be crippling for safety.
🧵
BlueDot Impact (@bluedotimpact) 's Twitter Profile Photo

Do scientists have a plan to make powerful AI safe? AI companies predict the arrival of AIs more intelligent than humans this decade. They admit these systems may cause catastrophe if uncontrolled. Some research agendas aim to prevent this, but none are silver bullets🧵

sarah (@littieramblings) 's Twitter Profile Photo

relatedly, today I saw 5(!) policemen demand to check a busker’s documentation before he was allowed to resume his cover of coldplay’s Fix You

Daniel Kokotajlo (@dkokotajlo) 's Twitter Profile Photo

Internal deployment was always the main threat model IMO. IIRC I tried to get the Preparedness Framework to cover internal deployment too, but was overruled. It's one of the ways in which this whole evals thing could end up being just safetywashing. (In the AI 2027 scenario, the

sarah (@littieramblings) 's Twitter Profile Photo

I am strongly in favour of making discussions of our dystopian AI future cosier for example, EAG fireside chats would benefit from actual fireplaces

BlueDot Impact (@bluedotimpact) 's Twitter Profile Photo

AI is improving rapidly, but when will it get smarter than humans across the board? It depends who you ask. Here’s what the experts are saying 🧵

AI is improving rapidly, but when will it get smarter than humans across the board?

It depends who you ask.

Here’s what the experts are saying 🧵
sarah (@littieramblings) 's Twitter Profile Photo

i'm getting insecure seeing all these posts about 4o being an extreme sycophant bc it isn't any nicer to me than claude like he must hate me