Peter Barnett (@peterbarnett_)'s Twitter Profile
Peter Barnett

@peterbarnett_

Trying to ensure the future is bright. Researcher at @MIRIBerkeley

ID: 824553073976635394

Link: http://peterbarnett.org
Joined: 26-01-2017 09:42:27

354 Tweets

454 Followers

454 Following

Ricki Heicklen (@tradegal_)

a fun prompt for introspection is "what contribution to societal flourishing do you most crave to be recognized for" and mine is definitely my list of synonym-based puns Unparalleled Misalignments

Peter Barnett (@peterbarnett_)

I like using o3 for quick and dirty automated data analysis. Eg give it a CSV, plot things, look at trends, brainstorm what to look into. I usually just use the default chatGPT interface for this. Are there better ways to do this, or easy ways to do this with other models?

Eliezer Yudkowsky ⏹️ (@esyudkowsky)

Nate Soares and I are publishing a traditional book: _If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All_. Coming in Sep 2025. You should probably read it! Given that, we'd like you to preorder it! Nowish!

Peter Barnett (@peterbarnett_)

AlphaEvolve uses Gemini 2.0 Flash and Pro, I'd guess it would be substantially more capable today if they subbed in Gemini 2.5 models.

Peter Barnett (@peterbarnett_)

I'm vegan but I am fairly pro people eating lobster (assuming you don't boil it alive). Lobsters have about the same number of neurons as fruit flies, despite weighing 500,000x more. (Data from this plot is from o3 and might be wrong lol)

Peter Barnett (@peterbarnett_)

I think it’s under discussed that many plans for navigating the transition to ASI (at least implicitly) likely involve pausing AI capabilities progress for an extended period of time.

Loquacious Bibliophilia ⏸️ (@locbibliophilia)

Yoshua Bengio is a continuing inspiration in this dire time with AI safety, and a reminder of why we do what we do - out of a sincere, and genuine love for humanity.

Peter Barnett (@peterbarnett_)

Weak prediction that Claude 4 will be closer to the AI 2027 extrapolation than the METR trend (including the METR trend updated for o3).

Charles Goddard (@chargoddard)

🤯 MIND-BLOWN! A new paper just SHATTERED everything we thought we knew about AI reasoning! This is paradigm-shifting. A MUST-READ. Full breakdown below 👇 🧵 1/23

Peter Barnett (@peterbarnett_)

Maybe AI safety orgs should have a "Carthago delenda est" passage that they add to the end of all their outputs, saying "To be clear, we think that AI development poses a double digit percent chance of literally killing everyone; this should be considered crazy and unacceptable".

Peter Barnett (@peterbarnett_)

Successful sabotage rates were “low”, that is Claude Opus 4 manages to sabotage and evade detection on “only” 30% of tasks. Very cool work, but this doesn’t seem that low to me!