Peter Barnett (@peterbarnett_) Twitter Tweets • TwiCopy

sma ⏹️

a year ago

Man goes to doctor. "Doctor, I'm worried AGI will kill us all." "Don't worry," says doctor, "they wouldn't build it if they thought it might kill everyone." The man breaks down, sobbing. "But doctor, I *am* building AGI..."

Likes: 639 · Replies: 10 · Reposts: 66


Aaron Scher

@aaronscher

7 months ago

Excited to have this research agenda out! Now here's my sarcastic/opinionated version 🧵

Likes: 18 · Replies: 1 · Reposts: 4


Ricki Heicklen

@tradegal_

7 months ago

a fun prompt for introspection is "what contribution to societal flourishing do you most crave to be recognized for" and mine is definitely my list of synonym-based puns Unparalleled Misalignments

Likes: 15,15K · Replies: 92 · Reposts: 824


Peter Barnett

@peterbarnett_

7 months ago

I like using o3 for quick-and-dirty automated data analysis. E.g., give it a CSV, plot things, look at trends, brainstorm what to look into. I usually just use the default ChatGPT interface for this. Are there better ways to do this, or easy ways to do this with other models?

Likes: 9 · Replies: 1 · Reposts: 0


Eliezer Yudkowsky ⏹️

@esyudkowsky

7 months ago

Nate Soares and I are publishing a traditional book: _If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All_. Coming in Sep 2025. You should probably read it! Given that, we'd like you to preorder it! Nowish!

Likes: 1,1K · Replies: 187 · Reposts: 302


Peter Barnett

@peterbarnett_

6 months ago

AlphaEvolve uses Gemini 2.0 Flash and Pro; I'd guess it would be substantially more capable today if they subbed in Gemini 2.5 models.

Likes: 5 · Replies: 0 · Reposts: 1


Peter Barnett

@peterbarnett_

6 months ago

I'm vegan but I am fairly pro people eating lobster (assuming you don't boil it alive). Lobsters have about the same number of neurons as fruit flies, despite weighing 500,000x more. (Data from this plot is from o3 and might be wrong lol)

Likes: 17 · Replies: 1 · Reposts: 0


Peter Barnett

@peterbarnett_

6 months ago

Casey, not to be mistaken for U-Casey

Likes: 15 · Replies: 0 · Reposts: 0


Peter Barnett

@peterbarnett_

6 months ago

I think it’s underdiscussed that many plans for navigating the transition to ASI (at least implicitly) likely involve pausing AI capabilities progress for an extended period of time.

Likes: 18 · Replies: 0 · Reposts: 0


Loquacious Bibliophilia ⏸️

@locbibliophilia

6 months ago

Yoshua Bengio is a continuing inspiration in this dire time with AI safety, and a reminder of why we do what we do - out of a sincere, and genuine love for humanity.

Likes: 77 · Replies: 1 · Reposts: 10


Peter Barnett

@peterbarnett_

6 months ago

Weak prediction that Claude 4 will be closer to the AI 2027 extrapolation than the METR trend (including the METR trend updated for o3).

Likes: 6 · Replies: 0 · Reposts: 0


Charles Goddard

@chargoddard

6 months ago

🤯 MIND-BLOWN! A new paper just SHATTERED everything we thought we knew about AI reasoning! This is paradigm-shifting. A MUST-READ. Full breakdown below 👇 🧵 1/23

Likes: 1,1K · Replies: 92 · Reposts: 211


Séb Krier

@sebkrier

6 months ago

mfw I prompt both gemini and claude to compare

Likes: 406 · Replies: 3 · Reposts: 18


Peter Barnett

@peterbarnett_

6 months ago

Maybe AI safety orgs should have a "Carthago delenda est" passage that they add to the end of all their outputs, saying "To be clear, we think that AI development poses a double digit percent chance of literally killing everyone; this should be considered crazy and unacceptable".

Likes: 174 · Replies: 8 · Reposts: 14


Peter Barnett

@peterbarnett_

6 months ago

Pack it up, the trends are clear, AI only gets less capable from now onwards.

Likes: 143 · Replies: 2 · Reposts: 5


Peter Barnett

@peterbarnett_

6 months ago

Successful sabotage rates were “low”; that is, Claude Opus 4 manages to sabotage and evade detection on “only” 30% of tasks. Very cool work, but this doesn’t seem that low to me!

Likes: 5 · Replies: 1 · Reposts: 0
