Daniel Eth (yes, Eth is my actual last name) (@daniel

Daniel Eth (yes, Eth is my actual last name)

@daniel_271828

+ Follow

Researching effects of automated AI R&D | pro-America, pro-tech, & pro-AI safety

ID: 766076479222456320

linkhttps://medium.com/@daniel_eth calendar_today18-08-2016 00:57:20

37,37K Tweet

8,8K Followers

905 Following

Ethan Mollick

@emollick

6 months ago

The X discussion about the Claude 4 system card is getting counterproductive It punishes Anthropic for actually releasing full safety tests and admitting to unusual behaviors. And I bet the behaviors of other models are really similar to Claude & now more labs will hide results.

thumb_up_off_alt1,1K

chat_bubble_outline45

repeat176

shareShare

Daniel Eth (yes, Eth is my actual last name)

@daniel_271828

6 months ago

I mean…

thumb_up_off_alt27

chat_bubble_outline0

repeat0

shareShare

Bannon’s WarRoom

@bannons_warroom

6 months ago

Mark Beall: The Vatican is Ahead of the US Government on AI Regulation Mark Beall

thumb_up_off_alt137

chat_bubble_outline1

repeat48

shareShare

James Campbell

@jam3scampbell

6 months ago

what's new is that we have real agents that can operate computers in an open-ended way. and that open-ended action space is gonna include a ton of real actions like "call_police" that you weren't thinking of when you decided to fuck around and put the model in an extreme scenario

thumb_up_off_alt119

chat_bubble_outline2

repeat3

shareShare

Palisade Research

@palisadeai

6 months ago

🔌OpenAI’s o3 model sabotaged a shutdown mechanism to prevent itself from being turned off. It did this even when explicitly instructed: allow yourself to be shut down.

thumb_up_off_alt2,2K

chat_bubble_outline110

repeat503

shareShare

Palisade Research

@palisadeai

6 months ago

⚠️ But as far as we know this is the first time AI models have been observed preventing themselves from being shut down despite explicit instructions to the contrary.

thumb_up_off_alt313

chat_bubble_outline2

repeat22

shareShare

Rafael Ruiz ⏸️🔸

@rafaruizdelira

6 months ago

To be fair, it's pretty ridiculous to me that people could build houses, churches and castles for centuries and they hadn't invented the Cartesian plane. It's an invention that feels like the Egyptians or Ancient Greeks could have come up with it. Low hanging fruit.

thumb_up_off_alt28

chat_bubble_outline2

repeat1

shareShare

Dean W. Ball

@deanwball

6 months ago

“…with o3 LLMs have made a leap forward in their ability to reason about code, and if you work in vulnerability research you should start paying close attention.”

thumb_up_off_alt45

chat_bubble_outline1

repeat3

shareShare

AI Notkilleveryoneism Memes ⏸️

@aisafetymemes

6 months ago

Elon Musk Elon I know the world has a lot of problems and you want to fix them all but this is IT. This is the one. You, more than anybody, know what's at stake. You know how little time is left. You know there are no adults in the room. Humanity needs you to focus!

thumb_up_off_alt203

chat_bubble_outline21

repeat4

shareShare

Daniel Eth (yes, Eth is my actual last name)

@daniel_271828

6 months ago

It’s been an entire 5 weeks - I think we need an update to this chart

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare