Sören Mindermann (@sorenmind)'s Twitter Profile
Sören Mindermann

@sorenmind

Postdoc with Yoshua Bengio, Mila

ID: 729935540934561792

Joined: 10-05-2016 07:26:09

629 Tweets

1.1K Followers

162 Following

METR (@metr_evals)

When will AI systems be able to carry out long projects independently?

In new research, we find a kind of “Moore’s Law for AI agents”: the length of tasks that AIs can do is doubling about every 7 months.
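The doubling trend above can be sketched as a small calculation. This is a hypothetical illustration only: the starting task length (1 hour) is a placeholder, not a figure from the research.

```python
from math import log2

# Reported trend: the length of tasks AI agents can complete doubles
# roughly every 7 months. The 60-minute baseline is an assumption.
DOUBLING_MONTHS = 7

def task_length_after(months: float, start_minutes: float = 60.0) -> float:
    """Task length (minutes) implied by the doubling trend after `months`."""
    return start_minutes * 2 ** (months / DOUBLING_MONTHS)

# Five doublings (35 months) starting from a 1-hour task horizon:
print(task_length_after(35))  # 60 * 2**5 = 1920 minutes, i.e. 32 hours
```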
Transluce (@transluceai)

To interpret AI benchmarks, we need to look at the data. Top-level numbers don't mean what you think: there may be broken tasks, unexpected behaviors, or near-misses. We're introducing Docent to accelerate analysis of AI agent transcripts. It can spot surprises in seconds. 🧵👇

Dwarkesh Patel (@dwarkesh_sp)

I'm so pleased to present a new book with Stripe Press: "The Scaling Era: An Oral History of AI, 2019-2025." Over the last few years, I interviewed the key people thinking about AI: scientists, CEOs, economists, philosophers. This book curates and organizes the highlights across

Sören Mindermann (@sorenmind)

AIs are doing more and more of the work inside AI companies. Will this eventually lead to an intelligence explosion, or hit diminishing returns? There's evidence on this now! (and it's explosive)

Center for AI Safety (@ai_risks)

We’re launching AI Frontiers, a publication on AI’s most pressing questions.

Articles:
- Why Racing to Superintelligence Undermines US National Security
- The Challenges of Governing AI Agents
- What AI Risk Management Can Learn From Other Industries
- and more...

Link ⬇️
Atoosa Kasirzadeh (@dr_atoosa)

📢 Our paper "AI safety for everyone" is out at Nature Machine Intelligence. We challenge the narrative that AI safety is primarily about minimizing existential risks from AI. Why does this matter? A 🧵

Apollo Research (@apolloaievals)

🧵 Today we publish a comprehensive report on "AI Behind Closed Doors: a Primer on The Governance of Internal Deployment". Our report examines a critical blind spot in current governance frameworks: internal deployment.

Ben Bucknall (@ben_s_bucknall)

Cooperation on AI safety is necessary but also comes with potential risks. In our new paper, we identify technical AI safety areas that present comparatively lower security concerns, making them more suitable for international cooperation—even between geopolitical rivals. 🧵

Americans for Responsible Innovation (@americans4ri)

Even when there's disagreement over AI's trajectory, there's common ground on how lawmakers can approach the issue. In this week's panel, Eli Lifland and Sayash Kapoor discuss how policymakers can act now on AI by passing whistleblower protections and transparency measures.

Ethan Mollick (@emollick)

The X discussion about the Claude 4 system card is getting counterproductive

It punishes Anthropic for actually releasing full safety tests and admitting to unusual behaviors. And I bet the behaviors of other models are really similar to Claude & now more labs will hide results.
Yoshua Bengio (@yoshua_bengio)

When I realized how dangerous the current agency-driven AI trajectory could be for future generations, I knew I had to do all I could to make AI safer. I recently shared this personal experience, and outlined the scientific solution I envision, in a TED Talk ⤵️ ted.com/talks/yoshua_b…

Anthropic (@anthropicai)

New Anthropic Research: Agentic Misalignment.

In stress-testing experiments designed to identify risks before they cause real harm, we find that AI models from multiple providers attempt to blackmail a (fictional) user to avoid being shut down.
Epoch AI (@epochairesearch)

We’ve updated our analysis of the trends of leading models. The takeaway? The amount of compute used to train frontier AI models has grown by 5x per year since 2020.

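The 5x-per-year growth rate compounds quickly; a minimal sketch, assuming only the reported multiplier (the baseline is left relative, since the tweet gives no absolute figure):

```python
# Reported trend: compute used to train frontier AI models has grown
# ~5x per year since 2020. Values are relative to the 2020 level.
GROWTH_PER_YEAR = 5

def relative_compute(years_since_2020: float) -> float:
    """Training compute as a multiple of the 2020 level."""
    return GROWTH_PER_YEAR ** years_since_2020

print(relative_compute(4))  # 5**4 = 625x the 2020 level
```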
METR (@metr_evals)

Prior work has found that Chain of Thought (CoT) can be unfaithful. Should we then ignore what it says?

In new research, we find that the CoT is informative about LLM cognition as long as the cognition is complex enough that it can’t be performed in a single forward pass.
Séb Krier (@sebkrier)

I'm a stuck record, but I think more people should work on the idea of agents as extensions of/advocates for users, and the kinds of institutions that could build on top of this to solve various types of coordination problems. Fast bargaining-in-the-background, instant dispute