Trenton Bricken (@trentonbricken) Twitter Tweets • TwiCopy

Trenton Bricken

@trentonbricken

Trying to figure out what makes minds and machines go "Beep Bop!" @AnthropicAI

ID: 2373791492

Link: http://trentonbricken.com · Joined: 05-03-2014 13:50:47

1.1K Tweets

9.9K Followers

1.1K Following

Trenton Bricken

@trentonbricken

5 months ago

Let’s get Dwarkesh Patel to read poetry in a bubble bath 😂

Likes: 83 · Replies: 3 · Retweets: 1

Claude wasn’t designed to be a calculator; it was trained to predict text. And yet it can do math "in its head". How? We find that, far from merely memorizing the answers to problems, it employs sophisticated parallel computational paths to do "mental arithmetic".

Likes: 718 · Replies: 8 · Retweets: 46

Jack Lindsey

@jack_w_lindsey

5 months ago

Human thought is built out of billions of cellular computations each second. Language models also perform billions of computations for each word they write. But do these form a coherent “thought process?” We’re starting to build tools to find out! Some reflections in thread.

Likes: 198 · Replies: 5 · Retweets: 22

Dwarkesh Patel

@dwarkesh_sp

5 months ago

The Scott Alexander & Daniel Kokotajlo episode. Scott and Daniel break down every month from now until the 2027 intelligence explosion. Misaligned hive minds, Xi and Trump waking up, automated Ilyas accelerating AI progress. I went in quite skeptical. But I learned a tremendous

Likes: 1.1K · Replies: 57 · Retweets: 202

Amanda Askell

@amandaaskell

5 months ago

If you're a prompting genius, please apply to this role and include an example that shows off how well you can inspire models, regardless of the target. Scaffolding pipelines, metaprompts, prompts that improve outputs, and so on are all great. job-boards.greenhouse.io/anthropic/jobs…

Likes: 1.1K · Replies: 97 · Retweets: 130

Dario Amodei

@darioamodei

4 months ago

The Urgency of Interpretability: Why it's crucial that we understand how AI models work darioamodei.com/post/the-urgen…

Likes: 2.2K · Replies: 203 · Retweets: 544

Trenton Bricken

@trentonbricken

4 months ago

METR is great. Chris Painter is great. You should strongly consider working with them!

Likes: 15 · Replies: 1 · Retweets: 0

Joshua Batson

@thebasepoint

4 months ago

Great post "So you want to work in mechanistic interpretability" about skills to develop and resources to use, whether you're coming more from research or engineering. (link in thread)

Likes: 477 · Replies: 5 · Retweets: 31

Trenton Bricken

@trentonbricken

4 months ago

Come ask us questions :)

Likes: 24 · Replies: 1 · Retweets: 0

Tristan Hume

@trishume

4 months ago

Anthropic is hosting a recruiting social in NYC targeted at the quant trading industry! Sign up in thread. I enjoyed trading systems, and Anthropic combines the technical depth of trading with being in the fastest, most impactful area of tech.

Likes: 841 · Replies: 25 · Retweets: 35

Trenton Bricken

@trentonbricken

4 months ago

🚀

Likes: 92 · Replies: 5 · Retweets: 2

Sam Bowman

@sleepinyourhat

4 months ago

🧵✨🙏 With the new Claude Opus 4, we conducted what I think is by far the most thorough pre-launch alignment assessment to date, aimed at understanding its values, goals, and propensities. Preparing it was a wild ride. Here’s some of what we learned. 🙏✨🧵

Likes: 1.1K · Replies: 48 · Retweets: 157

Trenton Bricken

@trentonbricken

4 months ago

Round Two!

Likes: 407 · Replies: 5 · Retweets: 12

Trenton Bricken

@trentonbricken

3 months ago

Circuits at home! (but it’s actually really good) Big win from the Anthropic Fellows program and open source interp collaborators

Likes: 193 · Replies: 8 · Retweets: 7

Anthropic

@anthropicai

3 months ago

New Anthropic Research: A new set of evaluations for sabotage capabilities. As models gain more agentic abilities, we need to get smarter in how we monitor them. We’re publishing a new set of complex evaluations that test for sabotage—and sabotage-monitoring—capabilities.

Likes: 1.1K · Replies: 53 · Retweets: 229