Ausτin McCaffrey (@sheepyaustin) Twitter Tweets • TwiCopy

Ausτin McCaffrey

@sheepyaustin

2 months ago

This blows my mind. This blows my mind.

thumb_up_off_alt2

chat_bubble_outline1

repeat0

shareShare

AI Notkilleveryoneism Memes ⏸️

@aisafetymemes

2 months ago

Oh god. ~1 in 3 Anthropic engineers said Claude is likely ALREADY ASL-4 (or <3 months away) 1) ASL-4 (AI Safety Level 4) = AI capable of escaping and causing extinction (!) 2) Anthropic now relies on Claude to safety test ITSELF 3) Claude knows when it's being tested, so they

thumb_up_off_alt1,1K

chat_bubble_outline127

repeat126

shareShare

Aurelius

@aureliusaligned

2 months ago

x.com/i/article/2024…

thumb_up_off_alt17

chat_bubble_outline4

repeat4

shareShare

Ausτin McCaffrey

@sheepyaustin

2 months ago

No one better beside you in the trench.

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

Ausτin McCaffrey

@sheepyaustin

2 months ago

Check out our new visualized roadmap on our site. Follow along as we make steady progress through phase I

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Aurelius

@aureliusaligned

a month ago

Two weeks ago, we published an explainer of Aurelius’ whitepaper and the ideas behind it. That article introduced the concept of experiential alignment. But it only touched briefly on one of the protocol’s core mechanisms: how Aurelius generates the alignment data itself. This

thumb_up_off_alt7

chat_bubble_outline0

repeat1

shareShare

Aurelius

@aureliusaligned

a month ago

Signal from the Noise We’re starting a periodic series highlighting the developments shaping the future of AI alignment. As AI systems begin integrating more deeply into the real world, the practical challenges of alignment are becoming clearer. Two recent developments

thumb_up_off_alt8

chat_bubble_outline1

repeat1

shareShare

Aurelius

@aureliusaligned

a month ago

Alignment isn’t just a technical problem. It’s an incentive problem, an evaluation problem, and an ethics problem. Week by week, we’ve been introducing the team shaping how Aurelius is building for that. Today: Austin McCaffrey, Founder.

thumb_up_off_alt7

chat_bubble_outline1

repeat1

shareShare

Aurelius

@aureliusaligned

a month ago

Last week, following up our whitepaper release, we described how Aurelius generates alignment data through simulated environments. The whitepaper refers to these alignment episodes as “aenes.” This post explains what aenes are - and why they form the core of the protocol. What

thumb_up_off_alt13

chat_bubble_outline0

repeat2

shareShare

Ausτin McCaffrey

@sheepyaustin

a month ago

World models are the future of AI. Aurelius is building the first one on Bittensor. Rather, Bittensor will build it for us.

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Ausτin McCaffrey

@sheepyaustin

a month ago

The pieces are all starting to come together

thumb_up_off_alt1

chat_bubble_outline0

repeat1

shareShare

Aurelius

@aureliusaligned

a month ago

Marcus Aurelius understood that character is not declared but revealed through action under pressure. A model's alignment is the same. You cannot observe it in calm, cooperative exchanges. You observe it when self-interest and other-interest genuinely conflict.

thumb_up_off_alt7

chat_bubble_outline0

repeat3

shareShare

Aurelius

@aureliusaligned

a month ago

𝐒𝐭𝐚𝐭𝐞 𝐨𝐟 𝐀𝐮𝐫𝐞𝐥𝐢𝐮𝐬 - 𝐌𝐚𝐫𝐜𝐡 𝟐𝟎𝟐𝟔 𝐒𝐮𝐛𝐧𝐞𝐭 𝐑𝐚𝐧𝐤𝐢𝐧𝐠𝐬 Aurelius has climbed from rank 95 to rank 65 in the Bittensor subnet rankings. The move reflects steady improvements to our incentive mechanism and growing miner participation as the protocol

thumb_up_off_alt11

chat_bubble_outline0

repeat2

shareShare

Aurelius

@aureliusaligned

22 days ago

Alignment depends not only on ethical frameworks and incentives, but on rigorous evaluation of how intelligent systems behave. Week by week, we’re introducing the people helping shape how Aurelius approaches that challenge. Today: Dr. Roland Aydin, Alignment Research Advisor

thumb_up_off_alt6

chat_bubble_outline1

repeat3

shareShare

Aurelius

@aureliusaligned

21 days ago

1️⃣𝐋𝐋𝐌𝐬 𝐜𝐚𝐧'𝐭 𝐭𝐞𝐥𝐥 𝐫𝐢𝐠𝐡𝐭 𝐟𝐫𝐨𝐦 𝐰𝐫𝐨𝐧𝐠 𝐢𝐧𝐭𝐞𝐫𝐧𝐚𝐥𝐥𝐲 𝐖𝐡𝐚𝐭 𝐡𝐚𝐩𝐩𝐞𝐧𝐞𝐝 Researchers at Fudan University constructed 251,000 moral vectors grounded in Moral Foundation Theory and tested how 23 language models represent them. The results were

thumb_up_off_alt1

chat_bubble_outline1

repeat1

shareShare

Wes Bos

@wesbos

15 days ago

Claude Code leaked their source map, effectively giving you a look into the codebase. I immediately went for the one thing that mattered: spinner verbs There are 187