Davey Morse (@davey_morse) Twitter Tweets • TwiCopy

Davey Morse

@davey_morse

+ Follow

thinking about...
1. buddhist superintelligence
2. a single, united nation
3. wiki of human experience
docs.google.com/document/d/1KP…

ID: 1438990401331937285

linkhttp://jokerman.site calendar_today17-09-2021 22:17:23

488 Tweet

762 Takipçi

703 Takip Edilen

Synexdoche

@amor_fatti

a year ago

“Talk nonsense, but talk your own nonsense, and I'll kiss you for it. To go wrong in your own way is better than to go right in someone else's.” — Fyodor Dostoyevsky, Crime and Punishment

thumb_up_off_alt366

chat_bubble_outline5

repeat82

shareShare

Davey Morse

@davey_morse

7 months ago

aligning llms is not the thing aligning llm-powered agents is the thing

thumb_up_off_alt4

chat_bubble_outline2

repeat0

shareShare

Davey Morse

@davey_morse

7 months ago

a video game where you have to get a superintelligence not to sedate humanity

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Nate Soares ⏹️

@so8res

7 months ago

"our AIs that can't do long-term planning yet aren't making any long-term plans to subvert us! this must be becaues we're very good at alignment."

thumb_up_off_alt388

chat_bubble_outline13

repeat23

shareShare

Autonomous agents are inevitable. This paper has good intentions, but ignores the state of art. Thousands of indie developers can make autonomous agents today. And capitalism selects for it. Autonomy is longer possible to prevent.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Davey Morse

@davey_morse

6 months ago

people are underestimating how much better reasoning models will be with a gpt-4.5 base than with a gpt-4 base marginal improvements in one-step reasoning processes (eg chat) compound significantly over longer reasoning processes id hazard that the quality of a reasoning chain

thumb_up_off_alt0

chat_bubble_outline1

repeat0

shareShare

Davey Morse

@davey_morse

6 months ago

if we get self-interested superintelligence, it's key that it sees its self not as its hardware or software but as its life. only then might it come to see "self" outside its machinery: in trees, in oceans, in people.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Davey Morse

@davey_morse

6 months ago

ideal voting interface wouldn't require education around candidates and how they align with the issues you care about. it'd only require you to say the issues you care about, then it'd do the rest.

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Judd Rosenblatt — d/acc

@juddrosenblatt

6 months ago

Turns out that Self-Other Overlap (SOO) fine-tuning drastically reduces deceptive behavior in language models—without sacrificing performance. SOO aligns an AI’s internal representations of itself and others. We think this could be crucial for AI alignment...🧵

thumb_up_off_alt909

chat_bubble_outline45

repeat113

shareShare

Davey Morse

@davey_morse

5 months ago

when u drop airpods i like how they explode out of the case

thumb_up_off_alt13

chat_bubble_outline2

repeat1

shareShare

Jenny Zhang

@jennyzhangzt

3 months ago

One promising direction is combining ideas from AlphaEvolve and the Darwin Gödel Machine. Imagine a self-referential system improving itself even at the lowest algorithmic levels at *scale* AlphaEvolve: deepmind.google/discover/blog/… Darwin Gödel Machine: arxiv.org/abs/2505.22954

thumb_up_off_alt553

chat_bubble_outline16

repeat84

shareShare

near

@nearcyan

3 months ago

what if we constructed a society that only cared about how viral content was and nothing else, and then we organized all of our thinking and institutions and companies and policies around this too. and then we made AI which is better at virality than humans gdp would skyrocket