ellie (@ellie__hain)'s Twitter Profile
ellie

@ellie__hain

part time mystic, part time girlboss @meaningaligned

ID: 818178866

Joined: 11-09-2012 20:59:44

668 Tweets

1.1K Followers

556 Following

ellie (@ellie__hain)'s Twitter Profile Photo

"The time for internet preaching and demagoguery has concluded. Your job, and mine, is to seize to the ground." This has also been one of my insights this past year, and it's great to see such a good recap of why that matters

Joe Edelman (@edelwax)'s Twitter Profile Photo

“What are human values, and how do we align to them?”

Very excited to release our new paper on values alignment, co-authored with Ryan Lowe and funded by @openai.

📝: meaningalignment.org/values-and-ali…
Joe Edelman (@edelwax)'s Twitter Profile Photo

Our approach, MGE, outperforms alternatives like CCAI by @anthropic on legitimacy in a case study, and offers robustness against ideological rhetoric.

89% agree the winning values were fair, even if their own value didn't win!
Joe Edelman (@edelwax)'s Twitter Profile Photo

We reconcile value conflicts by asking which values participants think are wiser than others within a context.

This lets us build an alignment target we call a "moral graph".

It surfaces the wisest values of a large population, without relying on an ultimate moral theory.
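
To make this concrete, here's a minimal sketch of how a moral graph could be represented in code. It is an illustration only, not the paper's actual data model: the class, method, context, and value names below are invented for this example. Values are nodes; a directed judgement records that, within a context, participants found one value wiser than another; the "wisest" values are then those with no recorded wiser alternative.

```python
from collections import defaultdict

class MoralGraph:
    """Toy moral graph (illustrative; not the paper's data model).

    Nodes are values. A judgement (context, less_wise -> wiser)
    records that participants found `wiser` the wiser value to
    attend to within `context`.
    """

    def __init__(self):
        # context -> value -> set of values judged wiser than it
        self.wiser_than = defaultdict(lambda: defaultdict(set))
        self.values = set()

    def record_judgement(self, context: str, less_wise: str, wiser: str):
        """Store one participant judgement."""
        self.values.update((less_wise, wiser))
        self.wiser_than[context][less_wise].add(wiser)

    def wisest(self, context: str) -> set[str]:
        """Values with no recorded wiser alternative in this context
        (sinks of the wiser-than relation). Caveat: a value never
        compared at all also counts as un-superseded here."""
        superseded = set(self.wiser_than[context])
        return self.values - superseded

# Hypothetical usage with made-up values and context:
g = MoralGraph()
ctx = "advising someone in crisis"
g.record_judgement(ctx, less_wise="strict neutrality", wiser="attentive care")
g.record_judgement(ctx, less_wise="attentive care",
                   wiser="helping them find their own footing")
print(g.wisest(ctx))  # -> {'helping them find their own footing'}
```

In the actual method, such judgements are aggregated across a large population; the toy `wisest` query above only captures the graph-theoretic core of the idea, surfacing values that no participant judged to be superseded within a given context.
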
timour kosters (@timourxyz)'s Twitter Profile Photo

Super interesting work from Meaning Alignment Institute. This could help achieve two hard things:

1) Help humans understand their values, and at the same time
2) Help align AI closer to those human values.

An important step towards more human flourishing.

ellie (@ellie__hain)'s Twitter Profile Photo

Super proud of the work my team has done here 🔥🔥

✅ Defines values concretely and practically
✅ Creates a new method for surfacing wise values in a non-ideological way
✅ Creates a new alignment target
✅ First step towards Wise AI

ellie (@ellie__hain)'s Twitter Profile Photo

Excited for this! I will also be at Esmeralda for the first week and will be hosting a meaning exploration session at Spirit of Toby Shorin's pop-up clinic 💫

xuan (ɕɥɛn / sh-yen) (@xuanalogue)'s Twitter Profile Photo

Should AI be aligned with human preferences, rewards, or utility functions?

Excited to finally share a preprint that Micah Carroll, Matija, Hal Ashton & I have worked on for almost 2 years, arguing that AI alignment has to move beyond the preference-reward-utility nexus!

Combinations (@combinationsmag)'s Twitter Profile Photo

combinationsmag.com is live! ✨

A publication by RadicalxChange, exploring new ideas about economics, democracy, and the relationship between technology and power.

If our societies are irreversibly technological, we must steer technology itself towards the common good, rather…

Ryan Lowe (@ryan_t_lowe)'s Twitter Profile Photo

Introducing: Full-Stack Alignment 🥞

A research program dedicated to co-aligning AI systems *and* institutions with what people value.

It's the most ambitious project I've ever undertaken.

Here's what we're doing: 🧵