Sarah Cogan (@sarah_cogan)'s Twitter Profile
Sarah Cogan

@sarah_cogan

existential risks are bad. I’m tall. SWE @GoogleDeepMind Frontier Safety

ID: 708098492770680832

Joined: 11-03-2016 01:13:31

457 Tweets

374 Followers

310 Following

Frances Lorenz (@frances__lorenz)'s Twitter Profile Photo

Remember when AI researchers used to be like, "this is a neural network, isn't it cool? It can say DOG or CAT if you show it a DOG or a CAT :)" and now they're like, "we're gonna make God before China does" okay

Tom Lieberum 🔸 (@lieberum_t)'s Twitter Profile Photo

Are you worried about risks from AGI and want to mitigate them? Come work with me and my colleagues! We're hiring on the AGI Safety & Alignment team (ASAT) and the Gemini Safety team! Research Engineers: boards.greenhouse.io/deepmind/jobs/… Research Scientists: boards.greenhouse.io/deepmind/jobs/…

Victoria Krakovna (@vkrakovna)'s Twitter Profile Photo

We are excited to release a short course on AGI safety! The course offers a concise and accessible introduction to AI alignment problems and our technical & governance approaches, consisting of short recorded talks and exercises (75 minutes total). deepmindsafetyresearch.medium.com/1072adb7912c

Rohin Shah (@rohinmshah)'s Twitter Profile Photo

We're hiring! Join an elite team that sets an AGI safety approach for all of Google -- both through development and implementation of the Frontier Safety Framework (FSF), and through research that enables a future stronger FSF.

Rohin Shah (@rohinmshah)'s Twitter Profile Photo

Just released GDM’s 100+ page approach to AGI safety & security! (Don’t worry, there’s a 10 page summary.) AGI will be transformative. It enables massive benefits, but could also pose risks. Responsible development means proactively preparing for severe harms before they arise.

Anca Dragan (@ancadianadragan)'s Twitter Profile Photo

Per our Frontier Safety Framework, we continue to test our models for critical capabilities. Here’s the updated model card for Gemini 2.5 Pro with frontier safety evaluations + explanation of how our safety buffer / alert thresholds approach applies to 2.0, 2.5, and what’s coming.

meatball times (@meatballtimes)'s Twitter Profile Photo

has anyone stopped to ask WHY students cheat? would a buddhist monk "cheat" at meditation? would an artist "cheat" at painting? no. when process and outcomes are aligned, there's no incentive to cheat. so what's happening differently at colleges? the answer is in the article:
