Ajeya Cotra (@ajeya_cotra)'s Twitter Profile
Ajeya Cotra

@ajeya_cotra

AI could get really powerful soon and I worry we're underprepared. Funding research on AI capabilities+risks @open_phil (views my own), editor @plannedobs.

ID: 917512572965761024

http://planned-obsolescence.org · Joined 09-10-2017 22:10:18

348 Tweets

7.7K Followers

363 Following

ARC Prize (@arcprize)

We put OpenAI o1 to the test against ARC Prize.

Results: both o1 models beat GPT-4o. And o1-preview is on par with Claude 3.5 Sonnet.

Can chain-of-thought scale to AGI? What explains o1's modest scores on ARC-AGI?

Our notes:
arcprize.org/blog/openai-o1…
Ethan Mollick (@emollick)

In this post, I tried to lay out, in a non-technical way, where we are in the development of AI, the state of the top five AI models available now, & what the near future of AI is going to be. The short answer is that LLMs are going to keep getting better. oneusefulthing.org/p/scaling-the-…

Ajeya Cotra (@ajeya_cotra)

By 2026, will some companies allow AI agents to autonomously spend at least $10K at a time on behalf of the company, without any human review (though possibly with AI review)?

Max Nadeau (@maxnadeau_)

Update: a human baseline now exists! First-try accuracy is 48% on the public eval set: arxiv.org/abs/2409.01374. AI pass@1 is somewhere in the 40s: x.com/mikeknoop/stat… So AI ≈ humans! But humans hit 64% and AI the 50s if given 3 tries. Still: a far cry from humans >> AIs. And scaling trends!
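
For readers unfamiliar with the pass@k numbers above: pass@k is the probability that at least one of k sampled attempts solves a task. Below is a minimal sketch of the standard unbiased estimator from Chen et al. 2021 (arXiv:2107.03374); the example numbers are purely illustrative, not taken from the evaluations Max cites.

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator (Chen et al. 2021, arXiv:2107.03374):
    the probability that at least one of k samples, drawn without
    replacement from n total attempts of which c are correct,
    solves the task."""
    if n - c < k:
        # Fewer incorrect attempts than samples drawn, so every
        # possible draw must include at least one correct attempt.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# Illustrative numbers only: a task solved in 5 of 10 attempts.
print(pass_at_k(10, 5, 1))            # 0.5
print(round(pass_at_k(10, 5, 3), 3))  # 0.917 -- extra tries help a lot
```

This is why "given 3 tries" scores run well above pass@1 for both humans and models: even a modest per-attempt success rate compounds quickly across independent attempts.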

Ethan Mollick (@emollick)

After spending a decent amount of time with o1-preview, I would be very surprised if it is not able to do economically valuable analytical work inside large companies. The main issue is that prompting it remains really weird. But a real R&D effort inside firms might crack that.

Ajeya Cotra (@ajeya_cotra)

We'll see some signs of deceptive capabilities before it's unrecoverable (I'm excited about research on this e.g. anthropic.com/research/rewar…) but "sure we'll see it coming from far away and have plenty of time to stop it" is overconfident IMO.

Garrison Lovely (@garrisonlovely)

OpenAI whistleblower William Saunders is testifying before a Senate subcommittee today (so are Helen Toner and Margaret Mitchell). His written testimony is online now. Here are the most important parts 🧵

Ethan Perez (@ethanjperez)

I’m taking applications for collaborators via ML Alignment & Theory Scholars! It’s a great way for new or experienced researchers outside AI safety research labs to work with me/others in these groups: Neel Nanda, Evan Hubinger, mrinank 💗 enjoying solitude, Nina, Fabien Roger, Rylan Schaeffer, ...🧵