Jam Kraprayoon (@jkraprayoon) Twitter Tweets • TwiCopy

Jam Kraprayoon

@jkraprayoon

+ Follow

Researcher @iapsAI Fellow @scientistsorg Oxford/LSE. AI governance and policy. Fmr international civil servant. Also poet.

ID: 1014864640839307265

calendar_today05-07-2018 13:32:40

332 Tweet

223 Followers

1,1K Following

Palisade Research

@palisadeai

a year ago

⛳️ Our new LLM Agent achieved 95% success on InterCode-CTF, a high-school level hacking benchmark, using simple prompting techniques. 🚀 This surpasses prior work by a large margin:

thumb_up_off_alt64

chat_bubble_outline3

repeat9

shareShare

After 11 months of work, we proudly announce Third Opinion: A free of charge expert consultation service for frontier AI professionals. To help you clarify if what you're seeing is cause for concern. Anonymous, without sharing confidential information 🧵 tinyurl.com/2rbk2w59

thumb_up_off_alt75

chat_bubble_outline2

repeat22

shareShare

Tamay Besiroglu

@tamaybes

a year ago

1/11 I’m genuinely impressed by OpenAI’s 25.2% Pass@1 performance on FrontierMath—this marks a major leap from prior results and arrives about a year ahead of my median expectations.

thumb_up_off_alt1,1K

chat_bubble_outline29

repeat152

shareShare

Elliot Glazer

@elliotglazer

a year ago

1/12 FrontierMath’s three-part rating—Background (1–5), Creativity (hours of insight), and Execution (solution time)—lets us precisely gauge problem difficulty. These ratings help provide context on o3’s benchmark results.

thumb_up_off_alt163

chat_bubble_outline3

repeat18

shareShare

David Lawrence

@dc_lawrence

a year ago

3. Building AI assurance in the UK: Jam Kraprayoon and Bill Anderson-Samways explain how the UK can lead in AI safety and AI opportunities at the same time by becoming a global leader in AI testing and assurance. ukdayone.org/briefings/assu…

3. Building AI assurance in the UK: <a href="/JKraprayoon/">Jam Kraprayoon</a> and <a href="/BillSamways/">Bill Anderson-Samways</a> explain how the UK can lead in AI safety and AI opportunities at the same time by becoming a global leader in AI testing and assurance.

ukdayone.org/briefings/assu…

thumb_up_off_alt3

chat_bubble_outline1

repeat1

shareShare

Alejandro Cuadron

@alex_cuadron

a year ago

Surprising find: OpenAI's O1 - reasoning-high only hit 30% on SWE-Bench Verified - far below their 48.9% claim. Even more interesting: Claude achieves 53% in the same framework. Something's off with O1's "enhanced reasoning"... 🧵1/8

thumb_up_off_alt1,1K

chat_bubble_outline54

repeat154

shareShare

Julia Garayo Willemyns

@jujulemons

a year ago

The report also supports of the work done by David Lawrence & Elizabeth A. Seger advocating for more compute capacity, and the work done by Jam Kraprayoon & Bill Anderson-Samways making the case for an AI assurance market in the UK.

The report also supports of the work done by <a href="/dc_lawrence/">David Lawrence</a> & <a href="/ea_seger/">Elizabeth A. Seger</a> advocating for more compute capacity, and the work done by <a href="/JKraprayoon/">Jam Kraprayoon</a> & <a href="/BillSamways/">Bill Anderson-Samways</a> making the case for an AI assurance market in the UK.

thumb_up_off_alt4

chat_bubble_outline1

repeat3

shareShare

Dr. Chinasa T. Okolo

@chinasatokolo

a year ago

For our next piece within the “AI Safety and the Global Majority” series, Shaun K.E. Ee and Jam Kraprayoon discuss the growing ecosystem of AI safety in Southeast Asia and opportunities to strengthen AI governance throughout the region. Brookings Governance brookings.edu/articles/ai-sa…

thumb_up_off_alt6

chat_bubble_outline0

repeat2

shareShare

Institute for AI Policy and Strategy (IAPS)

@iapsai

a year ago

AI “agents”—systems that can autonomously pursue goals—are advancing fast. If current trends continue, we could soon see millions of agents deployed across society. Are we ready? Here’s what you need to know, from a report from Jam Kraprayoon, Zoe Wiliams, and Rida Fayyaz. 👇

thumb_up_off_alt27

chat_bubble_outline1

repeat7

shareShare

Joe O'Brien

@__j0e___

a year ago

I explore this question with Jeremy Dolan, Jay Kim, Jonah, Jeba Sania, Sebastian Becker, Jam Kraprayoon, and R. Cara Labrador, in our new report, available here: iaps.ai/research/ai-re…

thumb_up_off_alt6

chat_bubble_outline1

repeat2

shareShare

Joe O'Brien

@__j0e___

a year ago

Surprise finding: Every single multi-agent research area ranked in the top 30. Experts see multi-agent systems as a critical, underexplored frontier for AI risks.

thumb_up_off_alt6

chat_bubble_outline1

repeat1

shareShare

Jack Cooper

@jackcooper0696

10 months ago

Very taken by Jam Kraprayoon's work in the Brotherton Poetry Prize Anthology III (Jam Kraprayoon)!

Very taken by Jam Kraprayoon's work in the Brotherton Poetry Prize Anthology III (<a href="/JKraprayoon/">Jam Kraprayoon</a>)!

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare

Peter Wildeford 🇺🇸🚀

@peterwildeford

10 months ago

The US must invest in AI assurance + security tech to stay competitive. Institute for AI Policy and Strategy (IAPS) 's memo with FAS is now @scientistsorg outlines 3 critical gaps (emergent behaviors, infra security, autonomous agents) + 3 solutions (coordinated R&D strategy, public-private consortium, frontier fellowships)

thumb_up_off_alt7

chat_bubble_outline1

repeat3

shareShare

Jam Kraprayoon

Palisade Research

OAISIS

Tamay Besiroglu

Elliot Glazer

David Lawrence

Alejandro Cuadron

Julia Garayo Willemyns

Dr. Chinasa T. Okolo

Institute for AI Policy and Strategy (IAPS)

Joe O'Brien

Joe O'Brien

Jack Cooper

Peter Wildeford 🇺🇸🚀