Ryan Othniel Kearns (@ryanothkearns)'s Twitter Profile

Cosmic Sociology & Psychohistory @OIIOxford; @Cosmos_Inst Fellow, Laboratory for Human-Centered AI, @EthicsInAI, @UniOfOxford

ID: 1051194447138971648

Link: https://www.ryanothnielkearns.com
Joined: 13-10-2018 19:34:21

181 Tweets

158 Followers

427 Following

Jude Khouja (@mkhooja)

Are reasoning models better at reasoning or memorization? Our new reasoning benchmark L2 tests R1, o1, o3-mini, GPT 4.5 and Claude 3.7 Sonnet. None of the models performs well but Claude 3.7 Sonnet outperforms o1-preview and R1 by a larger margin!

Harry Mayne (@harrymayne5)

📣 Introducing LingOly-TOO: A benchmark to separate reasoning from memorisation 📣 Reasoning evals should measure reasoning—and ONLY reasoning! However, current evals are confounded by: 1️⃣ Requiring specific world knowledge 2️⃣ Prior data exposure LingOly-TOO addresses this! 🧵

Ben Walker (@ml_benwalker)

Don’t be Dense, SLiCE the Cost! 🪓💸 Structured Linear CDEs (SLiCEs) are expressive, efficient, and parallel-in-time sequence models. - ✅ Maximal expressivity without the cost of dense matrices - ⚡ 20× faster per-step training compared to non-linear NCDEs on real-world

Trent Fowler (@trent_stempunk)

I'm attending a Cosmos Institute seminar on the intersection of AI, human agency, and collective intelligence, a topic very dear to my heart. Here are some thoughts.

Joe Edelman (@edelwax)

Such an ambitious project could only be done with the best researchers in the world. Thankfully, the most amazing team has already assembled. Join us! Read the paper at full-stack-alignment.ai/paper

Ryan Othniel Kearns (@ryanothkearns)

It was terrifically energising to work on this position paper. Floored by the ambition and optimism coming out of the Meaning Alignment Institute team and by the talented cadre they have assembled for this problem. Kudos Ryan Lowe 🥞 @ICML Joe Edelman 🥞 @ICML2025 Oliver Klingefjord 🥞, now the real work begins :)

Ryan Othniel Kearns (@ryanothkearns)

"What are markets, AI systems, and democratic institutions really for? They are not ends in themselves. We want markets to coordinate human needs and resources. We want democratic institutions to enable collective self-governance. Presumably, we want AI systems to augment human

The Institute for Ethics in AI (@ethicsinai)

New from Professor Philipp Koralus (@oxfordhailab) at The Institute for Ethics in AI: AI that nudges us may quietly undermine autonomy. He proposes a Socratic design for AI – supporting truth-seeking, not steering choices.