Harry Mayne (@harrymayne5) Twitter Tweets • TwiCopy

Harry Mayne

@harrymayne5

+ Follow

Mech interp, explainability and evals @oiioxford @uniofoxford. PhD student | Prev @Cambridge_Uni

ID: 1369046889442783235

linkhttps://www.harrymayne.com calendar_today08-03-2021 22:06:38

135 Tweet

196 Takipçi

793 Takip Edilen

good girl

@goodgirlxsz

5 hours ago

🔥Telegram İfşa

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Harry Mayne

@harrymayne5

3 months ago

Enjoyed speaking with Transformer about the state of AI safety evals. Important issue that I’m glad to see highlighted.

thumb_up_off_alt11

chat_bubble_outline0

repeat1

shareShare

🚨🚨Introducing the FLAIR internship program!🚨🚨 We are looking for two talented students to join us for an internship working in FLAIR for 6 months (5th January to 4th July 2026)! For details and eligibility criteria, please check: foersterlab.com/internship/

thumb_up_off_alt120

chat_bubble_outline2

repeat20

shareShare

Harry Mayne

@harrymayne5

2 months ago

LLMs can attempt to explain their own decisions, “self-explanations”, but can we trust them? Our new paper, "LLMs Don’t Know Their Own Decision Boundaries", tests this for counterfactual explanations. arxiv.org/pdf/2509.09396 (EMNLP 2025)

thumb_up_off_alt12

chat_bubble_outline2

repeat5

shareShare

Harry Mayne

good girl

Harry Mayne

Foerster Lab for AI Research

Harry Mayne