Tiago Pimentel (@tpimentelms)'s Twitter Profile
Tiago Pimentel

@tpimentelms

Postdoc at @ETH_en. Formerly, PhD student at @Cambridge_Uni.

ID: 87818714

Link: https://tpimentelms.github.io/
Joined: 05-11-2009 23:50:49

944 Tweets

1.1K Followers

275 Following

Denis Sutter (@denissutte9310)

Thrilled to share that our paper “The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability?” got accepted to NeurIPS 2025 as a Spotlight. 🎉

Julian Minder (@jkminder)

My master's thesis "Understanding the Surfacing of Capabilities in Language Models" has been awarded the ETH Medal 🏅 for Outstanding Thesis. Huge thanks to my supervisors Chris Wendler and Bob West! inf.ethz.ch/news-and-event… Thesis: research-collection.ethz.ch/entities/publi…

Tiago Pimentel (@tpimentelms)

Late to the party, but very happy this paper got accepted to NeurIPS 2025 as a Spotlight! 😁 Main takeaway: Without prior assumptions about how DNNs encode concepts in their representations (e.g., the linear representation hypothesis), we can claim any DNN implements any algorithm

Abhilasha Ravichander (@lasha_nlp)

It is PhD application season again 🍂 For those looking to do a PhD in AI, these are some useful resources 🤖: 1. Examples of statements of purpose (SOPs) for computer science PhD programs: cs-sop.org [1/4]

Bob West (@cervisiarius)

🚨New paper alert! 🚨 Tandem Training for Language Models arxiv.org/abs/2510.13551 Actions & thoughts of AI w/ superhuman skills will be hard for humans to follow, undermining human oversight of AI. We propose a new way to make AI produce human-understandable solutions. How?👉🧵

Marius Mosbach (@mariusmosbach)

If you are thinking a lot about CoT and multi-agent communication these days, check out Michael's work below 👇. And make sure to keep an eye on his work going forward, more great things to come! 👨🏻‍🍳

Julian Minder (@jkminder)

New paper: Finetuning on narrow domains leaves traces behind. By looking at the difference in activations before and after finetuning, we can interpret what it was finetuned for. And so can our interpretability agent! 🧵

Tiago Pimentel (@tpimentelms)

I'm preparing a camera-ready for #neurips2025 and I'm confused about whether Contribution statements are allowed after page 10 🤔 I'd put it next to, e.g., the acknowledgement section. Instructions imply that they are not allowed. Does anyone know for sure? NeurIPS Conference

Tiago Pimentel (@tpimentelms)

Working with Ethan is always amazing! If you're interested in psycholinguistics and language modelling, you should definitely apply :)