Neel Nanda (@neelnanda5)'s Twitter Profile
Neel Nanda

@neelnanda5

Mechanistic Interpretability lead DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!

ID: 1542528075128348674

Link: http://neelnanda.io
Joined: 30-06-2022 15:18:58

4.4K Tweets

25.25K Followers

117 Following

Obvious caveat: These are MY takes, and I'm sure other researchers disagree on points, or think I'm missing a crucial paper. But I think it's more useful this way. V1 is one of my most popular blog posts, so I hope v2 is useful to people! alignmentforum.org/posts/NfFST5Mi…