Curt Tigges
@curttigges
Science lead at @decode_research (parent org of @neuronpedia)
ID: 2815835304
http://curttigges.com 17-09-2014 23:38:03
4,4K Tweet
1,1K Followers
828 Following
Why is interpretability the key to dominance in AI? Not winning the scaling race, or banning China. Our answer to OSTP/NSF, w/ Goodfire's Tom McGrath Transluce's Sarah Schwettmann MIT's Dylan HadfieldMenell resilience.baulab.info/docs/AI_Action… Here's why:🧵 ↘️
🚀 New stuff on Neuronpedia - 🎙️ Podcast: The Babble by NotebookLM - 🔍 TopK Search by Token: a new way to search + API - 🪆 Matryoshka SAEs by David Chanin - 🔬 Probity, a probing library by Curt Tigges - 🧠 New Auto-Interp Models Demos +examples in thread➡️
Danielle Fong 🔆 Reinforcement Learning with Wormtongue Feedback