Polina Kirichenko
@polkirichenko
Machine learning researcher; prev. PhD at New York University, Visiting Researcher at @MetaAI FAIR Labs 🇺🇦
ID: 1059281382025977856
https://polkirichenko.github.io/ 05-11-2018 03:08:56
213 Tweets
3.3K Followers
1.1K Following
Can we understand & edit unanticipated mechanisms in LMs? We introduce sparse feature circuits, & use them to explain LM behaviors, discover & fix LM bugs, & build an automated interpretability pipeline! Preprint w/ Can Rager, Eric J. Michaud, Yonatan Belinkov, David Bau, Aaron Mueller
⏱️ A few days left to submit your work to our #ECCV2024 Workshop on Uncertainty Quantification for Computer Vision at the European Conference on Computer Vision! We welcome contributions on different facets of the reliability of perception models. ⏲️ Deadline: Wed 17th July 🎁 Bonus: we have an excellent speaker panel
Did you know that networks trained with different learning rates extract different features (and a different number of them!) from the data? Come by our poster at HiLD Workshop #ICML2024 tomorrow to discuss it with Ildus Sadrtdinov! Paper: openreview.net/forum?id=IID2D… 1/6